Weicheng Kuo

research

∙ 09/02/2023

Contrastive Feature Masking Open-Vocabulary Vision Transformer

We present Contrastive Feature Masking Vision Transformer (CFM-ViT) - an...

0 Dahun Kim, et al. ∙

research

∙ 06/02/2023

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

Observing the close relationship among panoptic, semantic and instance s...

0 Xiuye Gu, et al. ∙

research

∙ 05/11/2023

Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers

We present Region-aware Open-vocabulary Vision Transformers (RO-ViT) - a...

0 Dahun Kim, et al. ∙

research

∙ 04/12/2023

RECLIP: Resource-efficient CLIP by Training with Small Images

We present RECLIP (Resource-efficient CLIP), a simple method that minimi...

0 Runze Li, et al. ∙

research

∙ 03/29/2023

MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks

The development of language models has moved from encoder-decoder to de...

0 Weicheng Kuo, et al. ∙

research

∙ 12/06/2022

Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning

We present a simple approach which can turn a ViT encoder into an effici...

0 AJ Piergiovanni, et al. ∙

research

∙ 09/30/2022

F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models

We present F-VLM, a simple open-vocabulary object detection method built...

0 Weicheng Kuo, et al. ∙

research

∙ 09/14/2022

PaLI: A Jointly-Scaled Multilingual Language-Image Model

Effective scaling and a flexible task interface enable large language mo...

6 Xi Chen, et al. ∙

research

∙ 09/09/2022

Pre-training image-language transformers for open-vocabulary tasks

We present a pre-training approach for vision and language transformer m...

0 AJ Piergiovanni, et al. ∙

research

∙ 08/01/2022

Video Question Answering with Iterative Video-Text Co-Tokenization

Video question answering is a challenging task that requires understandi...

0 AJ Piergiovanni, et al. ∙

research

∙ 05/02/2022

Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering

We present Answer-Me, a task-aware multi-task framework which unifies a ...

0 AJ Piergiovanni, et al. ∙

research

∙ 03/31/2022

FindIt: Generalized Localization with Natural Language Queries

We propose FindIt, a simple and versatile framework that unifies a varie...

0 Weicheng Kuo, et al. ∙

research

∙ 08/20/2021

Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image

3D perception of object shapes from RGB image input is fundamental towar...

11 Weicheng Kuo, et al. ∙

research

∙ 08/15/2021

Learning Open-World Object Proposals without Learning to Classify

Object proposals have become an integral preprocessing step of many vis...

11 Dahun Kim, et al. ∙

research

∙ 05/03/2021

Noisy Student learning for cross-institution brain hemorrhage detection

Computed tomography (CT) is the imaging modality used in the diagnosis o...

0 Emily Lin, et al. ∙

research

∙ 04/28/2021

Zero-Shot Detection via Vision and Language Knowledge Distillation

Zero-shot image classification has made promising progress by training t...

17 Xiuye Gu, et al. ∙

research

∙ 07/26/2020

Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve

Object recognition has seen significant progress in the image domain, wi...

0 Weicheng Kuo, et al. ∙

research

∙ 04/05/2019

ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors

Instance segmentation aims to detect and segment individual objects in a...

8 Weicheng Kuo, et al. ∙

research

∙ 09/08/2018

Cost-Sensitive Active Learning for Intracranial Hemorrhage Detection

Deep learning for clinical applications is subject to stringent performa...

0 Weicheng Kuo, et al. ∙

research

∙ 06/08/2018

PatchFCN for Intracranial Hemorrhage Detection

This paper studies the problem of detecting acute intracranial hemorrhag...

2 Weicheng Kuo, et al. ∙

research

∙ 05/08/2015

DeepBox: Learning Objectness with Convolutional Networks

Existing object proposal approaches use primarily bottom-up cues to rank...

0 Weicheng Kuo, et al. ∙
