Zhecan Wang

research

∙ 07/03/2023

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

Vision-language tasks, such as VQA, SNLI-VE, and VCR are challenging bec...

0 Rui Sun, et al. ∙

research

∙ 03/23/2023

CoBIT: A Contrastive Bi-directional Image-Text Generation Model

The field of vision and language has witnessed a proliferation of pre-tr...

0 Haoxuan You, et al. ∙

research

∙ 12/14/2022

Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding

From a visual scene containing multiple people, human is able to disting...

0 Haoxuan You, et al. ∙

research

∙ 11/10/2022

Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense

Visual commonsense understanding requires Vision Language (VL) models to...

0 Zhecan Wang, et al. ∙

research

∙ 04/22/2022

Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks

Cross-modal encoders for vision-language (VL) tasks are often pretrained...

3 Zhecan Wang, et al. ∙

research

∙ 01/15/2022

CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks

Contrastive language-image pretraining (CLIP) links vision and language ...

15 Zhecan Wang, et al. ∙

research

∙ 12/16/2021

SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning

Answering complex questions about images is an ambitious goal for machin...

4 Zhecan Wang, et al. ∙

research

∙ 06/08/2021

Graph-MLP: Node Classification without Message Passing in Graph

Graph Neural Network (GNN) has been demonstrated its effectiveness in de...

0 Yang Hu, et al. ∙

research

∙ 10/24/2020

Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions

Pre-trained contextual vision-and-language (V L) models have brought i...

0 Liunian Harold Li, et al. ∙

research

∙ 06/17/2020

Learning Visual Commonsense for Robust Scene Graph Generation

Scene graph generation models understand the scene through object and pr...

0 Alireza Zareian, et al. ∙

research

∙ 04/07/2020

Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild

Unconstrained remote gaze estimation remains challenging mostly due to i...

4 Zhecan Wang, et al. ∙

Zhecan Wang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro