Zhuowan Li


Featured Co-authors

Alan Yuille
192 publications
Xiaogang Wang
191 publications
Hongsheng Li
185 publications
Benjamin Van Durme
102 publications
Zhe Lin
100 publications
Adam Kortylewski
49 publications
Cihang Xie
44 publications
Shuai Yi
39 publications
Yixiao Ge
38 publications
Yingwei Li
20 publications
Long Mai
20 publications

research

∙ 12/01/2022

Localization vs. Semantics: How Can Language Benefit Visual Representation Learning?

Despite the superior performance brought by vision-and-language pretrain...

Zhuowan Li, et al.

research

∙ 12/01/2022

Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning

Visual Question Answering (VQA) models often perform poorly on out-of-di...

Zhuowan Li, et al.

research

∙ 05/04/2022

Visual Commonsense in Pretrained Unimodal and Multimodal Models

Our commonsense knowledge about objects includes their typical visual at...

Chenyu Zhang, et al.

research

∙ 04/05/2022

SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering

While Visual Question Answering (VQA) has progressed rapidly, previous w...

Vipul Gupta, et al.

research

∙ 10/01/2021

Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images

While neural symbolic methods demonstrate impressive performance in visu...

Zhuowan Li, et al.

research

∙ 04/07/2020

Context-Aware Group Captioning via Self-Attention and Contrastive Features

While image captioning has progressed rapidly, existing works focus main...

Zhuowan Li, et al.

research

∙ 10/06/2018

FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification

Person re-identification (reID) is an important task that requires to re...

Yixiao Ge, et al.
