Weichong Yin


Featured Co-authors

Yu Sun (127 publications)
Han Zhang (115 publications)
Hua Wu (104 publications)
Haifeng Wang (101 publications)
Bin Luo (67 publications)
Zhenyu Zhang (47 publications)
Hao Tian (46 publications)
Yin Zhang (31 publications)
Dianhai Yu (30 publications)
Wenjin Wang (27 publications)
Shikun Feng (25 publications)

research

∙ 11/09/2022

ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Recent cross-lingual cross-modal works attempt to extend Vision-Language...

Bin Shan, et al.

research

∙ 10/27/2022

ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts

Recent progress in diffusion models has revolutionized the popular techn...

Zhida Feng, et al.

research

∙ 10/12/2022

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding

Recent years have witnessed the rise and success of pre-training techniq...

Qiming Peng, et al.

research

∙ 09/30/2022

ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training

Recent Vision-Language Pre-trained (VLP) models based on dual encoder ha...

Bin Shan, et al.

research

∙ 09/18/2022

ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding

Recent efforts of multimodal Transformers have improved Visually Rich Do...

Wenjin Wang, et al.

research

∙ 12/31/2021

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Conventional methods for the image-text generation tasks mainly tackle t...

Han Zhang, et al.

research

∙ 06/30/2020

ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph

We propose a knowledge-enhanced approach, ERNIE-ViL, to learn joint repr...

Fei Yu, et al.


