Yaya Shi | DeepAI

Featured Co-authors

Wei Wang (492 publications)
Fei Huang (134 publications)
Jingren Zhou (90 publications)
Ming Yan (79 publications)
Ji Zhang (74 publications)
Chenliang Li (55 publications)
Songfang Huang (45 publications)
Haiyang Xu (31 publications)
Bing Li (30 publications)
Weiming Hu (30 publications)
Xuan Wu (26 publications)

research

06/07/2023

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks

To promote the development of Vision-Language Pre-training (VLP) and mul...

Haiyang Xu, et al.

research

04/27/2023

mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality

Large language models (LLMs) have demonstrated impressive zero-shot abil...

Qinghao Ye, et al.

research

02/01/2023

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video

Recent years have witnessed a big convergence of language, vision, and m...

Haiyang Xu, et al.

research

10/13/2019

VATEX Captioning Challenge 2019: Multi-modal Information Fusion and Multi-stage Training Strategy for Video Captioning

Multi-modal information is essential to describe what has happened in a ...

Ziqi Zhang, et al.