Shuai Bai

research

∙ 08/31/2023

TouchStone: Evaluating Vision-Language Models by Language Models

Large vision-language models (LVLMs) have recently witnessed rapid advan...

0 Shuai Bai, et al. ∙

research

∙ 08/24/2023

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities

We introduce the Qwen-VL series, a set of large-scale vision-language mo...

0 Jinze Bai, et al. ∙

research

∙ 05/18/2023

ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

In this work, we explore a scalable way for building a general represent...

0 Peng Wang, et al. ∙

research

∙ 12/08/2022

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models

Generalist models, which are capable of performing diverse multi-modal t...

0 Jinze Bai, et al. ∙

research

∙ 12/06/2022

Pretrained Diffusion Models for Unified Human Motion Synthesis

Generative modeling of human motion has broad applications in computer a...

0 Jianxin Ma, et al. ∙

research

∙ 07/19/2022

Single Stage Virtual Try-on via Deformable Attention Flows

Virtual try-on aims to generate a photo-realistic fitting result given a...

0 Shuai Bai, et al. ∙

research

∙ 05/24/2022

M6-Fashion: High-Fidelity Multi-modal Image Generation and Editing

The fashion industry has diverse applications in multi-modal image gener...

0 Zhikang Li, et al. ∙

research

∙ 02/07/2022

Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

In this work, we pursue a unified paradigm for multimodal pretraining to...

5 Peng Wang, et al. ∙

research

∙ 05/31/2021

Connecting Language and Vision for Natural Language-Based Vehicle Retrieval

Vehicle search is one basic task for the efficient traffic management in...

20 Shuai Bai, et al. ∙

research

∙ 03/30/2021

Dense Relation Distillation with Context-aware Aggregation for Few-Shot Object Detection

Conventional deep learning based methods for object detection require a ...

0 Hanzhe Hu, et al. ∙

research

∙ 07/19/2020

Class-wise Dynamic Graph Convolution for Semantic Segmentation

Recent works have made great progress in semantic segmentation by exploi...

13 Hanzhe Hu, et al. ∙

research

∙ 11/26/2018

Multi-hierarchical Independent Correlation Filters for Visual Tracking

For visual tracking, most of the traditional correlation filters (CF) ba...

2 Shuai Bai, et al. ∙

Shuai Bai

Featured Co-authors

Sign in with Google

Consider DeepAI Pro