b'Junnan Li'

research

∙ 05/31/2023

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

Code intelligence plays a key role in transforming modern software engin...

0 Nghi D. Q. Bui, et al. ∙

research

∙ 05/24/2023

BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing

Subject-driven text-to-image generation models create novel renditions o...

0 Dongxu Li, et al. ∙

research

∙ 05/13/2023

CodeT5+: Open Code Large Language Models for Code Understanding and Generation

Large language models (LLMs) pretrained on vast source code have achieve...

0 Hung Le, et al. ∙

research

∙ 05/11/2023

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

General-purpose language models that can solve various language-domain t...

0 Wenliang Dai, et al. ∙

research

∙ 01/30/2023

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

The cost of vision-and-language pre-training has become increasingly pro...

0 Junnan Li, et al. ∙

research

∙ 12/21/2022

From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models

Large language models (LLMs) have demonstrated excellent zero-shot gener...

1 Jiaxian Guo, et al. ∙

research

∙ 12/06/2022

Tackling Data Heterogeneity in Federated Learning with Class Prototypes

Data heterogeneity across clients in federated learning (FL) settings is...

0 Yutong Dai, et al. ∙

research

∙ 11/29/2022

BotSIM: An End-to-End Bot Simulation Toolkit for Commercial Task-Oriented Dialog Systems

We introduce BotSIM, a modular, open-source Bot SIMulation environment w...

0 Guangsen Wang, et al. ∙

research

∙ 10/17/2022

Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training

Visual question answering (VQA) is a hallmark of vision and language rea...

0 Anthony Meng Huat Tiong, et al. ∙

research

∙ 09/15/2022

LAVIS: A Library for Language-Vision Intelligence

We introduce LAVIS, an open-source deep learning library for LAnguage-VI...

0 Dongxu Li, et al. ∙

research

∙ 06/07/2022

Masked Unsupervised Self-training for Zero-shot Image Classification

State-of-the-art computer vision models are mostly trained with supervis...

0 Junnan Li, et al. ∙

research

∙ 01/28/2022

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Vision-Language Pre-training (VLP) has advanced the performance for many...

0 Junnan Li, et al. ∙

research

∙ 12/17/2021

Align and Prompt: Video-and-Language Pre-training with Entity Prompts

Video-and-language pre-training has shown promising improvements on vari...

0 Dongxu Li, et al. ∙

research

∙ 11/18/2021

Towards Open Vocabulary Object Detection without Human-provided Bounding Boxes

Despite great progress in object detection, most existing methods are li...

7 Mingfei Gao, et al. ∙

research

∙ 10/19/2021

Improving Tail-Class Representation with Centroid Contrastive Learning

In vision domain, large-scale natural datasets typically exhibit long-ta...

0 Anthony Meng Huat Tiong, et al. ∙

research

∙ 10/15/2021

Cascaded Fast and Slow Models for Efficient Semantic Code Search

The goal of natural language semantic code search is to retrieve a seman...

21 Akhilesh Deepak Gotmare, et al. ∙

research

∙ 07/16/2021

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

Large-scale vision and language representation learning has shown promis...

30 Junnan Li, et al. ∙

research

∙ 11/23/2020

CoMatch: Semi-supervised Learning with Contrastive Graph Regularization

Semi-supervised learning has been an effective paradigm for leveraging u...

0 Junnan Li, et al. ∙

research

∙ 09/17/2020

MoPro: Webly Supervised Learning with Momentum Prototypes

We propose a webly-supervised representation learning method that does n...

8 Junnan Li, et al. ∙

research

∙ 07/23/2020

The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation

Most existing object instance detection and segmentation models only wor...

0 PetsTime, et al. ∙

research

∙ 05/11/2020

Prototypical Contrastive Learning of Unsupervised Representations

This paper presents Prototypical Contrastive Learning (PCL), an unsuperv...

10 Junnan Li, et al. ∙

research

∙ 03/30/2020

Improving out-of-distribution generalization via multi-task self-supervised pretraining

Self-supervised feature representations have been shown to be useful for...

0 Isabela Albuquerque, et al. ∙

research

∙ 03/03/2020

Towards Noise-resistant Object Detection with Noisy Annotations

Training deep object detectors requires significant amount of human-anno...

2 Junnan Li, et al. ∙

research

∙ 02/18/2020

DivideMix: Learning with Noisy Labels as Semi-supervised Learning

Deep neural networks are known to be annotation-hungry. Numerous efforts...

0 Junnan Li, et al. ∙

research

∙ 02/09/2020

Weakly-Supervised Multi-Person Action Recognition in 360^∘ Videos

The recent development of commodity 360^∘ cameras have enabled a single ...

0 Junnan Li, et al. ∙

research

∙ 02/09/2020

GradMix: Multi-source Transfer across Domains and Tasks

The computer vision community is witnessing an unprecedented rate of new...

0 Junnan Li, et al. ∙

research

∙ 10/29/2019

Classification Calibration for Long-tail Instance Segmentation

Remarkable progress has been made in object instance detection and segme...

23 PetsTime, et al. ∙

research

∙ 12/13/2018

Visual Social Relationship Recognition

Social relationships form the basis of social structure of humans. Devel...

0 Junnan Li, et al. ∙

research

∙ 12/13/2018

Learning to Learn from Noisy Labeled Data

Despite the success of deep neural networks (DNNs) in image classificati...

46 Junnan Li, et al. ∙

research

∙ 09/06/2018

Unsupervised Learning of View-invariant Action Representations

The recent success in human action recognition with deep learning method...

0 Junnan Li, et al. ∙

research

∙ 08/29/2018

Interact as You Intend: Intention-Driven Human-Object Interaction Detection

The recent advances in instance-level detection tasks lay strong foundat...

0 Bingjie Xu, et al. ∙

research

∙ 07/25/2018

Video Storytelling

Bridging vision and natural language is a longstanding goal in computer ...

2 Junnan Li, et al. ∙

research

∙ 08/03/2017

Attention Transfer from Web Images for Video Recognition

Training deep learning based video classifiers for action recognition re...

0 Junnan Li, et al. ∙

research

∙ 08/02/2017

Dual-Glance Model for Deciphering Social Relationships

Since the beginning of early civilizations, social relationships derived...

0 Junnan Li, et al. ∙

Junnan Li

Featured Co-authors

Sign in with Google

Consider DeepAI Pro