Code intelligence plays a key role in transforming modern software
engin...
Subject-driven text-to-image generation models create novel renditions o...
Large language models (LLMs) pretrained on vast source code have achieve...
General-purpose language models that can solve various language-domain t...
The cost of vision-and-language pre-training has become increasingly
pro...
Large language models (LLMs) have demonstrated excellent zero-shot
gener...
Data heterogeneity across clients in federated learning (FL) settings is...
We introduce BotSIM, a modular, open-source Bot SIMulation environment w...
Visual question answering (VQA) is a hallmark of vision and language
rea...
We introduce LAVIS, an open-source deep learning library for LAnguage-VI...
State-of-the-art computer vision models are mostly trained with supervis...
Vision-Language Pre-training (VLP) has advanced the performance for many...
Video-and-language pre-training has shown promising improvements on vari...
Despite great progress in object detection, most existing methods are li...
In the vision domain, large-scale natural datasets typically exhibit long-ta...
The goal of natural language semantic code search is to retrieve a
seman...
Large-scale vision and language representation learning has shown promis...
Semi-supervised learning has been an effective paradigm for leveraging
u...
We propose a webly-supervised representation learning method that does n...
Most existing object instance detection and segmentation models only wor...
This paper presents Prototypical Contrastive Learning (PCL), an unsuperv...
Self-supervised feature representations have been shown to be useful for...
Training deep object detectors requires a significant amount of human-anno...
Deep neural networks are known to be annotation-hungry. Numerous efforts...
The recent development of commodity 360° cameras has enabled a
single ...
The computer vision community is witnessing an unprecedented rate of new...
Remarkable progress has been made in object instance detection and
segme...
Social relationships form the basis of the social structure of humans. Devel...
Despite the success of deep neural networks (DNNs) in image classificati...
The recent success in human action recognition with deep learning method...
The recent advances in instance-level detection tasks lay strong foundat...
Bridging vision and natural language is a longstanding goal in computer
...
Training deep-learning-based video classifiers for action recognition
re...
Since the beginning of early civilizations, social relationships derived...