Despite their impressive capabilities, large language models (LLMs) are ...
Summarizing lengthy documents is a common and essential task in our dail...
This work studies how to transform an album into vivid and coherent storie...
Many applications of text generation such as summarization benefit from ...
Tailoring outputs of large language models, such as ChatGPT, to specific...
Large language models (LLMs), such as ChatGPT, are able to generate huma...
Unsupervised domain adaptation has been widely adopted in tasks with scarc...
Exploring dense matching between the current frame and past frames for l...
This paper focuses on analyzing and improving the commonsense ability of...
This paper presents OmniVL, a new foundation model to support both image...
People say, "A picture is worth a thousand words". Then how can we get t...
This paper revisits visual representation in knowledge-based visual ques...
Human intelligence is multimodal; we integrate visual, linguistic, and a...
Recent state-of-the-art computer vision systems are trained from natural...
We consider a regression problem, where the correspondence between input...
Self-attention mechanisms have achieved great success on a variety of NL...
The top-k operation, i.e., finding the k largest or smallest elements fr...
This paper proposes a new meta-learning method -- named HARMLESS (HAwkes...
Optimal Transport (OT) naturally arises in many machine learning applica...
Deep neural networks have achieved great success in multiple learning pr...
Machine learning methods play increasingly important roles in pre-proced...
Wasserstein distance plays increasingly important roles in machine learn...