Existing text-to-image diffusion models struggle to synthesize realistic...
Video moment retrieval (VMR) aims to identify the specific moment in an
...
The deployment of large-scale generative models is often restricted by t...
In this paper, we tackle the challenging task of Panoramic Image-to-Imag...
Token-based masked generative models are gaining popularity for their fa...
Text-to-3D generation has shown rapid progress in recent days with the a...
Generated synthetic data in medical research can substitute privacy and
...
Image blending aims to combine multiple images seamlessly. It remains
ch...
Multi-resolution hash encoding has recently been proposed to reduce the
...
Efficient video-language modeling should consider the computational cost...
Neural networks trained with ERM (empirical risk minimization) sometimes...
Video corpus moment retrieval (VCMR) is the task to retrieve the most
re...
There are growing interests in adapting large-scale language models usin...
Text-to-image generation and image captioning are recently emerged as a ...
Visual dialog (VisDial) is a task of answering a sequence of questions
g...
Cross-domain few-shot learning (CD-FSL), where there are few target samp...
Cross-domain few-shot learning has drawn increasing attention for handli...
We present the efficiency of semi-orthogonal embedding for unsupervised
...
Gradient-based meta-learning approaches have been successful in few-shot...
Visual dialog is a task of answering a sequence of questions grounded in...
We propose a video story question-answering (QA) architecture, Multimoda...
Attention networks in multimodal learning provide an efficient way to ut...
The visual explanation of learned representation of models helps to
unde...
In this work, we propose a goal-driven collaborative task that contains
...
Bilinear models provide rich representations compared with linear models...
Deep neural networks continue to advance the state-of-the-art of image
r...
The rnn package provides components for implementing a wide range of
Rec...