Model pre-training on large text corpora has been demonstrated to be effective...
Contrastive loss has been increasingly used in learning representations ...
To solve video-and-language grounding tasks, the key is for the network ...
We introduce a novel privacy-preserving methodology for performing Visua...
GuessWhat?! is a two-player visual dialog guessing game where player A a...
Current conversational AI systems aim to understand a set of pre-designe...
We introduce the "adversarial code learning" (ACL) module that improves ...
Recommender systems are designed to help mitigate information overload u...
In this paper, we introduce attribute-aware fashion-editing, a novel tas...
The present study proposes LitStoryTeller, an interactive system for vis...
With the prevalence of video sharing, there are increasing demands for a...