Video-language pre-training (VLP) has become increasingly important due ...
This technical report describes the EgoTask Translation approach that
ex...
Different video understanding tasks are typically treated in isolation, ...
With the broad growth of video capturing devices and applications on the...
Training computer vision models usually requires collecting and labeling...
Weakly supervised object detection (WSOD) enables object detectors to be...
Visual attention helps achieve robust perception under noise, corruption...
Contrastive learning relies on an assumption that positive pairs contain...
Contrastive learning has delivered impressive results in many audio-visu...
Creating presentation materials requires complex multimodal reasoning sk...
Large-scale datasets are the cornerstone of self-supervised representati...
The recent success of Transformers in the language domain has motivated
...
We study the problem of animating images by transferring spatio-temporal...
Current multi-reference style transfer models for Text-to-Speech (TTS)
p...
Deep generative models have led to significant advances in cross-modal
g...
Training deep neural networks typically requires large amounts of labele...
Generative adversarial networks have led to significant advances in
cros...
Visual-semantic embedding aims to find a shared latent space where relat...
Video prediction aims to generate realistic future frames by learning dy...
Traditional cross-modal retrieval assumes explicit association of concep...
Given a still photograph, one can imagine how dynamic objects might move...
Video consumption is being shifted from sit-and-watch to selective skimm...
Learning to rank has recently emerged as an attractive technique to trai...
The ability of learning from noisy labels is very useful in many visual
...
Esports has gained global popularity in recent years and several compani...
We describe a sketch interpretation system that detects and classifies c...
With the recent popularity of animated GIFs on social media, there is ne...