Self-supervised learning (SSL) has gained remarkable success, for which
...
Text-to-Text Transfer Transformer (T5) has recently been considered for ...
Neural Implicit Representations (NIR) have gained significant attention
...
Unsupervised speech recognition (ASR-U) is the problem of learning autom...
Automatic Speech Recognition (ASR) systems have attained unprecedented
p...
Inspired by Regularized Lottery Ticket Hypothesis (RLTH), which states t...
An algorithm based on a deep probabilistic architecture referred to as a...
Studies have shown that modern neural networks tend to be poorly calibra...
An unbiased scene graph generation (SGG) algorithm referred to as Skew
C...
Word Sense Disambiguation (WSD) is an NLP task aimed at determining the
...
Video-grounded Dialogue (VGD) aims to decode an answer sentence to a que...
Self-supervised learning (SSL) approaches have shown promising capabilit...
Video moment retrieval (VMR) aims to localize target moments in untrimme...
Existing state-of-the-art 3D point cloud instance segmentation methods r...
Inspired by Regularized Lottery Ticket Hypothesis (RLTH), which hypothes...
Exponential Moving Average (EMA or momentum) is widely used in modern
se...
Adversarial training (AT) for robust representation learning and
self-su...
A learning algorithm referred to as Maximum Margin (MM) is proposed for
...
Contrastive learning (CL) is widely known to require many negative sampl...
To avoid collapse in self-supervised learning (SSL), a contrastive loss ...
Existing state-of-the-art 3D instance segmentation methods perform seman...
Adversarial training (AT) and its variants are the most effective approa...
Single Image Super-Resolution (SISR) is a very active research field. Th...
This paper proposes a method for prioritizing the replay experience refe...
This paper defines fair principal component analysis (PCA) as minimizing...
In this work, we propose a novel methodology for self-supervised learnin...
A video-grounded dialogue system referred to as the Structured Co-refere...
Developing an agent in reinforcement learning (RL) that is capable of
pe...
Model agnostic meta-learning (MAML) is a popular state-of-the-art
meta-l...
This paper considers a video caption generating network referred to as
S...
Cascaded architectures have brought significant performance improvement ...
Video Moment Retrieval (VMR) is a task to localize the temporal moment i...
This paper considers a network referred to as Modality Shifting Attentio...
This paper considers an architecture referred to as Cascade Region Propo...
This paper proposes a method to gain extra supervision via multi-task
le...
In this paper, we propose a novel edge-labeling graph neural network (EG...
This paper proposes the progressive attention memory network (PAMN) for ...
Determining the appropriate batch size for mini-batch gradient descent i...
Deep learning typically requires training a very capable architecture us...
Obtaining compact and discriminative features is one of the major challe...