Text-to-Text Transfer Transformer (T5) has recently been considered for ...
Automatic Speech Recognition (ASR) systems have attained unprecedented p...
Studies have shown that modern neural networks tend to be poorly calibra...
Word Sense Disambiguation (WSD) is an NLP task aimed at determining the ...
Video-grounded Dialogue (VGD) aims to decode an answer sentence to a que...
Video moment retrieval (VMR) aims to localize target moments in untrimme...