Self-supervised learning (SSL) is at the origin of unprecedented improve...
Automatic dialogue summarization is a well-established task that aims to...
The Multi-modal Multiple Appropriate Facial Reaction Generation Challeng...
Voice Activity Detection (VAD) aims at detecting speech segments on an a...
Most automatic emotion recognition systems exploit time-continuous annot...
Pre-trained language models have established the state-of-the-art on var...
Self-Supervised Learning (SSL) using huge unlabeled data has been succes...
The Audio/Visual Emotion Challenge and Workshop (AVEC 2019) "State-of-Mi...
Natural human-computer interaction and audio-visual human behaviour sens...