Generalisation – the ability of a model to perform well on unseen data –...
Automatic Speech Recognition (ASR) systems often struggle with transcrib...
Integrated circuit verification has gathered considerable interest in re...
The task of converting text input into video content is becoming an impo...
With the ever increasing complexity of specifications, manual sizing for...
Multimodal speech recognition aims to improve the performance of automat...
Speech synthesis has come a long way as current text-to-speech (TTS) mod...
The task of video-to-speech aims to translate silent video of lip moveme...
Quantifying the confidence (or conversely the uncertainty) of a predicti...
We describe the submission of the Quo Vadis team to the Traffic4cast
com...
This paper addresses the problem of building a speech recognition system...