Hard of hearing or profoundly deaf people make use of cued speech (CS) a...
Several recent studies have tested the use of transformer language model...
The human perception system is often assumed to recruit motor knowledge ...
This paper proposes a simple and effective approach for automatic recogn...
We propose a computational model of speech production combining a pre-tr...
The Variational Autoencoder (VAE) is a powerful deep generative model th...
It is increasingly considered that human speech perception and productio...
The prosody of a spoken word is determined by its surrounding context. I...
In incremental text to speech synthesis (iTTS), the synthesizer produces...
The Variational Autoencoder (VAE) is a powerful deep generative model th...
This study investigates the use of non-linear unsupervised dimensionalit...