Recently, large pretrained language models have demonstrated strong lang...
End-to-end spoken language understanding (SLU) remains elusive even with...
Given the recent success of diffusion in producing natural-sounding synt...
Compared to conventional artificial neurons that produce dense and
real-...
Current speech recognition architectures perform very well from the poin...
Using Bayes's theorem, we derive a unit-wise recurrence as well as a bac...
Neural Network (NN) classifiers can assign extreme probabilities to samp...
We begin by reiterating that common neural network activation functions ...
The quest for comprehensive generative models of intonation that link
li...
Phoneme-based multilingual training and different cross-lingual adaptati...
Most current very low bit rate (VLBR) speech coding systems use hidden M...