Stuttering is a speech disorder where the natural flow of speech is
inte...
Contrastive Predictive Coding (CPC) is a representation learning method ...
We present an approach to reduce the performance disparity between geogr...
We address performance fairness for speaker verification using the
adver...
Modern speaker verification models use deep neural networks to encode
ut...
In this paper, we propose an approach to quantitatively analyze impacts ...
End-to-end (E2E) automatic speech recognition (ASR) models have recently...
We propose a simple yet effective method to compress an RNN-Transducer
(...
There is a recent trend in machine learning to increase model quality by...
Text-to-speech systems recently achieved almost indistinguishable qualit...
Comprehending the overall intent of an utterance helps a listener recogn...
Wav2vec-C introduces a novel representation learning technique combining...
Spoken language understanding (SLU) systems extract transcriptions, as w...
This paper describes two novel complementary techniques that improve the...
In this work, we propose a novel and efficient minimum word error rate (...
Recently, the acoustic-to-word model based on the Connectionist Temporal...
Recent work in automatic recognition of conversational telephone speech ...
Unsupervised single-channel overlapped speech recognition is one of the
...
We propose to train bi-directional neural network language model(NNLM) w...