The last year has seen astonishing progress in text-prompted image gener...
Unpaired text and audio injection have emerged as dominant methods for
i...
Dual learning is a paradigm for semi-supervised machine learning that se...
The careful construction of audio representations has become a dominant
...
It is well known that many machine learning systems demonstrate bias tow...
In this paper, we explore self-supervised audio-visual models that learn...
Multimodal self-supervised learning is getting more and more attention a...
While improvements have been made in automatic speech recognition perfor...
A major focus of recent research in spoken language understanding (SLU) ...
United States Courts make audio recordings of oral arguments available a...
Training an end-to-end (E2E) neural network speech-to-intent (S2I) syste...
Current methods for learning visually grounded language from videos ofte...
The past decade has witnessed great progress in Automatic Speech Recogni...
Decentralized Parallel SGD (D-PSGD) and its asynchronous variant Asynchr...
Bipolar disorder, a severe chronic mental illness characterized by
patho...
There has been huge progress in speech recognition over the last several...
In automatic speech recognition (ASR), wideband (WB) and narrowband (NB)...
Evolutionary stochastic gradient descent (ESGD) was proposed as a
popula...
Modern Automatic Speech Recognition (ASR) systems rely on distributed de...
With recent advances in deep learning, considerable attention has been g...
In this paper, we propose and investigate a variety of distributed deep
...
Direct acoustics-to-word (A2W) systems for end-to-end automatic speech
r...
We propose a population-based Evolutionary Stochastic Gradient Descent (...
Direct acoustics-to-word (A2W) models in the end-to-end paradigm have
re...
Recent work on end-to-end automatic speech recognition (ASR) has shown t...
One of the most difficult speech recognition tasks is accurate recogniti...
We study large-scale kernel methods for acoustic modeling in speech
reco...
We study large-scale kernel methods for acoustic modeling and compare to...
The computational complexity of kernel methods has often been a major ba...