Online speech recognition, where the model only accesses context to the ...
We introduce AudioPaLM, a large language model for speech understanding ...
Unpaired text and audio injection have emerged as dominant methods for
i...
We introduce the Universal Speech Model (USM), a single large model that...
Dual learning is a paradigm for semi-supervised machine learning that se...
Automatic speech recognition (ASR) systems typically rely on an external...
Language identification is critical for many downstream tasks in automat...
We propose automatic speech recognition (ASR) models inspired by echo st...
End-to-end automatic speech recognition (ASR) models, including both
att...
Given the recent surge in developments of deep learning, this article
pr...
Lingvo is a Tensorflow framework offering a complete solution for
collab...
We present two end-to-end models: Audio-to-Byte (A2B) and Byte-to-Audio
...