Most research on task oriented dialog modeling is based on written text
...
Research on speech-to-speech translation (S2ST) has progressed rapidly i...
Transfer tasks in text-to-speech (TTS) synthesis - where one or more asp...
End-to-end speech-to-speech translation (S2ST) without relying on
interm...
We introduce XTREME-S, a new benchmark to evaluate universal cross-lingu...
We present mSLAM, a multilingual Speech and LAnguage Model that learns
c...
We introduce CVSS, a massively multilingual-to-English speech-to-speech
...
In this paper we present VDTTS, a Visually-Driven Text-to-Speech model.
...
Unsupervised pre-training is now the predominant approach for both text ...
We present Translatotron 2, a neural direct speech-to-speech translation...
This paper introduces PnG BERT, a new encoder model for neural TTS. This...
This paper introduces Parallel Tacotron 2, a non-autoregressive neural
t...
Although neural end-to-end text-to-speech models can synthesize highly
n...
This paper presents Non-Attentive Tacotron based on the Tacotron 2
text-...
In this paper, we propose Textual Echo Cancellation (TEC) - a framework ...
Recently, a semi-supervised learning method known as "noisy student trai...
Automatic speaker verification (ASV) is one of the most natural and
conv...
Recent success of the Tacotron speech synthesis architecture and its var...
We present a multispeaker, multilingual text-to-speech (TTS) synthesis m...
We present an attention-based sequence-to-sequence neural network which ...
We describe Parrotron, an end-to-end-trained speech-to-speech conversion...
This paper introduces a new speech corpus called "LibriTTS" designed for...
Lingvo is a Tensorflow framework offering a complete solution for
collab...
End-to-end Speech Translation (ST) models have many potential advantages...
This paper proposes a neural end-to-end text-to-speech (TTS) model which...
In this paper, we present a novel system that separates the voice of a t...
We describe a neural network-based system for text-to-speech (TTS) synth...
In this work, we propose "global style tokens" (GSTs), a bank of embeddi...