We propose a novel framework for electrolaryngeal speech intelligibility...
We propose PromptTTS++, a prompt-based text-to-speech (TTS) synthesis sy...
This paper describes the design of NNSVS, an open-source software for ne...
We propose a lightweight end-to-end text-to-speech model using multi-ban...
Several fully end-to-end text-to-speech (TTS) models have been proposed ...
Neural audio super-resolution models are typically trained on low- and h...
This paper proposes an effective emotional text-to-speech (TTS) system w...
Recent advances in synthetic speech quality have enabled us to train tex...
Data augmentation via voice conversion (VC) has been successfully applie...
Most text-to-speech (TTS) methods use high-quality speech corpora record...
This paper describes ESPnet2-TTS, an end-to-end text-to-speech (E2E-TTS)...
We propose a novel phrase break prediction method that combines implicit...
This paper proposes a spectral-domain perceptual weighting technique for...
This paper proposes voicing-aware conditional discriminators for Paralle...
In this paper, we propose a text-to-speech (TTS)-driven data augmentatio...
We propose Parallel WaveGAN, a distillation-free, fast, and small-footpr...
This paper introduces a new end-to-end text-to-speech (E2E-TTS) toolkit ...
Sequence-to-sequence models have been widely used in end-to-end speech p...
This paper proposes an effective probability density distillation (PDD) ...