For personalized speech generation, a neural text-to-speech (TTS) model ...
Several fully end-to-end text-to-speech (TTS) models have been proposed ...
This paper proposes an effective emotional text-to-speech (TTS) system w...
Recent advances in synthetic speech quality have enabled us to train
tex...
Data augmentation via voice conversion (VC) has been successfully applie...
This paper proposes a spectral-domain perceptual weighting technique for...
This paper proposes voicing-aware conditional discriminators for Paralle...
In this paper, we propose a text-to-speech (TTS)-driven data augmentatio...
We propose Parallel WaveGAN, a distillation-free, fast, and small-footpr...
In this paper, we propose a high-quality generative text-to-speech (TTS)...
This paper proposes an effective probability density distillation (PDD)
...
This paper proposes a WaveNet-based neural excitation model (ExcitNet) f...
This paper proposes speaker-adaptive neural vocoders for statistical
par...