Previous pitch-controllable text-to-speech (TTS) models rely on directly...
Previous generative adversarial network (GAN)-based neural vocoders are...
In this paper, we present SANE-TTS, a stable and natural end-to-end mult...
In this work, we propose a joint system combining a talking face generat...