We address the problem of human-in-the-loop control for generating
highl...
In this paper, we introduce Kathaka, a model trained with a novel two-st...
In English, prosody adds a broad range of information to segment sequenc...
Unlike human speakers, typical text-to-speech (TTS) systems are unable t...