Direct speech-to-speech translation (S2ST) aims to convert speech from o...
Text-to-speech(TTS) has undergone remarkable improvements in performance...
Speech-to-SQL (S2SQL) aims to convert spoken questions into SQL queries ...
We are interested in a challenging task, Realistic-Music-Score based Sin...
Denoising diffusion probabilistic models (DDPMs) have recently achieved
...
Direct speech-to-speech translation (S2ST) systems leverage recent progr...