In speech translation, leveraging multimodal data to improve model
perfo...
How can speech-to-text translation (ST) perform as well as machine
trans...
End-to-end Speech Translation (E2E ST) aims to translate source speech i...
Training speech translation (ST) models requires large and high-quality
...
How can we learn unified representations for spoken utterances and their...
This paper introduces GigaST, a large-scale pseudo speech translation (S...
How to learn a better speech representation for end-to-end speech-to-tex...
This paper describes the systems submitted to IWSLT 2021 by the Volctran...
End-to-end speech translation models have become a new trend in the rese...
How to generate descriptions from structured data organized in tables?
E...