Speaker extraction and diarization are two crucial enabling techniques f...
Self-supervised pre-training has been successful in both text and speech...
Speech is the surface form of a finite set of phonetic units, which can ...
The rapid development of single-modal pre-training has prompted research...
This paper describes the submission of our end-to-end YiTrans speech
tra...
This paper studies a novel pre-training technique with unpaired speech d...