We introduce O-1, a new self-training objective to reduce training bias ...
This work studies knowledge distillation (KD) and addresses its constrai...
Dysarthric speech recognition has posed major challenges due to lack of
...
Masked speech modeling (MSM) methods such as wav2vec2 or w2v-BERT learn
...
Self-supervised ASR-TTS models suffer in out-of-domain data conditions. ...
The paper describes the BUT Automatic Speech Recognition (ASR) systems
s...
Sequence-to-sequence ASR models require large quantities of data to atta...
This paper investigates the applications of various multilingual approac...
In this paper, we present promising accurate prefix boosting (PAPB), a
d...
In this paper, we explore several new schemes to train a seq2seq model t...
This work explores better adaptation methods to low-resource languages u...
Sequence-to-sequence (seq2seq) approach for low-resource ASR is a relati...
Training deep recurrent neural network (RNN) architectures is complicate...