Graph Neural Networks (GNNs) are an architecture for structural data, and...
Accent mismatch is a critical problem for end-to-end ASR. This paper...
Non-parallel many-to-many voice conversion, as well as zero-shot voice c...
The connectionist temporal classification (CTC) training criterion provides ...
Furui first demonstrated that the identity of both consonant and vowel c...
Multi-channel speech enhancement with ad-hoc sensors has been a challeng...
The performance of automatic speech recognition systems degrades with in...
Most mainstream Automatic Speech Recognition (ASR) systems consider all ...
Natural language understanding and dialogue policy learning are both ess...
This paper tests the hypothesis that distinctive feature classifiers anc...