Achieving high accuracy with low latency has always been a challenge in
...
End-to-end neural diarization (EEND) with encoder-decoder-based attracto...
During conversations, humans are capable of inferring the intention of t...
This paper describes our system submitted to Dialogue Robot Competition ...
This paper proposes a method for improved CTC inference with searched
in...
End-to-end automatic speech recognition (ASR) directly maps input speech...
This paper proposes InterAug: a novel training method for CTC-based ASR ...
This paper proposes a novel label-synchronous speech-to-text alignment
t...
Voice Activity Detection (VAD) refers to the problem of distinguishing s...
This paper addresses the problem of automatic speech recognition (ASR) o...