Text-to-Text Transfer Transformer (T5) has recently been considered for ...
This paper proposes a new regularization algorithm referred to as macro-...
With the recent developments in cross-lingual Text-to-Speech (TTS) syste...
This report describes our 1st place solution to ECCV 2022 challenge on H...
Despite the widespread use of unsupervised models, very few methods are
...
Transformers have become a default architecture in computer vision, but
...
Intonations take an important role in delivering the intention of the
sp...
Speech recognition on smart devices is challenging owing to the small me...
In this paper, we propose a three-stage training methodology to improve ...
In this paper, we present a comparative study on the robustness of two
d...
Simultaneous Speech-to-text Translation (SimulST) systems translate sour...
In this paper, we present a streaming end-to-end speech recognition mode...
Recently, simultaneous translation has gathered a lot of attention since...
In this paper, we review various end-to-end automatic speech recognition...
In this paper, we present a Small Energy Masking (SEM) algorithm, which ...
In this paper, we present a new on-device automatic speech recognition (...
In this paper, we propose a refined multi-stage multi-task training stra...
In this paper, we describe the Maximum Uniformity of Distribution (MUD)
...
In this paper, we present an end-to-end training framework for building
...
End-to-end Speech Translation (ST) models have several advantages such a...
In this paper, we describe how to efficiently implement an acoustic room...