ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitat...
We present a large-scale comparative study of self-supervised speech
rep...
Anomalous sound detection systems must detect unknown, atypical sounds u...
This paper proposes acoustic event detection (AED) with classifier chain...
Deep learning based models have significantly improved the performance o...
This work presents a self-supervised method to learn dense semantically ...
This paper describes ESPnet2-TTS, an end-to-end text-to-speech (E2E-TTS)...
This paper introduces S3PRL-VC, an open-source voice conversion (VC)
fra...
In voice conversion (VC), an approach showing promising results in the l...
An anomalous sound detection system to detect unknown anomalous sounds
u...
This paper proposes a novel voice conversion (VC) method based on
non-au...
In this paper, we present an open-source software for developing a
nonpa...
This paper describes the recent development of ESPnet
(https://github.co...
We present ESPnet-SE, which is designed for the quick development of spe...
In this study, we present recent developments on ESPnet: End-to-End Spee...
We present a novel approach to any-to-one (A2O) voice conversion (VC) in...
This paper presents the sequence-to-sequence (seq2seq) baseline system f...
Sequence-to-sequence (seq2seq) voice conversion (VC) models are attracti...
In this paper, we propose a quasi-periodic parallel WaveGAN (QPPWG) wave...
In this paper, a pitch-adaptive waveform generative model named
Quasi-Pe...
In this paper, we propose a parallel WaveGAN (PWG)-like neural vocoder w...
This paper proposes a new end-to-end text-to-speech (E2E-TTS) model base...
We present ESPnet-ST, which is designed for the quick development of
spe...
In this paper, we integrate a simple non-parallel voice conversion (VC)
...
This paper integrates a voice activity detection (VAD) function with
end...
We introduce a novel sequence-to-sequence (seq2seq) voice conversion (VC...
This paper introduces a new end-to-end text-to-speech (E2E-TTS) toolkit ...
Sequence-to-sequence models have been widely used in end-to-end speech
p...
In this paper, we present a novel technique for a non-parallel voice
con...
In this paper, we investigate the effectiveness of a quasi-periodic Wave...
In this paper, we propose a quasi-periodic neural network (QPNet) vocode...
In this work, we investigate the effectiveness of two techniques for
imp...
This paper presents a refinement framework of WaveNet vocoders for
varia...
This paper presents a method to train end-to-end automatic speech recogn...
In this paper we propose a novel data augmentation method for attention-...
In this paper, we propose a technique to alleviate quality degradation c...
This paper presents a new network architecture called multi-head decoder...
This paper introduces a new open source platform for end-to-end speech
p...