Eunwoo Song


Featured Co-authors

Hong-Goo Kang
23 publications
Ryuichi Yamamoto
19 publications
Kentaro Tachibana
12 publications
Jae-Min Kim
11 publications
Min-Jae Hwang
8 publications
Changhwan Kim
7 publications
Kyungguen Byun
6 publications
Hyun-Wook Yoon
6 publications
Jin Seob Kim
4 publications
Ohsung Kwon
4 publications
Yuma Shirahata
4 publications

research

∙ 08/28/2023

Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech

For personalized speech generation, a neural text-to-speech (TTS) model ...

Hyungchan Yoon, et al.

research

∙ 10/28/2022

Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis

Several fully end-to-end text-to-speech (TTS) models have been proposed ...

Yuma Shirahata, et al.

research

∙ 06/30/2022

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems

This paper proposes an effective emotional text-to-speech (TTS) system w...

Hyun-Wook Yoon, et al.

research

∙ 06/30/2022

TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder

Recent advances in synthetic speech quality have enabled us to train tex...

Eunwoo Song, et al.

research

∙ 04/21/2022

Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation

Data augmentation via voice conversion (VC) has been successfully applie...

Ryo Terashima, et al.

research

∙ 01/19/2021

Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss

This paper proposes a spectral-domain perceptual weighting technique for...

Eunwoo Song, et al.

research

∙ 10/27/2020

Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators

This paper proposes voicing-aware conditional discriminators for Paralle...

Ryuichi Yamamoto, et al.

research

∙ 10/26/2020

TTS-by-TTS: TTS-driven Data Augmentation for Fast and High-Quality Speech Synthesis

In this paper, we propose a text-to-speech (TTS)-driven data augmentatio...

Min-Jae Hwang, et al.

research

∙ 10/25/2019

Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram

We propose Parallel WaveGAN, a distillation-free, fast, and small-footpr...

Ryuichi Yamamoto, et al.

research

∙ 05/21/2019

Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems

In this paper, we propose a high-quality generative text-to-speech (TTS)...

Ohsung Kwon, et al.

research

∙ 04/09/2019

Probability density distillation with generative adversarial networks for high-quality parallel waveform generation

This paper proposes an effective probability density distillation (PDD) ...

Ryuichi Yamamoto, et al.

research

∙ 11/09/2018

ExcitNet vocoder: A neural excitation model for parametric speech synthesis systems

This paper proposes a WaveNet-based neural excitation model (ExcitNet) f...

Eunwoo Song, et al.

research

∙ 11/08/2018

Speaker-adaptive neural vocoders for statistical parametric speech synthesis systems

This paper proposes speaker-adaptive neural vocoders for statistical par...

Eunwoo Song, et al.
