Shinji Takaki

DeepAI

Featured Co-authors

Xin Wang
382 publications
Junichi Yamagishi
127 publications
Yi Zhao
50 publications
Gustav Eje Henter
40 publications
Hirokazu Kameoka
35 publications
Jaime Lorenzo-Trueba
29 publications
Lauri Juvela
14 publications
Hieu-Thi Luong
11 publications
Yusuke Yasuda
10 publications
Yoshihiko Nankaku
9 publications
Keiichi Tokuda
9 publications

research

∙ 11/21/2022

Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System

This paper integrates a classic mel-cepstral synthesis filter into a mod...

Takenori Yoshimura, et al.

research

∙ 02/15/2021

PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components

We propose PeriodNet, a non-autoregressive (non-AR) waveform generation ...

Yukiya Hono, et al.

research

∙ 11/10/2019

Transformation of low-quality device-recorded speech to high-quality speech using improved SEGAN model

Nowadays vast amounts of speech data are recorded from low-quality recor...

Seyyed Saeed Sarfjoo, et al.

research

∙ 10/24/2019

Fast and High-Quality Singing Voice Synthesis System based on Convolutional Neural Networks

The present paper describes singing voice synthesis based on convolution...

Kazuhiro Nakamura, et al.

research

∙ 04/27/2019

Neural source-filter waveform models for statistical parametric speech synthesis

Neural waveform models such as WaveNet have demonstrated better performa...

Xin Wang, et al.

research

∙ 03/29/2019

Training a Neural Speech Waveform Model using Spectral Losses of Short-Time Fourier Transform and Continuous Wavelet Transform

Recently, we proposed short-time Fourier transform (STFT)-based loss fun...

Shinji Takaki, et al.

research

∙ 03/29/2019

Does the Lombard Effect Improve Emotional Communication in Noise? - Analysis of Emotional Speech Acted in Noise -

Speakers usually adjust their way of talking in noisy environments invol...

Yi Zhao, et al.

research

∙ 10/29/2018

Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language

End-to-end speech synthesis is a promising approach that directly conver...

Yusuke Yasuda, et al.

research

∙ 10/29/2018

Neural source-filter-based waveform model for statistical parametric speech synthesis

Neural waveform models such as the WaveNet are used in many recent text-...

Xin Wang, et al.

research

∙ 10/29/2018

STFT spectral loss for training a neural speech waveform model

This paper proposes a new loss using short-time Fourier transform (STFT)...

Shinji Takaki, et al.

research

∙ 07/31/2018

Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder

Recent neural networks such as WaveNet and sampleRNN that learn directly...

Yi Zhao, et al.

research

∙ 04/07/2018

A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis

Recent advances in speech synthesis suggest that limitations such as the...

Xin Wang, et al.

research

∙ 03/27/2018

Complex-Valued Restricted Boltzmann Machine for Direct Speech Parameterization from Complex Spectra

This paper describes a novel energy-based probabilistic distribution tha...

Toru Nakashika, et al.
