Yinghao Aaron Li

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Nima Mesgarani
25 publications
Cong Han
21 publications
Xilin Jiang
5 publications
Gavin Mischler
2 publications
Vishal Choudhari
1 publication
Ali Zare
1 publication
Vinay S. Raghavan
1 publication

research

∙ 09/18/2023

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Recent advancements in speech synthesis have leveraged GAN-based network...

0 Yinghao Aaron Li, et al. ∙

research

∙ 07/18/2023

SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs

In recent years, large-scale pre-trained speech language models (SLMs) h...

0 Yinghao Aaron Li, et al. ∙

research

∙ 06/13/2023

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

In this paper, we present StyleTTS 2, a text-to-speech (TTS) model that ...

0 Yinghao Aaron Li, et al. ∙

research

∙ 05/29/2023

DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes

Lifelong audio feature extraction involves learning new sound classes in...

0 Xilin Jiang, et al. ∙

research

∙ 02/11/2023

Improved Decoding of Attentional Selection in Multi-Talker Environments with Self-Supervised Learned Speech Representation

Auditory attention decoding (AAD) is a technique used to identify and am...

0 Cong Han, et al. ∙

research

∙ 01/20/2023

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Large-scale pre-trained language models have been shown to be helpful in...

0 Yinghao Aaron Li, et al. ∙

research

∙ 12/29/2022

StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS Models

One-shot voice conversion (VC) aims to convert speech from any source sp...

0 Yinghao Aaron Li, et al. ∙

research

∙ 05/30/2022

StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis

Text-to-Speech (TTS) has recently seen great progress in synthesizing hi...

0 Yinghao Aaron Li, et al. ∙

research

∙ 07/21/2021

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

We present an unsupervised non-parallel many-to-many voice conversion (V...

0 Yinghao Aaron Li, et al. ∙

Success!

An error occurred

Yinghao Aaron Li

Featured Co-authors

Sign in with Google

Consider DeepAI Pro