Zhengchen Zhang

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Wei Zhang
393 publications
Chao Zhang
239 publications
Yang Yu
155 publications
Tao Mei
152 publications
Xiaodong He
101 publications
Xiaoxiao Li
78 publications
Zhi Liu
69 publications
Bowen Zhou
69 publications
Jin Zhang
62 publications
Meng Chen
32 publications
Dan Zeng
31 publications

research

∙ 11/11/2022

MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy

Humans often speak in a continuous manner which leads to coherent and co...

0 Ya-Jie Zhang, et al. ∙

research

∙ 11/02/2022

Singing Voice Synthesis with Vibrato Modeling and Latent Energy Representation

This paper proposes an expressive singing voice synthesis system by intr...

0 Yingjie Song, et al. ∙

research

∙ 11/02/2022

Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement

Disentanglement of a speaker's timbre and style is very important for st...

0 Wei Song, et al. ∙

research

∙ 10/26/2021

ViDA-MAN: Visual Dialog with Digital Humans

We demonstrate ViDA-MAN, a digital-human agent for multi-modal interacti...

0 Tong Shen, et al. ∙

research

∙ 10/08/2021

SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition

End-to-end Automatic Speech Recognition (ASR) models are usually trained...

7 Li Fu, et al. ∙

research

∙ 11/06/2020

Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis

Despite prosody is related to the linguistic information up to the disco...

0 Guanghui Xu, et al. ∙

research

∙ 12/15/2016

Transition-based Parsing with Context Enhancement and Future Reward Reranking

This paper presents a novel reranking model, future reward reranking, to...

0 Fugen Zhou, et al. ∙

Success!

An error occurred

Zhengchen Zhang

Featured Co-authors

MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy

Singing Voice Synthesis with Vibrato Modeling and Latent Energy Representation

Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement

ViDA-MAN: Visual Dialog with Digital Humans

SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition

Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis

Transition-based Parsing with Context Enhancement and Future Reward Reranking

Sign in with Google

Consider DeepAI Pro