Ziyang Ma

research

∙ 09/19/2023

Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition

In this paper, we explored how to boost speech emotion recognition (SER)...

0 Ziyang Ma, et al. ∙

research

∙ 09/14/2023

Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS

Self-supervised learning (SSL) proficiency in speech-related tasks has d...

0 Yifan Yang, et al. ∙

research

∙ 09/10/2023

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching

Although diffusion models in text-to-speech have become a popular choice...

0 Yiwei Guo, et al. ∙

research

∙ 08/28/2023

Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition

In recent years, speech-based self-supervised learning (SSL) has made si...

0 Zhisheng Zheng, et al. ∙

research

∙ 06/15/2023

Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation

The excellent generalization ability of self-supervised learning (SSL) f...

0 Ziyang Ma, et al. ∙

research

∙ 06/14/2023

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

Recently, end-to-end (E2E) automatic speech recognition (ASR) models hav...

0 Zheng Liang, et al. ∙

research

∙ 06/12/2023

LTCR: Long-Text Chinese Rumor Detection Dataset

False information can spread quickly on social media, negatively influen...

0 Ziyang Ma, et al. ∙

research

∙ 03/09/2023

Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation

Audio-driven talking face has attracted broad interest from academia and...

0 Qi Chen, et al. ∙

research

∙ 02/18/2023

Front-End Adapter: Adapting Front-End Input of Speech based Self-Supervised Learning for Speech Recognition

Recent years have witnessed a boom in self-supervised learning (SSL) in ...

0 Xie Chen, et al. ∙

research

∙ 11/24/2022

TESSP: Text-Enhanced Self-Supervised Speech Pre-training

Self-supervised speech pre-training empowers the model with the contextu...

0 Zhuoyuan Yao, et al. ∙

research

∙ 11/14/2022

MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets

In this paper, we provide a new perspective on self-supervised speech mo...

0 Ziyang Ma, et al. ∙

research

∙ 10/27/2022

Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition

Recent years have witnessed great strides in self-supervised learning (S...

0 Yujin Wang, et al. ∙

research

∙ 10/31/2021

Hierarchical Deep Residual Reasoning for Temporal Moment Localization

Temporal Moment Localization (TML) in untrimmed videos is a challenging ...

0 Ziyang Ma, et al. ∙

Ziyang Ma

Featured Co-authors

Sign in with Google

Consider DeepAI Pro