Alexander H. Liu

research

∙ 05/18/2023

Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering

Self-supervised speech representation models have succeeded in various t...

0 Heng-Jui Chang, et al. ∙

research

∙ 05/17/2023

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

In this paper, we introduce self-distillation and online clustering for ...

0 Alexander H. Liu, et al. ∙

research

∙ 04/06/2022

Simple and Effective Unsupervised Speech Synthesis

We introduce the first unsupervised speech synthesis system based on a s...

0 Alexander H. Liu, et al. ∙

research

∙ 04/05/2022

Towards End-to-end Unsupervised Speech Recognition

Unsupervised speech recognition has shown great potential to make Automa...

0 Alexander H. Liu, et al. ∙

research

∙ 10/04/2021

On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis

Are end-to-end text-to-speech (TTS) models over-parametrized? To what ex...

1 Cheng-I Jeff Lai, et al. ∙

research

∙ 06/10/2021

PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition

Recent work on speech self-supervised learning (speech SSL) demonstrated...

4 Cheng-I Jeff Lai, et al. ∙

research

∙ 06/10/2021

Cross-Modal Discrete Representation Learning

Recent advances in representation learning have demonstrated an ability ...

0 Alexander H. Liu, et al. ∙

research

∙ 11/01/2020

Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies

Self-supervised speech representations have been shown to be effective i...

0 Alexander H. Liu, et al. ∙

research

∙ 05/21/2020

Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation

Speech translation (ST) aims to learn transformations from speech in the...

0 Shun-Po Chuang, et al. ∙

research

∙ 05/16/2020

Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation

Recently, end-to-end multi-speaker text-to-speech (TTS) systems gain suc...

0 Tao Tu, et al. ∙

research

∙ 05/05/2020

End-to-end Whispered Speech Recognition with Frequency-weighted Approaches and Layer-wise Transfer Learning

Whispering is an important mode of human speech, but no end-to-end recog...

0 Heng-Jui Chang, et al. ∙

research

∙ 10/28/2019

Sequence-to-sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding

In this paper, we investigate the benefit that off-the-shelf word embedd...

0 Alexander H. Liu, et al. ∙

research

∙ 10/28/2019

Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning

In this paper we propose a Sequential Representation Quantization AutoEn...

0 Alexander H. Liu, et al. ∙

research

∙ 11/02/2018

Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model

In this paper we proposed a novel Adversarial Training (AT) approach for...

0 Alexander H. Liu, et al. ∙

Alexander H. Liu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro