Heng-Jui Chang


Featured Co-authors

Shinji Watanabe: 239 publications
Hung-Yi Lee: 187 publications
James Glass: 123 publications
Michael Auli: 68 publications
Wei-Ning Hsu: 59 publications
Abdelrahman Mohamed: 41 publications
Wen-Chin Huang: 39 publications
Xuankai Chang: 37 publications
David Harwath: 35 publications
Lin-shan Lee: 34 publications
Shang-Wen Li: 34 publications

research

∙ 09/14/2023

CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders

Large-scale self-supervised pre-trained speech encoders outperform conve...

Heng-Jui Chang, et al.

research

∙ 05/18/2023

Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering

Self-supervised speech representation models have succeeded in various t...

Heng-Jui Chang, et al.

research

∙ 05/17/2023

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

In this paper, we introduce self-distillation and online clustering for ...

Alexander H. Liu, et al.

research

∙ 11/02/2022

M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval

This work investigates the use of large-scale, pre-trained models (CLIP ...

Layne Berry, et al.

research

∙ 10/03/2022

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model

Data-driven speech processing models usually perform well with a large a...

Yi-Jen Shih, et al.

research

∙ 03/14/2022

SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities

Transfer learning has proven to be crucial in advancing the state of spe...

Hsiang-Sheng Tsai, et al.

research

∙ 10/07/2021

Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models

Code-switching (CS) is common in daily conversations where more than one...

Liang-Hsuan Tseng, et al.

research

∙ 10/05/2021

DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT

Self-supervised speech representation learning methods like wav2vec 2.0 ...

Heng-Jui Chang, et al.

research

∙ 04/06/2021

Non-autoregressive Mandarin-English Code-switching Speech Recognition with Pinyin Mask-CTC and Word Embedding Regularization

Mandarin-English code-switching (CS) is frequently used among East and S...

Shun-Po Chuang, et al.

research

∙ 04/04/2021

Towards Lifelong Learning of End-to-end ASR

Automatic speech recognition (ASR) technologies today are primarily opti...

Heng-Jui Chang, et al.

research

∙ 05/05/2020

End-to-end Whispered Speech Recognition with Frequency-weighted Approaches and Layer-wise Transfer Learning

Whispering is an important mode of human speech, but no end-to-end recog...

Heng-Jui Chang, et al.
