Guangzhi Sun

research

∙ 09/17/2023

Enhancing Quantised End-to-End ASR Models via Personalisation

Recent end-to-end automatic speech recognition (ASR) models have become ...

Qiuming Zhao, et al.

research

∙ 07/04/2023

Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data

Manually annotating fine-grained slot-value labels for task-oriented dia...

Guangzhi Sun, et al.

research

∙ 06/02/2023

Can Contextual Biasing Remain Effective with Whisper and GPT-2?

End-to-end automatic speech recognition (ASR) and large language models,...

Guangzhi Sun, et al.

research

∙ 05/30/2023

Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator

The incorporation of biasing words obtained through contextual knowledge...

Guangzhi Sun, et al.

research

∙ 10/29/2022

End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator

End-to-end spoken language understanding (SLU) suffers from the long-tai...

Guangzhi Sun, et al.

research

∙ 10/24/2022

Spectral Clustering-aware Learning of Embeddings for Speaker Diarisation

In speaker diarisation, speaker embedding extraction models often suffer...

Evonne P. C. Lee, et al.

research

∙ 07/02/2022

Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition

Incorporating biasing words obtained as contextual knowledge is critical...

Guangzhi Sun, et al.

research

∙ 05/18/2022

Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator

Contextual knowledge is essential for reducing speech recognition errors...

Guangzhi Sun, et al.

research

∙ 09/01/2021

Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition

Contextual knowledge is important for real-world automatic speech recogn...

Guangzhi Sun, et al.

research

∙ 10/22/2020

Combination of Deep Speaker Embeddings for Diarisation

Recently, significant progress has been made in speaker diarisation afte...

Guangzhi Sun, et al.

research

∙ 02/06/2020

Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis

This paper proposes a hierarchical, fine-grained and interpretable laten...

Guangzhi Sun, et al.

research

∙ 02/06/2020

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior

Recent neural text-to-speech (TTS) models with fine-grained latent featu...

Guangzhi Sun, et al.

research

∙ 02/08/2019

Speaker diarisation using 2D self-attentive combination of embeddings

Speaker diarisation systems often cluster audio segments using speaker e...

Guangzhi Sun, et al.
