Hank Liao

research

∙ 09/15/2023

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

While standard speaker diarization attempts to answer the question "who ...

0 Yiling Huang, et al. ∙

research

∙ 09/14/2023

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

We introduce a multilingual speaker change detection model (USM-SCD) tha...

0 Guanlong Zhao, et al. ∙

research

∙ 05/11/2022

End-to-End Multi-Person Audio/Visual Automatic Speech Recognition

Traditionally, audio-visual automatic speech recognition has been studie...

0 Otavio Braga, et al. ∙

research

∙ 11/08/2019

Recurrent Neural Network Transducer for Audio-Visual Speech Recognition

This work presents a large-scale audio-visual speech recognition system ...

0 Takaki Makino, et al. ∙

research

∙ 11/06/2019

A comparison of end-to-end models for long-form speech recognition

End-to-end automatic speech recognition (ASR) models, including both att...

0 Chung-Cheng Chiu, et al. ∙

research

∙ 06/17/2019

Adversarial Training for Multilingual Acoustic Modeling

Multilingual training has been shown to improve acoustic modeling perfor...

0 Ke Hu, et al. ∙

research

∙ 03/07/2019

Neural Language Modeling with Visual Features

Multimodal language models attempt to incorporate non-linguistic feature...

0 Antonios Anastasopoulos, et al. ∙

research

∙ 07/13/2018

Large-Scale Visual Speech Recognition

This work presents a scalable solution to open-vocabulary visual speech ...

68 Brendan Shillingford, et al. ∙

research

∙ 11/15/2017

Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition

Recurrent neural network (RNN) language models (LMs) and Long Short Term...

0 Shankar Kumar, et al. ∙

research

∙ 10/31/2016

Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition

We present results that show it is possible to build a competitive, grea...

0 Hagen Soltau, et al. ∙

Hank Liao

Featured Co-authors

Sign in with Google

Consider DeepAI Pro