Dirk Padfield | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Yu Zhang
406 publications
Wei Han
71 publications
Ankur Bapna
47 publications
Jiahui Yu
39 publications
Yongqiang Wang
34 publications
Neil Zeghidour
31 publications
Marco Tagliasacchi
31 publications
Colin Cherry
26 publications
Eugene Kharitonov
21 publications
Zhishuai Zhang
21 publications
Damien Vincent
20 publications

research

∙ 06/22/2023

AudioPaLM: A Large Language Model That Can Speak and Listen

We introduce AudioPaLM, a large language model for speech understanding ...

0 Paul K. Rubenstein, et al. ∙

research

∙ 05/19/2023

MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup

Current disfluency detection models focus on individual utterances each ...

0 Hua Shen, et al. ∙

research

∙ 08/05/2022

Chronological Self-Training for Real-Time Speaker Diarization

Diarization partitions an audio stream into segments based on the voices...

0 Dirk Padfield, et al. ∙

research

∙ 05/02/2022

Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection

In modern interactive speech-based systems, speech is consumed and trans...

0 Angelica Chen, et al. ∙

research

∙ 09/14/2021

Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech

Automatic Speech Recognition (ASR) systems are often optimized to work b...

0 Katrin Tomanek, et al. ∙

research

∙ 10/21/2020

Sentence Boundary Augmentation For Neural Machine Translation Robustness

Neural Machine Translation (NMT) models have demonstrated strong state o...

0 Daniel Li, et al. ∙

Success!

An error occurred