Arun Narayanan

research

∙ 09/14/2022

A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation

Recent work has shown that it is possible to train a single model to per...

0 Tom O'Malley, et al. ∙

research

∙ 05/17/2022

Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments

One of the most challenging scenarios for smart speakers is multi-talker...

0 Joe Caroselli, et al. ∙

research

∙ 05/06/2022

A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy

Acoustic Echo Cancellation (AEC) is essential for accurate recognition o...

0 Sankaran Panchapagesan, et al. ∙

research

∙ 04/26/2022

Mask scalar prediction for improving robust automatic speech recognition

Using neural network based acoustic frontends for improving robustness o...

0 Arun Narayanan, et al. ∙

research

∙ 04/25/2022

Cleanformer: A microphone array configuration-invariant, streaming, multichannel neural enhancement frontend for ASR

This work introduces the Cleanformer, a streaming multichannel neural ba...

0 Joseph Caroselli, et al. ∙

research

∙ 04/18/2022

Extracting Targeted Training Data from ASR Models, and How to Mitigate It

Recent work has designed methods to demonstrate that model updates in AS...

0 Ehsan Amid, et al. ∙

research

∙ 04/08/2022

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Personalization of on-device speech recognition (ASR) has seen explosive...

0 Shaojin Ding, et al. ∙

research

∙ 11/18/2021

A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation

We present a frontend for improving robustness of automatic speech recog...

0 Tom O'Malley, et al. ∙

research

∙ 11/01/2021

SNRi Target Training for Joint Speech Enhancement and Recognition

This study aims to improve the performance of automatic speech recogniti...

0 Yuma Koizumi, et al. ∙

research

∙ 10/30/2021

Cross-attention conformer for context modeling in speech enhancement for ASR

This work introduces cross-attention conformer, an attention-based archi...

0 Arun Narayanan, et al. ∙

research

∙ 09/21/2021

Home Energy Management Systems: Operation and Resilience of Heuristics against Cyberattacks

Internet of Things (IoT) and advanced communication technologies have de...

0 Hafiz Majid Hussain, et al. ∙

research

∙ 04/28/2021

Personalized Keyphrase Detection using Speaker and Environment Information

In this paper, we introduce a streaming keyphrase detection system that ...

0 Rajeev Rikhye, et al. ∙

research

∙ 02/01/2021

Virtual Microgrid Management via Software-defined Energy Network for Electricity Sharing

Digitalization has led to radical changes in the distribution of goods a...

0 Pedro H. J. Nardelli, et al. ∙

research

∙ 12/12/2020

Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging

End-to-end models that condition the output label sequence on all previo...

5 Rohit Prabhavalkar, et al. ∙

research

∙ 11/21/2020

A Better and Faster End-to-End Model for Streaming ASR

End-to-end (E2E) models have shown to outperform state-of-the-art conven...

0 Bo Li, et al. ∙

research

∙ 10/27/2020

Cascaded encoders for unifying streaming and non-streaming ASR

End-to-end (E2E) automatic speech recognition (ASR) models, by now, have...

0 Arun Narayanan, et al. ∙

research

∙ 10/22/2020

Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data

Streaming end-to-end automatic speech recognition (ASR) models are widel...

0 Thibault Doutre, et al. ∙

research

∙ 10/21/2020

FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization

Streaming automatic speech recognition (ASR) aims to emit each hypothesi...

5 Jiahui Yu, et al. ∙

research

∙ 08/10/2020

From private to public governance: The case for reconfiguring energy systems as a commons

The discussions around the unsustainability of the dominant socio-econom...

0 Chris Giotitsas, et al. ∙

research

∙ 05/07/2020

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions

In recent years, all-neural end-to-end approaches have obtained state-of...

0 Chung-Cheng Chiu, et al. ∙

research

∙ 03/28/2020

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Thus far, end-to-end (E2E) models have not been shown to outperform stat...

0 Tara N. Sainath, et al. ∙

research

∙ 11/06/2019

A comparison of end-to-end models for long-form speech recognition

End-to-end automatic speech recognition (ASR) models, including both att...

0 Chung-Cheng Chiu, et al. ∙

research

∙ 10/24/2019

Recognizing long-form speech using streaming end-to-end models

All-neural end-to-end (E2E) automatic speech recognition (ASR) systems t...

0 Arun Narayanan, et al. ∙

research

∙ 09/24/2018

From Audio to Semantics: Approaches to end-to-end spoken language understanding

Conventional spoken language understanding systems consist of two main c...

0 Parisa Haghani, et al. ∙

research

∙ 08/16/2018

Toward domain-invariant speech recognition via large scale training

Current state-of-the-art automatic speech recognition systems are traine...

0 Arun Narayanan, et al. ∙

research

∙ 12/09/2017

Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models

In this paper, we describe how to efficiently implement an acoustic room...

0 Chanwoo Kim, et al. ∙

Arun Narayanan

Featured Co-authors

Sign in with Google

Consider DeepAI Pro