Ehsan Variani

research

∙ 04/25/2023

LAST: Scalable Lattice-Based Speech Modelling in JAX

We introduce LAST, a LAttice-based Speech Transducer library in JAX. Wit...

0 Ke Wu, et al. ∙

research

∙ 02/16/2023

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

We propose JEIT, a joint end-to-end (E2E) model and internal language mo...

0 Zhong Meng, et al. ∙

research

∙ 12/22/2022

Alignment Entropy Regularization

Existing training criteria in automatic speech recognition(ASR) permit t...

0 Ehsan Variani, et al. ∙

research

∙ 10/31/2022

Modular Hybrid Autoregressive Transducer

Text-only adaptation of a transducer model remains challenging for end-t...

0 Zhong Meng, et al. ∙

research

∙ 07/02/2022

UserLibri: A Dataset for ASR Personalization Using Only Text

Personalization of speech models on mobile devices (on-device personaliz...

0 Theresa Breiner, et al. ∙

research

∙ 05/26/2022

Global Normalization for Streaming Speech Recognition in a Modular Framework

We introduce the Globally Normalized Autoregressive Transducer (GNAT) fo...

0 Ehsan Variani, et al. ∙

research

∙ 04/15/2022

Improving Rare Word Recognition with LM-aware MWER Training

Language models (LMs) significantly improve the recognition accuracy of ...

0 Weiran Wang, et al. ∙

research

∙ 10/27/2020

Cascaded encoders for unifying streaming and non-streaming ASR

End-to-end (E2E) automatic speech recognition (ASR) models, by now, have...

0 Arun Narayanan, et al. ∙

research

∙ 03/12/2020

Hybrid Autoregressive Transducer (hat)

This paper proposes and evaluates the hybrid autoregressive transducer (...

0 Ehsan Variani, et al. ∙

research

∙ 02/26/2020

A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition

This article describes a density ratio approach to integrating external ...

0 Erik McDermott, et al. ∙

research

∙ 11/20/2018

WEST: Word Encoded Sequence Transducers

Most of the parameters in large vocabulary models are used in embedding ...

0 Ehsan Variani, et al. ∙

research

∙ 12/09/2017

Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models

In this paper, we describe how to efficiently implement an acoustic room...

0 Chanwoo Kim, et al. ∙

research

∙ 04/22/2015

Non-Adaptive Policies for 20 Questions Target Localization

The problem of target localization with noise is addressed. The target i...

0 Ehsan Variani, et al. ∙

Ehsan Variani

Featured Co-authors

Sign in with Google

Consider DeepAI Pro