Preethi Jyothi

research

∙ 07/11/2023

Improving RNN-Transducers with Acoustic LookAhead

RNN-Transducers (RNN-Ts) have gained widespread acceptance as an end-to-...

Vinit S. Unni, et al.

research

∙ 06/10/2023

Adversarial Training For Low-Resource Disfluency Correction

Disfluencies commonly occur in conversational speech. Speech with disflu...

Vineet Bhat, et al.

research

∙ 05/26/2023

DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction

Conversational speech often consists of deviations from the speech plan,...

Vineet Bhat, et al.

research

∙ 11/02/2022

Towards Zero-Shot Code-Switched Speech Recognition

In this work, we seek to build effective code-switched (CS) automatic sp...

Brian Yan, et al.

research

∙ 10/30/2022

Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training

Training state-of-the-art ASR systems such as RNN-T often has a high ass...

Ashish Mittal, et al.

research

∙ 10/13/2022

DICTDIS: Dictionary Constrained Disambiguation for Improved NMT

Domain-specific neural machine translation (NMT) systems (e.g., in educa...

Ayush Maheshwari, et al.

research

∙ 04/02/2022

Accurate Online Posterior Alignments for Principled Lexically-Constrained Decoding

Online alignment in machine translation refers to the task of aligning a...

Soumya Chatterjee, et al.

research

∙ 03/31/2022

Investigating Modality Bias in Audio Visual Video Parsing

We focus on the audio-visual video parsing (AVVP) problem that involves ...

Piyush Singh Pasi, et al.

research

∙ 02/21/2022

Adaptive Discounting of Implicit Language Models in RNN-Transducers

RNN-Transducer (RNN-T) models have become synonymous with streaming end-...

Vinit Unni, et al.

research

∙ 02/02/2022

Error Correction in ASR using Sequence-to-Sequence Models

Post-editing in Automatic Speech Recognition (ASR) entails automatically...

Samrat Dutta, et al.

research

∙ 10/10/2021

Personalizing ASR with limited data using targeted subset selection

We study the task of personalizing ASR models to a target non-native spe...

Mayank Kothyari, et al.

research

∙ 07/21/2021

The Effectiveness of Intermediate-Task Training for Code-Switched Natural Language Understanding

While recent benchmarks have spurred a lot of new work on improving the ...

Archiki Prasad, et al.

research

∙ 07/14/2021

From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text

Generating code-switched text is a problem of growing interest, especial...

Ishan Tarunesh, et al.

research

∙ 06/02/2021

Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights

Automatic speech recognition (ASR) in Sanskrit is interesting, owing to ...

Devaraja Adiga, et al.

research

∙ 04/01/2021

Multilingual and code-switching ASR challenges for low resource Indian languages

Recently, there is increasing interest in multilingual automatic speech ...

Anuj Diwan, et al.

research

∙ 04/01/2021

Collaborative Learning to Generate Audio-Video Jointly

There have been a number of techniques that have demonstrated the genera...

Vinod K. Kurmi, et al.

research

∙ 03/09/2021

Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering

Multimodal IR, spanning text corpus, knowledge graph and images, called ...

Aman Jain, et al.

research

∙ 03/04/2021

Error-driven Fixed-Budget ASR Personalization for Accented Speakers

We consider the task of personalizing ASR models while being constrained...

Abhijeet Awasthi, et al.

research

∙ 02/11/2021

An Investigation of End-to-End Models for Robust Speech Recognition

End-to-end models for robust automatic speech recognition (ASR) have not...

Archiki Prasad, et al.

research

∙ 01/25/2021

Meta-Learning for Effective Multi-task and Multilingual Modelling

Natural language processing (NLP) tasks (e.g. question-answering in Engl...

Ishan Tarunesh, et al.

research

∙ 10/19/2020

Reduce and Reconstruct: Improving Low-resource End-to-end ASR Via Reconstruction Using Reduced Vocabularies

End-to-end automatic speech recognition (ASR) systems are increasingly b...

Anuj Diwan, et al.

research

∙ 10/12/2020

Improving Low Resource Code-switched ASR using Augmented Code-switched TTS

Building Automatic Speech Recognition (ASR) systems for code-switched sp...

Yash Sharma, et al.

research

∙ 06/24/2020

Black-box Adaptation of ASR for Accented Speech

We introduce the problem of adapting a black-box, cloud-based ASR system...

Kartik Khandelwal, et al.

research

∙ 10/25/2019

Stem-driven Language Models for Morphologically Rich Languages

Neural language models (LMs) have been shown to benefit significantly from en...

Yash Shah, et al.

research

∙ 06/22/2019

End-to-End ASR for Code-switched Hindi-English Speech

End-to-end (E2E) models have been explored for large speech corpora and ...

Brij Mohan Lal Srivastava, et al.

research

∙ 06/06/2019

Cross-Lingual Training for Automatic Question Generation

Automatic question generation (QG) is a challenging problem in natural l...

Vishwajeet Kumar, et al.

research

∙ 09/06/2018

Code-switched Language Models Using Dual RNNs and Same-Source Pretraining

This work focuses on building language models (LMs) for code-switched te...

Saurabh Garg, et al.

research

∙ 08/23/2018

Revisiting the Importance of Encoding Logic Rules in Sentiment Classification

We analyze the performance of different sentiment classification models ...

Kalpesh Krishna, et al.

research

∙ 04/28/2018

Generalizing Across Domains via Cross-Gradient Training

We present CROSSGRAD, a method to use multi-domain training data to lear...

Shiv Shankar, et al.

research

∙ 12/25/2017

Leveraging Native Language Speech for Accent Identification using Deep Siamese Networks

The problem of automatic accent identification is important for several ...

Aditya Siddhant, et al.

research

∙ 11/03/2017

Dual Language Models for Code Mixed Speech Recognition

In this work, we present a new approach to language modeling for bilingu...

Saurabh Garg, et al.

research

∙ 12/13/2016

Performance Improvements of Probabilistic Transcript-adapted ASR with Recurrent Neural Network and Language-specific Constraints

Mismatched transcriptions have been proposed as a means to acquire probab...

Xiang Kong, et al.

