Desh Raj

research

∙ 09/18/2023

Training dynamic models using early exits for automatic speech recognition on resource-constrained devices

The possibility of dynamically modifying the computational load of neura...

0 George August Wright, et al. ∙

research

∙ 06/23/2023

The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios

The CHiME challenges have played a significant role in the development a...

0 Samuele Cornell, et al. ∙

research

∙ 06/18/2023

SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition

The Streaming Unmixing and Recognition Transducer (SURT) model was propo...

0 Desh Raj, et al. ∙

research

∙ 12/10/2022

GPU-accelerated Guided Source Separation for Meeting Transcription

Guided source separation (GSS) is a type of target-speaker extraction me...

0 Desh Raj, et al. ∙

research

∙ 11/01/2022

Adapting self-supervised models to multi-talker speech recognition using speaker embeddings

Self-supervised learning (SSL) methods which learn representations of da...

0 Zili Huang, et al. ∙

research

∙ 10/20/2022

Anchored Speech Recognition with Neural Transducers

Neural transducers have gained popularity in production ASR systems, ach...

0 Desh Raj, et al. ∙

research

∙ 10/10/2021

Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models

Self-supervised model pre-training has recently garnered significant int...

6 Matthew Wiesner, et al. ∙

research

∙ 09/17/2021

Continuous Streaming Multi-Talker ASR with Dual-path Transducers

Streaming recognition of multi-talker conversations has so far been eval...

0 Desh Raj, et al. ∙

research

∙ 04/05/2021

Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem

We recently proposed DOVER-Lap, a method for combining overlap-aware spe...

0 Desh Raj, et al. ∙

research

∙ 02/02/2021

The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap

This paper provides a detailed description of the Hitachi-JHU system tha...

15 Shota Horiguchi, et al. ∙

research

∙ 11/05/2020

Multi-class Spectral Clustering with Overlaps for Speaker Diarization

This paper describes a method for overlap-aware speaker diarization. Giv...

0 Desh Raj, et al. ∙

research

∙ 11/04/2020

Frustratingly Easy Noise-aware Training of Acoustic Models

Environmental noises and reverberation have a detrimental effect on the ...

0 Desh Raj, et al. ∙

research

∙ 11/03/2020

Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis

Multi-speaker speech recognition of unsegmented recordings has diverse a...

0 Desh Raj, et al. ∙

research

∙ 11/03/2020

DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs

Several advances have been made recently towards handling overlapping sp...

0 Desh Raj, et al. ∙

research

∙ 06/14/2020

The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge

This paper summarizes the JHU team's efforts in tracks 1 and 2 of the CH...

0 Ashish Arora, et al. ∙

research

∙ 09/13/2019

Probing the Information Encoded in x-vectors

Deep neural network based speaker embeddings, such as x-vectors, have be...

0 Desh Raj, et al. ∙

Desh Raj

Featured Co-authors

Sign in with Google

Consider DeepAI Pro