Jinxi Guo

research

∙ 07/21/2023

Prompting Large Language Models with Speech Recognition Abilities

Large language models have proven themselves highly flexible, able to so...

0 Yassir Fathullah, et al. ∙

research

∙ 11/04/2022

Biased Self-supervised learning for ASR

Self-supervised learning via masked prediction pre-training (MPPT) has s...

0 Florian L. Kreyssig, et al. ∙

research

∙ 02/22/2022

VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition

While end-to-end models have shown great success on the Automatic Speech...

0 Jinhan Wang, et al. ∙

research

∙ 12/14/2020

REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling

Accents mismatching is a critical problem for end-to-end ASR. This paper...

0 Hu Hu, et al. ∙

research

∙ 08/08/2020

Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification

The effects of speaking-style variability on automatic speaker verificat...

0 Amber Afshan, et al. ∙

research

∙ 07/27/2020

Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition

In this work, we propose a novel and efficient minimum word error rate (...

0 Jinxi Guo, et al. ∙

research

∙ 03/11/2019

Singing voice conversion with non-parallel data

Singing voice conversion is a task to convert a song sang by a source si...

0 Xin Chen, et al. ∙

research

∙ 02/19/2019

A spelling correction model for end-to-end speech recognition

Attention-based sequence-to-sequence models for speech recognition joint...

0 Jinxi Guo, et al. ∙

research

∙ 10/16/2018

Deep neural network based i-vector mapping for speaker verification using short utterances

Text-independent speaker recognition using short utterances is a highly ...

0 Jinxi Guo, et al. ∙

Jinxi Guo

Featured Co-authors

Sign in with Google

Consider DeepAI Pro