Xian Shi

research

∙ 09/11/2023

SlideSpeech: A Large-Scale Slide-Enriched Audio-Visual Corpus

Multi-Modal automatic speech recognition (ASR) techniques aim to leverag...

0 Haoxu Wang, et al. ∙

research

∙ 08/07/2023

SeACo-Paraformer: A Non-Autoregressive ASR System with Flexible and Effective Hotword Customization Ability

Hotword customization is one of the important issues remained in ASR fie...

0 Xian Shi, et al. ∙

research

∙ 05/18/2023

FunASR: A Fundamental End-to-End Speech Recognition Toolkit

This paper introduces FunASR, an open-source speech recognition toolkit ...

0 Zhifu Gao, et al. ∙

research

∙ 05/18/2023

Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System

Estimating confidence scores for recognition results is a classic task i...

0 Xian Shi, et al. ∙

research

∙ 01/29/2023

Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model

Conventional ASR systems use frame-level phoneme posterior to conduct fo...

0 Xian Shi, et al. ∙

research

∙ 05/02/2022

Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding

Semantic understanding of 3D point cloud relies on learning models with ...

5 Xian Shi, et al. ∙

research

∙ 04/07/2022

Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition

General accent recognition (AR) models tend to directly extract low-leve...

0 Qijie Shao, et al. ∙

research

∙ 04/07/2021

Darts-Conformer: Towards Efficient Gradient-Based Neural Architecture Search For End-to-End ASR

Neural architecture search (NAS) has been successfully applied to tasks ...

0 Xian Shi, et al. ∙

research

∙ 02/20/2021

The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods

The variety of accents has posed a big challenge to speech recognition. ...

0 Xian Shi, et al. ∙

research

∙ 01/18/2021

Label-Efficient Point Cloud Semantic Segmentation: An Active Learning Approach

Semantic segmentation of 3D point clouds relies on training deep models ...

15 Xian Shi, et al. ∙

research

∙ 11/17/2020

Cascade RNN-Transducer: Syllable Based Streaming On-device Mandarin Speech Recognition with a Syllable-to-Character Converter

End-to-end models are favored in automatic speech recognition (ASR) beca...

0 Xiong Wang, et al. ∙

research

∙ 09/10/2020

CAD-PU: A Curvature-Adaptive Deep Learning Solution for Point Set Upsampling

Point set is arguably the most direct approximation of an object or scen...

5 Jiehong Lin, et al. ∙

research

∙ 07/12/2020

The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results

Code-switching (CS) is a common phenomenon and recognizing CS speech is ...

0 Xian Shi, et al. ∙

research

∙ 06/06/2018

The stabilizer for n-qubit symmetric states

The stabilizer group for an n-qubit state |ϕ〉 is the set of all invertib...

0 Xian Shi, et al. ∙

Xian Shi

Featured Co-authors

Sign in with Google

Consider DeepAI Pro