Shansong Liu

research

∙ 09/18/2023

HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond

This paper introduces the HumTrans dataset, which is publicly available ...

0 Shansong Liu, et al. ∙

research

∙ 09/18/2023

Unified Pretraining Target Based Video-music Retrieval With Music Rhythm And Video Optical Flow Information

Background music (BGM) can enhance the video's emotion. However, selecti...

0 Tianjun Mao, et al. ∙

research

∙ 08/22/2023

Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning

Text-to-music generation (T2M-Gen) faces a major obstacle due to the sca...

0 Shansong Liu, et al. ∙

research

∙ 06/28/2022

A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion

Typically, singing voice conversion (SVC) depends on an embedding vector...

0 Xu Li, et al. ∙

research

∙ 03/19/2022

Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition

Articulatory features are inherently invariant to acoustic signal distor...

0 Shujie Hu, et al. ∙

research

∙ 01/15/2022

Recent Progress in the CUHK Dysarthric Speech Recognition System

Despite the rapid progress of automatic speech recognition (ASR) technol...

0 Shansong Liu, et al. ∙

research

∙ 01/14/2022

Investigation of Data Augmentation Techniques for Disordered Speech Recognition

Disordered speech recognition is a highly challenging task. The underlyi...

0 Mengzhe Geng, et al. ∙

research

∙ 01/14/2022

Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition

Automatic recognition of disordered speech remains a highly challenging ...

0 Mengzhe Geng, et al. ∙

research

∙ 01/08/2022

Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks

State-of-the-art automatic speech recognition (ASR) system development i...

0 Shoukang Hu, et al. ∙

research

∙ 08/02/2021

Adversarial Data Augmentation for Disordered Speech Recognition

Automatic recognition of disordered speech remains a highly challenging ...

0 Zengrui Jin, et al. ∙

research

∙ 02/09/2021

Bayesian Transformer Language Models for Speech Recognition

State-of-the-art neural language models (LMs) represented by Transformer...

0 Boyang Xue, et al. ∙

research

∙ 07/17/2020

Neural Architecture Search for Speech Recognition

Deep neural networks (DNNs) based automatic speech recognition (ASR) sys...

0 Shoukang Hu, et al. ∙

research

∙ 01/06/2020

Audio-visual Recognition of Overlapped speech for the LRS2 dataset

Automatic recognition of overlapped speech remains a highly challenging ...

0 Jianwei Yu, et al. ∙

Shansong Liu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro