Rongzhi Gu

research

∙ 08/31/2023

ReZero: Region-customizable Sound Extraction

We introduce region-customizable sound extraction (ReZero), a general an...

0 Rongzhi Gu, et al. ∙

research

∙ 08/21/2023

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression

Echo cancellation and noise reduction are essential for full-duplex comm...

0 Hangting Chen, et al. ∙

research

∙ 08/14/2023

The Sound Demixing Challenge 2023 x2013 Cinematic Demixing Track

This paper summarizes the cinematic demixing (CDX) track of the Sound De...

0 Stefan Uhlich, et al. ∙

research

∙ 04/17/2023

Fast Random Approximation of Multi-channel Room Impulse Response

Modern neural-network-based speech processing systems are typically requ...

0 Yi Luo, et al. ∙

research

∙ 02/27/2023

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty

Multi-channel speech separation using speaker's directional information ...

0 Rongzhi Gu, et al. ∙

research

∙ 12/16/2022

Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation

Recently, frequency domain all-neural beamforming methods have achieved ...

0 Rongzhi Gu, et al. ∙

research

∙ 10/28/2022

Parameter-efficient transfer learning of pre-trained Transformer models for speaker verification using adapters

Recently, the pre-trained Transformer models have received a rising inte...

0 Junyi Peng, et al. ∙

research

∙ 05/03/2022

Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention

Hand-crafted spatial features, such as inter-channel intensity differenc...

0 Xinmeng Xu, et al. ∙

research

∙ 04/15/2022

Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction

Dominant researches adopt supervised training for speaker extraction, wh...

0 Zifeng Zhao, et al. ∙

research

∙ 04/04/2022

Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches

Recently, end-to-end speaker extraction has attracted increasing attenti...

0 Zifeng Zhao, et al. ∙

research

∙ 03/31/2022

Learning Decoupling Features Through Orthogonality Regularization

Keyword spotting (KWS) and speaker verification (SV) are two important t...

0 Li Wang, et al. ∙

research

∙ 08/12/2021

Text Anchor Based Metric Learning for Small-footprint Keyword Spotting

Keyword Spotting (KWS) remains challenging to achieve the trade-off betw...

0 Li Wang, et al. ∙

research

∙ 04/26/2021

Complex Neural Spatial Filter: Enhancing Multi-channel Target Speech Separation in Complex Domain

To date, mainstream target speech separation (TSS) approaches are formul...

0 Rongzhi Gu, et al. ∙

research

∙ 04/08/2021

Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency

Transformer-based self-supervised models are trained as feature extracto...

0 Jinchuan Tian, et al. ∙

research

∙ 03/16/2020

Multi-modal Multi-channel Target Speech Separation

Target speech separation refers to extracting a target speaker's voice f...

0 Rongzhi Gu, et al. ∙

research

∙ 03/09/2020

Enhancing End-to-End Multi-channel Speech Separation via Spatial Feature Learning

Hand-crafted spatial features (e.g., inter-channel phase difference, IPD...

0 Rongzhi Gu, et al. ∙

research

∙ 01/02/2020

Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation

Target speech separation refers to extracting the target speaker's speec...

0 Rongzhi Gu, et al. ∙

research

∙ 05/17/2019

A comprehensive study of speech separation: spectrogram vs waveform separation

Speech separation has been studied widely for single-channel close-talk ...

0 Fahimeh Bahmaninezhad, et al. ∙

research

∙ 05/15/2019

End-to-End Multi-Channel Speech Separation

The end-to-end approach for single-channel speech separation has been st...

0 Rongzhi Gu, et al. ∙

Rongzhi Gu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro