Speech Denoising in the Waveform Domain with Self-Attention

02/15/2022
by   Zhifeng Kong, et al.
6

In this work, we present CleanUNet, a causal speech denoising model on the raw waveform. The proposed model is based on an encoder-decoder architecture combined with several self-attention blocks to refine its bottleneck representations, which is crucial to obtain good results. The model is optimized through a set of losses defined over both waveform and multi-resolution spectrograms. The proposed method outperforms the state-of-the-art models in terms of denoised speech quality from various objective and subjective evaluation metrics.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset