Sound event localization and detection (SELD) systems estimate
direction...
This paper summarizes the cinematic demixing (CDX) track of the Sound
De...
While direction of arrival (DOA) of sound events is generally estimated ...
Diffusion-based speech enhancement (SE) has been investigated recently, ...
This paper presents the crossing scheme (X-scheme) for improving the
per...
Audio classification and restoration are among major downstream tasks in...
We have developed a diffusion-based speech refiner that improves the
ref...
Although music is typically multi-label, many works have studied hierarc...
Although deep neural network (DNN)-based speech enhancement (SE) methods...
In this paper we propose a novel generative approach, DiffRoll, to tackl...
This report presents the Sony-TAu Realistic Spatial Soundscapes 2022
(ST...
One noted issue of vector-quantized variational autoencoder (VQ-VAE) is ...
Sound event localization and detection (SELD) involves identifying the
d...
Recording and annotating real sound events for a sound event localizatio...
While deep neural network-based music source separation (MSS) is very
ef...
Data augmentation methods have shown great importance in diverse supervi...
A deep neural network (DNN)-based speech enhancement (SE) aiming to maxi...
This report describes our systems submitted to the DCASE2021 challenge t...
This paper presents a new deep clustering (DC) method called manifold-aw...
Variational autoencoders (VAEs) often suffer from posterior collapse, wh...
Neural-network (NN)-based methods show high performance in sound event
l...
This paper proposes several improvements for music separation with deep
...
Our systems submitted to the DCASE2020 task 3: Sound Event Localization ...