AVECL-UMONS database for audio-visual event classification and localization

10/02/2020
by   Mathilde Brousmiche, et al.
0

We introduce the AVECL-UMons dataset for audio-visual event classification and localization in the context of office environments. The audio-visual dataset is composed of 11 event classes recorded at several realistic positions in two different rooms. Two types of sequences are recorded according to the number of events in the sequence. The dataset comprises 2662 unilabel sequences and 2724 multilabel sequences corresponding to a total of 5.24 hours. The dataset is publicly accessible online : https://zenodo.org/record/3965492#.X09wsobgrCI.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset