Finding the right sound effects (SFX) to match moments in a video is a
d...
Spoken language recognition (SLR) is the task of automatically identifyi...
We propose a self-supervised approach for learning to perform audio sour...
Multi-modal contrastive learning techniques in the audio-text domain hav...
Consumer-grade music recordings such as those captured by mobile devices...
Tag-based music retrieval is crucial to browse large-scale music librari...
The mood of a song is a highly relevant feature for exploration and
reco...
Online audio advertising is a particular form of advertising used abunda...
The lack of data tends to limit the outcomes of deep learning research -...