We abstract the features (i.e. learned representations) of multi-modal d...
Considering the microphone is easily affected by noise and soundproof
ma...
From the patter of rain to the crunch of snow, the sounds we hear often
...
Dubbing is a post-production process of re-recording actors' dialogues, ...
Learning multi-modal representations is an essential step towards real-w...
Cycle consistent generative adversarial network (CycleGAN) and variation...
Recently, Convolutional Neural Network (CNN) and Long short-term memory
...
In terms of source separation task, most of deep neural networks have tw...