Humans can easily perceive the direction of sound sources in a visual sc...
The objective of this work is to explore the learning of visually ground...
The task of Visual Question Answering (VQA) is known to be plagued by th...
Human brain is continuously inundated with the multisensory information ...
The objective of this work is to localize the sound sources in visual sc...