Diffusion models have demonstrated impressive generative capabilities, b...
Decoding visual stimuli from neural responses recorded by functional Mag...
Denoising Diffusion Probabilistic Models (DDPM) have shown remarkable
ef...
The traditional methods for data compression are typically based on the
...
In this work, we study the problem of Embodied Referring Expression
Grou...
This paper attacks the problem of language-guided navigation in a new
pe...
Visual dialog is a vision-language task where an agent needs to answer a...
Knowledge-based visual question answering (VQA) is a vision-language tas...
The lottery ticket hypothesis states that sparse subnetworks exist in
ra...