The open-ended Visual Question Answering (VQA) task requires AI models t...
In this paper, we study how to use masked signal modeling in vision and ...
This paper considers making active learning more sensible from a medical...
In this paper, we study the challenging instance-wise vision-language ta...
Generalized zero-shot learning (GZSL) aims at training a model that can ...
In this paper, we propose a model-based characterization of neural netwo...
Visual explanations are logical arguments based on visual features that ...
Learning representations that clearly distinguish between normal and abn...
In this paper, we utilize weight gradients from backpropagation to chara...
In this paper, we generate and control semantically interpretable filter...
We propose a perceptual video quality assessment (PVQA) metric for disto...
In this paper, we investigate the robustness of traffic sign recognition...