Referring expression segmentation aims to segment an object described by...
Learning with large-scale unlabeled data has become a powerful tool for
...
We present TFGM (Training Free Graph Matching), a framework to boost the...
We introduce NExT-QA, a rigorously designed video question answering
(Vi...
In this paper, we explore a novel task named visual Relation Grounding i...