Recent trends in Video Instance Segmentation (VIS) have seen a growing r...
A major goal of multimodal research is to improve machine understanding ...
Recent transformer-based offline video instance segmentation (VIS) appro...
A comprehensive representation of an image requires understanding object...
It is essential for safety-critical applications of deep neural networks...
Video Object Segmentation (VOS) has been targeted by various fully-super...
A serious problem in image classification is that a trained model might ...
Visual Question Answering (VQA) is concerned with answering free-form qu...
Identifying objects in an image and their mutual relationships as a scen...
Visual question answering is concerned with answering free-form question...
The identification of objects in an image, together with their mutual re...