Recently, there has been a growing trend toward feature-based approaches...
Existing end-to-end Multi-Object Tracking (e2e-MOT) methods have not
sur...
Sequential video understanding, as an emerging video understanding task,...
This is a brief technical report of our proposed method for Multiple-Obj...
This paper tackles an emerging and challenging vision-language task, 3D
...
Online action detection aims at the accurate action prediction of the cu...
In this paper, we propose a novel sequence verification task that aims t...
Referring Image Segmentation (RIS) aims at segmenting the target object ...
Anomaly detection in medical images refers to the identification of abno...
An LBYL (`Look Before You Leap') Network is proposed for end-to-end trai...
This paper proposes a framework for the interactive video object segment...
Almost all existing amodal segmentation methods make the inferences of
o...
Spatial Description Resolution, as a language-guided localization task, ...
Anomaly detection in retinal image refers to the identification of
abnor...
Cameras are prevalent in our daily lives, and enable many useful systems...
In this paper, we present a novel framework to detect line segments in
m...
Diabetic Retinopathy (DR) is a non-negligible eye disease among patients...
Anomaly detection in videos refers to the identification of events that ...