The curve-based lane representation is a popular approach in many lane
d...
A new trend in the computer vision community is to capture objects of
in...
Monocular depth estimation is known as an ill-posed task in which object...
Referring video object segmentation (RVOS) aims at segmenting an object ...
End-to-end text spotting is a vital computer vision task that aims to
in...
This paper addresses the problem of 3D referring expression comprehensio...
Monocular 3D object detection has become a mainstream approach in automa...
Existing referring understanding tasks tend to involve the detection of ...
Recent years have witnessed huge successes in 3D object detection to
rec...
In this paper, we provide an intuitive viewing to simplify the Siamese-b...
LiDAR-based 3D object detection is an indispensable task in advanced
aut...
The pioneering method for unsupervised meta-learning, CACTUs, is a
clust...
Previous works for LiDAR-based 3D object detection mainly focus on the
s...
Dominated point cloud-based 3D object detectors in autonomous driving
sc...
Existing approaches for unsupervised point cloud pre-training are constr...
Since the rise of vision-language navigation (VLN), great progress has b...
Sparsely annotated semantic segmentation (SASS) aims to train a segmenta...
Subspace clustering is a classical technique that has been widely used f...
Answering semantically-complicated questions according to an image is
ch...
Appearance and motion are two important sources of information in video
...
On existing public benchmarks, face forgery detection techniques have
ac...
Recently, numerous algorithms have been developed to tackle the problem ...
In recent years, Siamese-based trackers have achieved promising performa...
Existing RGB-D salient object detection (SOD) models usually treat RGB a...
Salient object detection (SOD), which simulates the human visual percept...
It is laborious to manually label point cloud data for training high-qua...
Visible-infrared person re-identification (VI-ReID) is a challenging
cro...
Vision-language navigation (VLN) is the task of entailing an agent to ca...
How to make a segmentation model to efficiently adapt to a specific vide...
Retinal fundus images are widely used for clinical screening and diagnos...
Coronavirus Disease 2019 (COVID-19) spread globally in early 2020, causi...
In this paper, we solve the sample shortage problem in the human parsing...
Existing LiDAR-based 3D object detectors usually focus on the single-fra...
Current popular online multi-object tracking (MOT) solutions apply singl...
We propose a new method for video object segmentation (VOS) that address...
Human parsing is for pixel-wise human semantic understanding. As human b...
Rapid progress has been witnessed for human-object interaction (HOI)
rec...
Graph convolutional neural networks (GCNs) have recently demonstrated
pr...
Magnetic resonance imaging (MRI) is a widely used neuroimaging technique...
This paper proposes a human-aware deblurring model that disentangles the...
We introduce a novel network, called CO-attention Siamese Network (COSNe...
This work proposes a novel attentive graph neural network (AGNN) for
zer...
This work proposes to combine neural networks with the compositional
hie...
Due to the advantages of real-time detection and improved performance,
s...
Person re-identification (Re-ID) aims at retrieving a person of interest...
Segmentation is a fundamental task in medical image analysis. However, m...
With the development of Siamese network based trackers, a variety of
tec...
Retinal image quality assessment (RIQA) is essential for controlling the...
Recent years have witnessed a surge in the popularity of attention mecha...
To automate the process of segmenting an anatomy of interest, we can lea...