Sparse algorithms offer great flexibility for multi-view temporal percep...
Annotating data for supervised learning is expensive and tedious, and we...
Bird-eye-view (BEV) based methods have made great progress recently in
m...
The goal of contrastive learning based pre-training is to leverage large...
Video object detection is a fundamental problem in computer vision and h...
Tremendous efforts have been made to improve mask localization accuracy ...
Deep convolution-based single image super-resolution (SISR) networks emb...
Recent works in multiple object tracking use sequence model to calculate...
Traditional video compression technologies have been developed over deca...
Object detection and instance segmentation are two fundamental computer
...
Sparsity and varied density are two of the main obstacles for 3D detecti...
More powerful feature representations derived from deep neural networks
...
Recent cutting-edge feature aggregation paradigms for video object detec...
Video object segmentation (VOS) aims at pixel-level object tracking give...
Letting a deep network be aware of the quality of its own predictions is...
Long-range dependencies can capture useful contextual information to ben...
Traditional multiple object tracking methods divide the task into two pa...
Many standard robotic platforms are equipped with at least a fixed 2D la...
How can a single fully convolutional neural network (FCN) perform on obj...
This paper proposes a reconfigurable model to recognize and detect multi...