Existing methods proposed for hand reconstruction tasks usually paramete...
Recently, research efforts have been concentrated on revealing how
pre-t...
Video segmentation approaches are of great importance for numerous visio...
Discriminatively localizing sounding objects in cocktail-party, i.e., mi...
Object detection is one of the most important areas in computer vision, ...
Recently, most of the state-of-the-art human pose estimation methods are...
Current multi-object tracking and segmentation (MOTS) methods follow the...
Multiple-object tracking and segmentation (MOTS) is a novel computer vis...
Object detection from 3D point clouds remains a challenging task, though...
This paper reviews the NTIRE 2020 challenge on video quality mapping (VQ...
3D object detection is an essential task in autonomous driving and robot...
Multi-label image and video classification are fundamental yet challengi...
Images or videos always contain multiple objects or actions. Multi-label...
Prior normalization methods rely on affine transformations to produce
ar...
In this work, we introduce a new problem, named as story-preserving lon...
In this paper, we propose a novel perspective-guided convolution (PGC) f...
Most convolutional network (CNN)-based inpainting methods adopt standard...
Existing action localization approaches adopt shallow temporal convoluti...
Video Recognition has drawn great research interest and great progress h...
Temporal action proposal generation is an challenging and promising task...
Recently, image super-resolution has been widely studied and achieved
si...
Arbitrary attribute editing generally can be tackled by incorporating
en...
The task of video grounding, which temporally localizes a natural langua...
Despite the success of deep learning for static image understanding, it
...
This report demonstrates our solution for the Open Images 2018 Challenge...
In this report, our approach to tackling the task of ActivityNet 2018
Ki...
Recently, substantial research effort has focused on how to apply CNNs o...
This paper describes our solution for the video recognition task of
Acti...
The modern image search system requires semantic understanding of image,...
This paper describes our solution for the video recognition task of the
...
We propose a dynamic computational time model to accelerate the average
...
A key challenge in fine-grained recognition is how to find and represent...