Shilei Wen

research

∙ 03/28/2023

MeMaHand: Exploiting Mesh-Mano Interaction for Single Image Two-Hand Reconstruction

Existing methods proposed for hand reconstruction tasks usually paramete...

0 Congyi Wang, et al. ∙

research

∙ 03/10/2021

Beyond Self-Supervision: A Simple Yet Effective Network Distillation Alternative to Improve Backbones

Recently, research efforts have been concentrated on revealing how pre-t...

0 Cheng Cui, et al. ∙

research

∙ 10/25/2020

Coherent Loss: A Generic Framework for Stable Video Segmentation

Video segmentation approaches are of great importance for numerous visio...

0 Mingyang Qian, et al. ∙

research

∙ 10/12/2020

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching

Discriminatively localizing sounding objects in cocktail-party, i.e., mi...

0 Di Hu, et al. ∙

research

∙ 07/23/2020

PP-YOLO: An Effective and Efficient Implementation of Object Detector

Object detection is one of the most important areas in computer vision, ...

31 Xiang Long, et al. ∙

research

∙ 07/21/2020

Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement

Recently, most of the state-of-the-art human pose estimation methods are...

0 Jian Wang, et al. ∙

research

∙ 07/03/2020

Segment as Points for Efficient Online Multi-Object Tracking and Segmentation

Current multi-object tracking and segmentation (MOTS) methods follow the...

9 Zhenbo Xu, et al. ∙

research

∙ 07/03/2020

PointTrack++ for Effective Online Multi-Object Tracking and Segmentation

Multiple-object tracking and segmentation (MOTS) is a novel computer vis...

9 Zhenbo Xu, et al. ∙

research

∙ 06/08/2020

Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection

Object detection from 3D point clouds remains a challenging task, though...

0 Liang Du, et al. ∙

research

∙ 05/05/2020

NTIRE 2020 Challenge on Video Quality Mapping: Methods and Results

This paper reviews the NTIRE 2020 challenge on video quality mapping (VQ...

2 Dario Fuoli, et al. ∙

research

∙ 03/01/2020

ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection

3D object detection is an essential task in autonomous driving and robot...

6 Zhenbo Xu, et al. ∙

research

∙ 12/17/2019

Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification

Multi-label image and video classification are fundamental yet challengi...

0 Renchun You, et al. ∙

research

∙ 11/21/2019

Multi-Label Classification with Label Graph Superimposing

Images or videos always contain multiple objects or actions. Multi-label...

0 Ya Wang, et al. ∙

research

∙ 11/16/2019

Dynamic Instance Normalization for Arbitrary Style Transfer

Prior normalization methods rely on affine transformations to produce ar...

0 Yongcheng Jing, et al. ∙

research

∙ 10/14/2019

TruNet: Short Videos Generation from Long Videos via Story-Preserving Truncation

In this work, we introduce a new problem, named as story-preserving lon...

34 Fan Yang, et al. ∙

research

∙ 09/16/2019

Perspective-Guided Convolution Networks for Crowd Counting

In this paper, we propose a novel perspective-guided convolution (PGC) f...

6 Zhaoyi Yan, et al. ∙

research

∙ 09/03/2019

Image Inpainting with Learnable Bidirectional Attention Maps

Most convolutional network (CNN)-based inpainting methods adopt standard...

0 Chaohao Xie, et al. ∙

research

∙ 08/26/2019

Deep Concept-wise Temporal Convolutional Networks for Action Localization

Existing action localization approaches adopt shallow temporal convoluti...

2 Xin Li, et al. ∙

research

∙ 07/31/2019

Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition

Video Recognition has drawn great research interest and great progress h...

14 Wenhao Wu, et al. ∙

research

∙ 07/23/2019

BMN: Boundary-Matching Network for Temporal Action Proposal Generation

Temporal action proposal generation is an challenging and promising task...

1 Tianwei Lin, et al. ∙

research

∙ 05/07/2019

Adapting Image Super-Resolution State-of-the-arts and Learning Multi-model Ensemble for Video Super-Resolution

Recently, image super-resolution has been widely studied and achieved si...

0 Chao Li, et al. ∙

research

∙ 04/22/2019

STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing

Arbitrary attribute editing generally can be tackled by incorporating en...

0 Ming Liu, et al. ∙

research

∙ 01/21/2019

Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos

The task of video grounding, which temporally localizes a natural langua...

0 Dongliang He, et al. ∙

research

∙ 11/05/2018

StNet: Local and Global Spatial-Temporal Modeling for Action Recognition

Despite the success of deep learning for static image understanding, it ...

0 Dongliang He, et al. ∙

research

∙ 10/15/2018

Solution for Large-Scale Hierarchical Object Detection Datasets with Incomplete Annotation and Data Imbalance

This report demonstrates our solution for the Open Images 2018 Challenge...

0 Yuan Gao, et al. ∙

research

∙ 06/27/2018

Exploiting Spatial-Temporal Modelling and Multi-Modal Fusion for Human Action Recognition

In this report, our approach to tackling the task of ActivityNet 2018 Ki...

0 Dongliang He, et al. ∙

research

∙ 11/27/2017

Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification

Recently, substantial research effort has focused on how to apply CNNs o...

0 Xiang Long, et al. ∙

research

∙ 08/12/2017

Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification

This paper describes our solution for the video recognition task of Acti...

0 Yunlong Bian, et al. ∙

research

∙ 08/04/2017

Deep Metric Learning with Angular Loss

The modern image search system requires semantic understanding of image,...

0 Jian Wang, et al. ∙

research

∙ 07/14/2017

Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

This paper describes our solution for the video recognition task of the ...

0 Fu Li, et al. ∙

research

∙ 03/30/2017

Dynamic Computational Time for Visual Attention

We propose a dynamic computational time model to accelerate the average ...

0 Zhichao Li, et al. ∙

research

∙ 05/20/2016

Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition

A key challenge in fine-grained recognition is how to find and represent...

0 Xiao Liu, et al. ∙

Shilei Wen

Featured Co-authors

Sign in with Google

Consider DeepAI Pro