Shuyang Sun

research

∙ 07/21/2023

OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?

This paper presents OxfordTVG-HIC (Humorous Image Captions), a large-sca...

0 Runjia Li, et al. ∙

research

∙ 06/29/2023

ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation

This paper presents a new mechanism to facilitate the training of mask t...

0 Shuyang Sun, et al. ∙

research

∙ 11/29/2022

LUMix: Improving Mixup by Better Modelling Label Uncertainty

Modern deep networks can be better generalized when trained with noisy s...

0 Shuyang Sun, et al. ∙

research

∙ 03/10/2022

Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability

Large-scale pre-training has been proven to be crucial for various compu...

0 Ruifei He, et al. ∙

research

∙ 12/16/2021

Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation

Video Panoptic Segmentation (VPS) aims at assigning a class label to eac...

0 Yi Zhou, et al. ∙

research

∙ 11/18/2021

TransMix: Attend to Mix for Vision Transformers

Mixup-based augmentation has been found to be effective for generalizing...

0 Jie-Neng Chen, et al. ∙

research

∙ 08/03/2021

Vision Transformer with Progressive Sampling

Transformers with powerful global relation modeling abilities have been ...

0 Xiaoyu Yue, et al. ∙

research

∙ 07/13/2021

Visual Parser: Representing Part-whole Hierarchies with Transformers

Human vision is able to capture the part-whole hierarchical information ...

0 Shuyang Sun, et al. ∙

research

∙ 11/24/2020

Learning to Sample the Most Useful Training Patches from Images

Some image restoration tasks like demosaicing require difficult training...

0 Shuyang Sun, et al. ∙

research

∙ 09/12/2020

Exploring the Hierarchy in Relation Labels for Scene Graph Generation

By assigning each relationship a single label, current approaches formul...

11 Yi Zhou, et al. ∙

research

∙ 09/09/2019

Robust Multi-Modality Multi-Object Tracking

Multi-sensor perception is crucial to ensure the reliability and accurac...

4 Wenwei Zhang, et al. ∙

research

∙ 06/17/2019

MMDetection: Open MMLab Detection Toolbox and Benchmark

We present MMDetection, an object detection toolbox that contains a rich...

1 Kai Chen, et al. ∙

research

∙ 01/22/2019

Hybrid Task Cascade for Instance Segmentation

Cascade is a classic yet powerful architecture that has boosted performa...

6 Kai Chen, et al. ∙

research

∙ 01/11/2019

FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction

The basic principles in designing convolutional neural network (CNN) str...

0 Shuyang Sun, et al. ∙

research

∙ 11/29/2017

Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition

Motion representation plays a vital role in human action recognition in ...

0 Shuyang Sun, et al. ∙

Shuyang Sun

Featured Co-authors

Sign in with Google

Consider DeepAI Pro