Yibing Song

research

∙ 08/22/2023

Domain Generalization via Rationale Invariance

This paper offers a new perspective to ease the challenge of domain gene...

0 Liang Chen, et al. ∙

research

∙ 07/21/2023

Advancing Visual Grounding with Scene Knowledge: Benchmark and Method

Visual grounding (VG) aims to establish fine-grained alignment between v...

0 Zhihong Chen, et al. ∙

research

∙ 07/21/2023

Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation

Parameter Efficient Tuning (PET) has gained attention for reducing the n...

0 Zunnan Xu, et al. ∙

research

∙ 04/17/2023

Efficient Video Action Detection with Token Dropout and Context Refinement

Streaming video clips with large-scale video tokens impede vision transf...

0 Lei Chen, et al. ∙

research

∙ 04/10/2023

Improved Test-Time Adaptation for Domain Generalization

The main challenge in domain generalization (DG) is to handle the distri...

0 Liang Chen, et al. ∙

research

∙ 03/30/2023

Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning

Contrastive learning methods train visual encoders by comparing views fr...

0 Chongjian Ge, et al. ∙

research

∙ 03/28/2023

CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection

The relation modeling between actors and scene context advances video ac...

0 Lei Chen, et al. ∙

research

∙ 11/17/2022

DiffusionDet: Diffusion Model for Object Detection

We propose DiffusionDet, a new framework that formulates object detectio...

0 Shoufa Chen, et al. ∙

research

∙ 05/26/2022

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition

Although the pre-trained Vision Transformers (ViTs) achieved great succe...

0 Shoufa Chen, et al. ∙

research

∙ 03/23/2022

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Pre-training video transformers on extra large-scale datasets is general...

33 Zhan Tong, et al. ∙

research

∙ 03/23/2022

Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection

Recent studies in deepfake detection have yielded promising results when...

0 Liang Chen, et al. ∙

research

∙ 02/16/2022

Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations

Vision Transformers (ViTs) take all the image patches as tokens and cons...

9 Youwei Liang, et al. ∙

research

∙ 01/28/2022

DynaMixer: A Vision MLP Architecture with Dynamic Mixing

Recently, MLP-like vision models have achieved promising performances on...

5 Ziyu Wang, et al. ∙

research

∙ 01/13/2022

MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning

Dancing video retargeting aims to synthesize a video that transfers the ...

6 Yuying Ge, et al. ∙

research

∙ 10/11/2021

Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning

Studies on self-supervised visual representation learning (SSL) improve ...

0 Chongjian Ge, et al. ∙

research

∙ 05/05/2021

PD-GAN: Probabilistic Diverse GAN for Image Inpainting

We propose PD-GAN, a probabilistic diverse GAN for image inpainting. Giv...

2 Hongyu Liu, et al. ∙

research

∙ 03/31/2021

ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows

Universal style transfer retains styles from reference images in content...

7 Jie An, et al. ∙

research

∙ 03/27/2021

IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking

Adversarial attack arises due to the vulnerability of deep neural networ...

0 Shuai Jia, et al. ∙

research

∙ 03/23/2021

DeFLOCNet: Deep Image Editing via Flexible Low-level Controls

User-intended visual content fills the hole regions of an input image in...

5 Hongyu Liu, et al. ∙

research

∙ 03/17/2021

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On

Image virtual try-on replaces the clothes on a person image with a desir...

0 Chongjian Ge, et al. ∙

research

∙ 03/10/2021

VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples

MoCo is effective for unsupervised image representation learning. In thi...

0 Tian Pan, et al. ∙

research

∙ 03/09/2021

Stabilized Medical Image Attacks

Convolutional Neural Networks (CNNs) have advanced existing medical syst...

0 Gege Qi, et al. ∙

research

∙ 03/08/2021

Parser-Free Virtual Try-on via Distilling Appearance Flows

Image virtual try-on aims to fit a garment image (target clothes) to a p...

11 Yuying Ge, et al. ∙

research

∙ 08/03/2020

Rethinking Image Deraining via Rain Streaks and Vapors

Single image deraining regards an input image as a fusion of a backgroun...

0 Yinglong Wang, et al. ∙

research

∙ 07/22/2020

Unsupervised Deep Representation Learning for Real-Time Tracking

The advancement of visual tracking has continuously been brought by deep...

23 Ning Wang, et al. ∙

research

∙ 07/20/2020

Robust Tracking against Adversarial Attacks

While deep convolutional neural networks (CNNs) are vulnerable to advers...

0 Shuai Jia, et al. ∙

research

∙ 07/14/2020

Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations

Deep encoder-decoder based CNNs have advanced image inpainting methods f...

0 Hongyu Liu, et al. ∙

research

∙ 10/25/2019

Self-supervised Learning of Detailed 3D Face Reconstruction

In this paper, we present an end-to-end learning framework for detailed ...

58 Yajing Chen, et al. ∙

research

∙ 07/23/2019

Real-Time Correlation Tracking via Joint Model Compression and Transfer

Correlation filters (CF) have received considerable attention in visual ...

3 Ning Wang, et al. ∙

research

∙ 04/09/2019

MVF-Net: Multi-View 3D Face Morphable Model Regression

We address the problem of recovering the 3D geometry of a human face fro...

10 Fanzi Wu, et al. ∙

research

∙ 04/03/2019

Unsupervised Deep Tracking

We propose an unsupervised visual tracking method in this paper. Differe...

16 Ning Wang, et al. ∙

research

∙ 11/22/2018

Joint Face Hallucination and Deblurring via Structure Generation and Detail Enhancement

We address the problem of restoring a high-resolution face image from a ...

0 Yibing Song, et al. ∙

research

∙ 10/09/2018

Deep Attentive Tracking via Reciprocative Learning

Visual attention, derived from cognitive neuroscience, facilitates human...

4 Shi Pu, et al. ∙

research

∙ 09/27/2018

Deformable Object Tracking with Gated Fusion

The tracking-by-detection framework receives growing attentions through ...

4 Wenxi Liu, et al. ∙

research

∙ 04/12/2018

Image Correction via Deep Reciprocating HDR Transformation

Image correction aims to adjust an input image into a visually pleasing ...

0 Xin Yang, et al. ∙

research

∙ 04/12/2018

VITAL: VIsual Tracking via Adversarial Learning

The tracking-by-detection framework consists of two stages, i.e., drawin...

0 Yibing Song, et al. ∙

research

∙ 08/28/2017

Stylizing Face Images via Multiple Exemplars

We address the problem of transferring the style of a headshot photo to ...

0 Yibing Song, et al. ∙

research

∙ 08/01/2017

CREST: Convolutional Residual Learning for Visual Tracking

Discriminative correlation filters (DCFs) have been shown to perform sup...

0 Yibing Song, et al. ∙

research

∙ 08/01/2017

Fast Preprocessing for Robust Face Sketch Synthesis

Exemplar-based face sketch synthesis methods usually meet the challengin...

0 Yibing Song, et al. ∙

research

∙ 08/01/2017

Learning to Hallucinate Face Images via Component Generation and Enhancement

We propose a two-stage method for face hallucination. First, we generate...

0 Yibing Song, et al. ∙

Yibing Song

Featured Co-authors

Sign in with Google

Consider DeepAI Pro