Wayne Wu

research

∙ 09/05/2023

ReliTalk: Relightable Talking Portrait Generation from a Single Video

Recent years have witnessed great progress in creating vivid audio-drive...

0 Haonan Qiu, et al. ∙

research

∙ 08/31/2023

Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis

Existing automated dubbing methods are usually designed for Professional...

0 Linsen Song, et al. ∙

research

∙ 08/05/2023

Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis

Implicit neural representations have shown powerful capacity in modeling...

0 Yuxin Wang, et al. ∙

research

∙ 05/22/2023

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars

Synthesizing high-fidelity head avatars is a central problem for compute...

3 Dongwei Pan, et al. ∙

research

∙ 04/17/2023

Text2Performer: Text-Driven Human Video Generation

Text-driven content creation has evolved to be a transformative techniqu...

0 Yuming Jiang, et al. ∙

research

∙ 04/04/2023

MonoHuman: Animatable Human Neural Field from Monocular Video

Animating virtual avatars with free-view control is crucial for various ...

0 Zhengming Yu, et al. ∙

research

∙ 03/30/2023

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling

Synthetic data has emerged as a promising source for 3D human research a...

2 Zhitao Yang, et al. ∙

research

∙ 01/18/2023

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

Recent advances in modeling 3D objects mostly rely on synthetic datasets...

4 Tong Wu, et al. ∙

research

∙ 12/14/2022

3DHumanGAN: Towards Photo-Realistic 3D-Aware Human Image Generation

We present 3DHumanGAN, a 3D-aware generative adversarial network (GAN) t...

9 Zhuoqian Yang, et al. ∙

research

∙ 12/05/2022

Audio-Driven Co-Speech Gesture Video Generation

Co-speech gesture is crucial for human-machine interaction and digital e...

3 Xian Liu, et al. ∙

research

∙ 12/03/2022

VLG: General Video Recognition with Web Textual Knowledge

Video recognition in an open and dynamic world is quite challenging, as ...

0 Jintao Lin, et al. ∙

research

∙ 10/12/2022

MotionBERT: Unified Pretraining for Human Motion Analysis

We present MotionBERT, a unified pretraining framework, to tackle differ...

12 Wentao Zhu, et al. ∙

research

∙ 08/16/2022

StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3

Realistic generative face video synthesis has long been a pursuit in bot...

9 Haonan Qiu, et al. ∙

research

∙ 07/11/2022

Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis

Video-to-Video synthesis (Vid2Vid) has achieved remarkable results in ge...

0 Long Zhuo, et al. ∙

research

∙ 06/30/2022

Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach

Generic event boundary detection (GEBD) is an important yet challenging ...

0 Jiaqi Tang, et al. ∙

research

∙ 05/31/2022

Text2Human: Text-Driven Controllable Human Image Generation

Generating high-quality and diverse human images is an important yet cha...

9 Yuming Jiang, et al. ∙

research

∙ 05/30/2022

EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model

Although significant progress has been made to audio-driven talking face...

7 Xinya Ji, et al. ∙

research

∙ 04/25/2022

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Unconditional human image generation is an important task in vision and ...

8 Jianglin Fu, et al. ∙

research

∙ 04/25/2022

Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis

This work targets at using a general deep learning framework to synthesi...

1 Wei Cheng, et al. ∙

research

∙ 04/25/2022

Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing

This paper focuses on the weakly-supervised audio-visual video parsing t...

2 Haoyue Cheng, et al. ∙

research

∙ 03/31/2022

TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

Recent advances like StyleGAN have promoted the growth of controllable f...

5 Yanbo Xu, et al. ∙

research

∙ 03/24/2022

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

Generating speech-consistent body and gesture movements is a long-standi...

0 Xian Liu, et al. ∙

research

∙ 01/19/2022

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

Animating high-fidelity video portrait with speech audio is crucial for ...

2 Xian Liu, et al. ∙

research

∙ 12/19/2021

MoCaNet: Motion Retargeting in-the-wild via Canonicalization Networks

We present a novel framework that brings the 3D motion retargeting task ...

11 Wentao Zhu, et al. ∙

research

∙ 12/09/2021

Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection

Generic event boundary detection is an important yet challenging task in...

0 Jiaqi Tang, et al. ∙

research

∙ 11/12/2021

Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data

Generative adversarial networks (GANs) typically require ample data for ...

16 Liming Jiang, et al. ∙

research

∙ 04/22/2021

Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation

While accurate lip synchronization has been achieved for arbitrary-subje...

13 Hang Zhou, et al. ∙

research

∙ 04/15/2021

Audio-Driven Emotional Video Portraits

Despite previous success in generating audio-driven talking heads, most ...

0 Xinya Ji, et al. ∙

research

∙ 04/07/2021

Everything's Talkin': Pareidolia Face Reenactment

We present a new application direction named Pareidolia Face Reenactment...

0 Linsen Song, et al. ∙

research

∙ 02/18/2021

DeeperForensics Challenge 2020 on Real-World Face Forgery Detection: Methods and Results

This paper reports methods and results in the DeeperForensics Challenge ...

12 Liming Jiang, et al. ∙

research

∙ 12/23/2020

Focal Frequency Loss for Generative Models

Despite the remarkable success of generative models in creating photorea...

23 Liming Jiang, et al. ∙

research

∙ 07/17/2020

Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation

Depth information has proven to be a useful cue in the semantic segmenta...

0 Xiaokang Chen, et al. ∙

research

∙ 05/14/2020

TAM: Temporal Adaptive Module for Video Recognition

Temporal modeling is crucial for capturing spatiotemporal structure in v...

0 Zhaoyang Liu, et al. ∙

research

∙ 03/31/2020

TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting

We present a lightweight video motion retargeting approach TransMoMo tha...

19 Zhuoqian Yang, et al. ∙

research

∙ 01/15/2020

Everybody's Talkin': Let Me Talk as You Want

We present a method to edit a target portrait footage by taking a sequen...

28 Linsen Song, et al. ∙

research

∙ 01/09/2020

DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery Detection

In this paper, we present our on-going effort of constructing a large-sc...

22 Liming Jiang, et al. ∙

research

∙ 10/26/2019

FAB: A Robust Facial Landmark Detection Framework for Motion-Blurred Videos

Recently, facial landmark detection algorithms have achieved remarkable ...

13 Keqiang Sun, et al. ∙

research

∙ 08/20/2019

Make a Face: Towards Arbitrary High Fidelity Face Manipulation

Recent studies have shown remarkable success in face manipulation task w...

24 Shengju Qian, et al. ∙

research

∙ 08/18/2019

Aggregation via Separation: Boosting Facial Landmark Detector with Semi-Supervised Style Translation

Facial landmark detection, or face alignment, is a fundamental task that...

12 Shengju Qian, et al. ∙

research

∙ 05/11/2019

Disentangling Content and Style via Unsupervised Geometry Distillation

It is challenging to disentangle an object into two orthogonal spaces of...

7 Wayne Wu, et al. ∙

research

∙ 04/21/2019

TransGaGa: Geometry-Aware Unsupervised Image-to-Image Translation

Unsupervised image-to-image translation aims at learning a mapping betwe...

48 Wayne Wu, et al. ∙

research

∙ 07/29/2018

ReenactGAN: Learning to Reenact Faces via Boundary Transfer

We present a novel learning-based framework for face reenactment. The pr...

14 Wayne Wu, et al. ∙

research

∙ 05/26/2018

Look at Boundary: A Boundary-Aware Face Alignment Algorithm

We present a novel boundary-aware face alignment algorithm by utilising ...

0 Wayne Wu, et al. ∙

Wayne Wu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro