Jing Shao

research

∙ 06/19/2023

UniG3D: A Unified 3D Object Generation Dataset

The field of generative AI has a transformative impact on various areas,...

0 Qinghong Sun, et al. ∙

research

∙ 06/11/2023

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark

Large language models have become a potential pathway toward achieving a...

0 Zhenfei Yin, et al. ∙

research

∙ 05/16/2023

Latent Distribution Adjusting for Face Anti-Spoofing

With the development of deep learning, the field of face anti-spoofing (...

0 Qinghong Sun, et al. ∙

research

∙ 04/01/2023

Mask Hierarchical Features For Self-Supervised Learning

This paper shows that Masking the Deep hierarchical features is an effic...

0 Fenggang Liu, et al. ∙

research

∙ 03/31/2023

Siamese DETR

Recent self-supervised methods are mainly designed for representation le...

3 Zeren Chen, et al. ∙

research

∙ 01/29/2023

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

Recently, perception task based on Bird's-Eye View (BEV) representation ...

0 Yangguang Li, et al. ∙

research

∙ 01/19/2023

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Recently, the pure camera-based Bird's-Eye-View (BEV) perception removes...

0 Bin Huang, et al. ∙

research

∙ 01/09/2023

Parallel Reasoning Network for Human-Object Interaction Detection

Human-Object Interaction (HOI) detection aims to learn how human interac...

0 Huan Peng, et al. ∙

research

∙ 10/22/2022

R^2F: A General Retrieval, Reading and Fusion Framework for Document-level Natural Language Inference

Document-level natural language inference (DOCNLI) is a new challenging ...

0 Hao Wang, et al. ∙

research

∙ 10/20/2022

PalGAN: Image Colorization with Palette Generative Adversarial Networks

Multimodal ambiguity and color bleeding remain challenging in colorizati...

3 Yi Wang, et al. ∙

research

∙ 09/03/2022

Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies

Existing Binary Neural Networks (BNNs) mainly operate on local convoluti...

0 Xingrun Xing, et al. ∙

research

∙ 08/05/2022

Task-Balanced Distillation for Object Detection

Mainstream object detectors are commonly constituted of two sub-tasks, i...

3 Ruining Tang, et al. ∙

research

∙ 07/14/2022

Benchmarking Omni-Vision Representation through the Lens of Visual Realms

Though impressive performance has been achieved in specific visual realm...

0 Yuanhan Zhang, et al. ∙

research

∙ 06/27/2022

ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning for Action Recognition

Capitalizing on large pre-trained models for various downstream tasks of...

7 Junting Pan, et al. ∙

research

∙ 04/27/2022

Robust Face Anti-Spoofing with Dual Probabilistic Modeling

The field of face anti-spoofing (FAS) has witnessed great progress with ...

0 Yuanhan Zhang, et al. ∙

research

∙ 04/15/2022

ERGO: Event Relational Graph Transformer for Document-level Event Causality Identification

Document-level Event Causality Identification (DECI) aims to identify ca...

14 Meiqi Chen, et al. ∙

research

∙ 04/12/2022

Few-shot Forgery Detection via Guided Adversarial Interpolation

Realistic visual media synthesis is becoming a critical societal issue w...

3 Haonan Qiu, et al. ∙

research

∙ 03/16/2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation

In computer vision, pre-training models based on large-scale supervised l...

0 Yinan He, et al. ∙

research

∙ 03/15/2022

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy

Large-scale datasets play a vital role in computer vision. Existing data...

15 Yuanhan Zhang, et al. ∙

research

∙ 03/11/2022

Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision

Contrastive Language-Image Pretraining (CLIP) has emerged as a novel par...

0 Yufeng Cui, et al. ∙

research

∙ 02/24/2022

Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction

In this paper, we propose an effective yet efficient model PAIE for both...

6 Yubo Ma, et al. ∙

research

∙ 01/18/2022

RePre: Improving Self-Supervised Vision Transformer with Reconstructive Pre-training

Recently, self-supervised vision transformers have attracted unprecedent...

6 Luya Wang, et al. ∙

research

∙ 01/16/2022

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples

Unsupervised sentence embedding aims to obtain the most appropriate embe...

2 Hao Wang, et al. ∙

research

∙ 12/15/2021

ForgeryNet – Face Forgery Analysis Challenge 2021: Methods and Results

The rapid progress of photorealistic synthesis techniques has reached a ...

0 Yinan He, et al. ∙

research

∙ 11/29/2021

A Simple Long-Tailed Recognition Baseline via Vision-Language Model

The visual world naturally exhibits a long-tailed distribution of open c...

9 Teli Ma, et al. ∙

research

∙ 10/11/2021

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Recently, large-scale Contrastive Language-Image Pre-training (CLIP) has...

0 Yangguang Li, et al. ∙

research

∙ 06/27/2021

Few-Shot Domain Expansion for Face Anti-Spoofing

Face anti-spoofing (FAS) is an indispensable and widely used module in f...

0 Bowen Yang, et al. ∙

research

∙ 03/09/2021

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis

The rapid progress of photorealistic synthesis techniques has reached at...

0 Yinan He, et al. ∙

research

∙ 02/25/2021

CelebA-Spoof Challenge 2020 on Face Anti-Spoofing: Methods and Results

As facial interaction systems are prevalently deployed, security and rel...

0 Yuanhan Zhang, et al. ∙

research

∙ 11/02/2020

PV-NAS: Practical Neural Architecture Search for Video Recognition

Recently, deep learning has been utilized to solve video recognition pro...

0 Zihao Wang, et al. ∙

research

∙ 08/19/2020

Learning Connectivity of Neural Networks from a Topological Perspective

Seeking effective neural networks is a critical and practical field in d...

39 Kun Yuan, et al. ∙

research

∙ 07/24/2020

CelebA-Spoof: Large-Scale Face Anti-Spoofing Dataset with Rich Annotations

As facial interaction systems are prevalently deployed, security and rel...

6 Yuanhan Zhang, et al. ∙

research

∙ 07/18/2020

Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware Clues

As realistic facial manipulation technologies have achieved remarkable p...

11 Yuyang Qian, et al. ∙

research

∙ 06/16/2020

1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020

This technical report introduces our winning solution to the spatio-temp...

0 Siyu Chen, et al. ∙

research

∙ 06/14/2020

Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization

Localizing persons and recognizing their actions from videos is a challe...

7 Junting Pan, et al. ∙

research

∙ 11/30/2019

Morphing and Sampling Network for Dense Point Cloud Completion

3D point cloud completion, the task of inferring the complete geometric ...

16 Minghua Liu, et al. ∙

research

∙ 10/15/2019

Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis

Semantic image synthesis aims at generating photorealistic images from s...

33 Xihui Liu, et al. ∙

research

∙ 09/12/2019

CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval

Text-image cross-modal retrieval is a challenging task in the field of l...

0 Zihao Wang, et al. ∙

research

∙ 04/02/2019

Semantics Disentangling for Text-to-Image Generation

Synthesizing photo-realistic images from text descriptions is a challeng...

0 Guojun Yin, et al. ∙

research

∙ 04/02/2019

Context and Attribute Grounded Dense Captioning

Dense captioning aims at simultaneously localizing semantic regions and ...

0 Guojun Yin, et al. ∙

research

∙ 03/11/2019

Video Generation from Single Semantic Label Map

This paper proposes the novel task of video generation conditioned on a ...

16 Junting Pan, et al. ∙

research

∙ 03/03/2019

Unsupervised Bi-directional Flow-based Video Generation from one Snapshot

Imagining multiple consecutive frames given one single snapshot is chall...

6 Lu Sheng, et al. ∙

research

∙ 03/03/2019

Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing

Referring expression grounding aims at locating certain objects or perso...

0 Xihui Liu, et al. ∙

research

∙ 09/16/2018

Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection

Multi-label image classification is a fundamental but challenging task t...

2 Yongcheng Liu, et al. ∙

research

∙ 08/28/2018

Localization Guided Learning for Pedestrian Attribute Recognition

Pedestrian attribute recognition has attracted many attentions due to it...

0 Pengze Liu, et al. ∙

research

∙ 08/16/2018

BlockQNN: Efficient Block-wise Neural Network Architecture Generation

Convolutional neural networks have gained a remarkable success in comput...

4 Zhao Zhong, et al. ∙

research

∙ 07/13/2018

Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition

Recognizing visual relationships <subject-predicate-object> among any pa...

4 Guojun Yin, et al. ∙

research

∙ 05/10/2018

Avatar-Net: Multi-scale Zero-shot Style Transfer by Feature Decoration

Zero-shot artistic style transfer is an important image synthesis proble...

2 Lu Sheng, et al. ∙

research

∙ 04/10/2018

Exploring Disentangled Feature Representation Beyond Face Identification

This paper proposes learning disentangled but complementary face feature...

2 Yu Liu, et al. ∙

research

∙ 03/22/2018

Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data

The aim of image captioning is to generate similar captions by machine a...

0 Xihui Liu, et al. ∙
