Enze Xie

research

∙ 08/26/2023

Beyond One-to-One: Rethinking the Referring Image Segmentation

Referring image segmentation aims to segment the target object referred ...

0 Yutao Hu, et al. ∙

research

∙ 07/12/2023

T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Despite the stunning ability to generate high-quality images by recent t...

0 Kaiyi Huang, et al. ∙

research

∙ 07/09/2023

Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's Eye View

Recent vision-only perception models for autonomous driving achieved pro...

1 Jiayu Yang, et al. ∙

research

∙ 07/05/2023

DiffFlow: A Unified SDE Framework for Score-Based Diffusion Models and Generative Adversarial Networks

Generative models can be categorized into two types: explicit generative...

0 Jingwei Zhang, et al. ∙

research

∙ 07/04/2023

DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation

Recent Diffusion Transformers (e.g., DiT) have demonstrated their powerf...

0 Shentong Mo, et al. ∙

research

∙ 06/28/2023

DiffComplete: Diffusion-based Generative 3D Shape Completion

We introduce a new diffusion-based approach for shape completion on 3D r...

0 Ruihang Chu, et al. ∙

research

∙ 06/07/2023

Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt

Diffusion models have attracted significant attention due to their remar...

0 Kai Chen, et al. ∙

research

∙ 05/15/2023

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

The text-driven image and video diffusion models have achieved unprecede...

0 Yuyang Zhao, et al. ∙

research

∙ 04/19/2023

MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation

Perception systems in modern autonomous driving vehicles typically take ...

0 Chongjian Ge, et al. ∙

research

∙ 04/19/2023

Progressive-Hint Prompting Improves Reasoning in Large Language Models

The performance of Large Language Models (LLMs) in reasoning tasks depen...

0 Chuanyang Zheng, et al. ∙

research

∙ 04/13/2023

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning

Diffusion models have proven to be highly effective in generating high-q...

0 Enze Xie, et al. ∙

research

∙ 04/03/2023

DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving

Safety is the primary priority of autonomous driving. Nevertheless, no p...

0 Tianqi Wang, et al. ∙

research

∙ 03/30/2023

DDP: Diffusion Model for Dense Visual Prediction

We propose a simple, efficient, yet powerful framework for dense visual ...

0 Yuanfeng Ji, et al. ∙

research

∙ 03/19/2023

Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction

Cooperatively utilizing both ego-vehicle and infrastructure sensor data ...

0 Haibao Yu, et al. ∙

research

∙ 01/29/2023

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

Recently, perception task based on Bird's-Eye View (BEV) representation ...

0 Yangguang Li, et al. ∙

research

∙ 01/19/2023

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Recently, the pure camera-based Bird's-Eye-View (BEV) perception removes...

0 Bin Huang, et al. ∙

research

∙ 05/10/2022

UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection

Recent scene text detection methods are almost based on deep learning an...

0 Youhui Guo, et al. ∙

research

∙ 04/26/2022

Understanding The Robustness in Vision Transformers

Recent studies show that Vision Transformers(ViTs) exhibit strong robust...

13 Daquan Zhou, et al. ∙

research

∙ 04/11/2022

M^2BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation

In this paper, we propose M^2BEV, a unified framework that jointly perfo...

17 Enze Xie, et al. ∙

research

∙ 04/04/2022

Improving Monocular Visual Odometry Using Learned Depth

Monocular visual odometry (VO) is an important task in robotics and comp...

0 Libo Sun, et al. ∙

research

∙ 03/31/2022

BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers

3D visual perception tasks, including 3D detection and map segmentation ...

0 Zhiqi Li, et al. ∙

research

∙ 03/16/2022

WegFormer: Transformers for Weakly Supervised Semantic Segmentation

Although convolutional neural networks (CNNs) have achieved remarkable p...

0 Chunmeng Liu, et al. ∙

research

∙ 11/03/2021

FAST: Searching for a Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation

We propose an accurate and efficient scene text detection framework, ter...

5 Zhe Chen, et al. ∙

research

∙ 09/08/2021

Panoptic SegFormer

We present Panoptic SegFormer, a general framework for end-to-end panopt...

8 Zhiqi Li, et al. ∙

research

∙ 07/21/2021

CycleMLP: A MLP-like Architecture for Dense Prediction

This paper presents a simple MLP-like architecture, CycleMLP, which is a...

0 Shoufa Chen, et al. ∙

research

∙ 05/31/2021

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

We present SegFormer, a simple, efficient yet powerful semantic segmenta...

6 Enze Xie, et al. ∙

research

∙ 05/05/2021

PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond

Reducing the complexity of the pipeline of instance segmentation is cruc...

0 Enze Xie, et al. ∙

research

∙ 03/24/2021

FakeMix Augmentation Improves Transparent Object Detection

Detecting transparent objects in natural scenes is challenging due to th...

0 Yang Cao, et al. ∙

research

∙ 03/22/2021

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization

We present an extremely simple Ultra-Resolution Style Transfer framework...

7 Zhe Chen, et al. ∙

research

∙ 03/08/2021

Unsupervised Pretraining for Object Detection by Patch Reidentification

Unsupervised representation learning achieves promising performances in ...

0 Jian Ding, et al. ∙

research

∙ 02/09/2021

DetCo: Unsupervised Contrastive Learning for Object Detection

Unsupervised contrastive learning achieves great success in learning ima...

16 Enze Xie, et al. ∙

research

∙ 01/21/2021

Trans2Seg: Transparent Object Segmentation with Transformer

This work presents a new fine-grained transparent object segmentation da...

11 Enze Xie, et al. ∙

research

∙ 12/31/2020

TransTrack: Multiple-Object Tracking with Transformer

Multiple-object tracking(MOT) is mostly dominated by complex and multi-s...

20 Peize Sun, et al. ∙

research

∙ 12/10/2020

OneNet: Towards End-to-End One-Stage Object Detection

End-to-end one-stage object detection trailed thus far. This paper disco...

0 Peize Sun, et al. ∙

research

∙ 11/26/2020

SelfText Beyond Polygon: Unconstrained Text Detection with Box Supervision and Dynamic Self-Training

Although a polygon is a more accurate representation than an upright bou...

0 Weijia Wu, et al. ∙

research

∙ 09/03/2020

Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild

Deep learning-based scene text detection can achieve preferable performa...

0 Weijia Wu, et al. ∙

research

∙ 08/03/2020

AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting

Scene text spotting aims to detect and recognize the entire word or sent...

0 Wenhai Wang, et al. ∙

research

∙ 07/23/2020

Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation

Multi-person pose estimation is challenging because it localizes body ke...

0 Sheng Jin, et al. ∙

research

∙ 05/07/2020

Scene Text Image Super-Resolution in the Wild

Low-resolution text images are often seen in natural scenes such as docu...

0 Wenjia Wang, et al. ∙

research

∙ 03/31/2020

Segmenting Transparent Objects in the Wild

Transparent objects such as windows and bottles made by glass widely exi...

0 Enze Xie, et al. ∙

research

∙ 03/17/2020

1st Place Solutions for OpenImage2019 – Object Detection and Instance Segmentation

This article introduces the solutions of the two champion teams, `MMfrui...

7 Yu Liu, et al. ∙

research

∙ 09/29/2019

PolarMask: Single Shot Instance Segmentation with Polar Representation

In this paper, we introduce an anchor-box free and single shot instance ...

23 Enze Xie, et al. ∙

research

∙ 09/16/2019

TextSR: Content-Aware Text Super-Resolution Guided by Recognition

Scene text recognition has witnessed rapid development with the advance ...

3 Wenjia Wang, et al. ∙

research

∙ 08/16/2019

Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

Scene text detection, an important step of scene text reading systems, h...

2 Wenhai Wang, et al. ∙

research

∙ 11/21/2018

Scene Text Detection with Supervised Pyramid Context Network

Scene text detection methods based on deep learning have achieved remark...

4 Enze Xie, et al. ∙

research

∙ 11/06/2018

Fast OBDD Reordering using Neural Message Passing on Hypergraph

Ordered binary decision diagrams (OBDDs) are an efficient data structure...

0 Feifan Xu, et al. ∙

research

∙ 04/11/2018

Attention Cropping: A Novel Data Augmentation Method for Real-world Plant Species Identification

This paper investigates the issue of realistic plant species identificat...

0 Qingguo Xiao, et al. ∙

Enze Xie

Featured Co-authors

Sign in with Google

Consider DeepAI Pro