Zhiding Yu

research

∙ 08/08/2023

FocalFormer3D: Focusing on Hard Instance for 3D Object Detection

False negatives (FN) in 3D object detection, e.g., missing predictions o...

0 Yilun Chen, et al. ∙

research

∙ 08/04/2023

FB-BEV: BEV Representation from Forward-Backward View Transformations

View Transformation Module (VTM), where transformations happen between m...

0 Zhiqi Li, et al. ∙

research

∙ 07/04/2023

FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation

This technical report summarizes the winning solution for the 3D Occupan...

0 Zhiqi Li, et al. ∙

research

∙ 06/27/2023

Differentially Private Video Activity Recognition

In recent years, differential privacy has seen significant advancements ...

0 Zelun Luo, et al. ∙

research

∙ 05/03/2023

Real-Time Radiance Fields for Single-Image Portrait View Synthesis

We present a one-shot method to infer and render a photorealistic 3D rep...

4 Alex Trevithick, et al. ∙

research

∙ 03/04/2023

Prismer: A Vision-Language Model with An Ensemble of Experts

Recent vision-language models have shown impressive multi-modal generati...

12 Shikun Liu, et al. ∙

research

∙ 02/23/2023

VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion

Humans can easily imagine the complete 3D geometry of occluded objects a...

11 Yiming Li, et al. ∙

research

∙ 02/09/2023

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning

Augmenting pretrained language models (LMs) with a vision encoder (e.g.,...

0 Zhuolin Yang, et al. ∙

research

∙ 01/10/2023

Vision Transformers Are Good Mask Auto-Labelers

We propose Mask Auto-Labeler (MAL), a high-quality Transformer-based mas...

13 Shiyi Lan, et al. ∙

research

∙ 10/23/2022

1st Place Solution of The Robust Vision Challenge (RVC) 2022 Semantic Segmentation Track

This report describes the winning solution to the semantic segmentation ...

20 Junfei Xiao, et al. ∙

research

∙ 09/15/2022

Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

Pre-trained vision-language models (e.g., CLIP) have shown promising zer...

29 Manli Shu, et al. ∙

research

∙ 08/21/2022

PointDP: Diffusion-driven Purification against Adversarial Attacks on 3D Point Cloud Recognition

3D Point cloud is becoming a critical data representation in many real-w...

5 Jiachen Sun, et al. ∙

research

∙ 08/03/2022

MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training

We propose MinVIS, a minimal video instance segmentation (VIS) framework...

25 De-An Huang, et al. ∙

research

∙ 07/04/2022

How Much More Data Do I Need? Estimating Requirements for Downstream Tasks

Given a small training data set and a learning algorithm, how much more ...

63 Rafid Mahmood, et al. ∙

research

∙ 05/27/2022

Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions

A significant gap remains between today's visual pattern recognition mod...

16 Huaizu Jiang, et al. ∙

research

∙ 04/26/2022

Understanding The Robustness in Vision Transformers

Recent studies show that Vision Transformers (ViTs) exhibit strong robust...

13 Daquan Zhou, et al. ∙

research

∙ 04/24/2022

RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning

Reasoning about visual relationships is central to how humans interpret ...

6 Xiaojian Ma, et al. ∙

research

∙ 04/11/2022

M^2BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation

In this paper, we propose M^2BEV, a unified framework that jointly perfo...

17 Enze Xie, et al. ∙

research

∙ 02/24/2022

FreeSOLO: Learning to Segment Objects without Annotations

Instance segmentation is a fundamental vision task that aims to recogniz...

59 Xinlong Wang, et al. ∙

research

∙ 10/26/2021

AugMax: Adversarial Composition of Random Augmentations for Robust Training

Data augmentation is a simple yet effective way to improve the robustnes...

27 Haotao Wang, et al. ∙

research

∙ 09/08/2021

Panoptic SegFormer

We present Panoptic SegFormer, a general framework for end-to-end panopt...

8 Zhiqi Li, et al. ∙

research

∙ 06/22/2021

Towards Reducing Labeling Cost in Deep Object Detection

Deep neural networks have reached very high accuracy on object detection...

7 Ismail Elezi, et al. ∙

research

∙ 06/17/2021

SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies

Generalization has been a long-standing challenge for reinforcement lear...

3 Linxi Fan, et al. ∙

research

∙ 06/09/2021

Practical Machine Learning Safety: A Survey and Primer

The open-world deployment of Machine Learning (ML) algorithms in safety-...

0 Sina Mohseni, et al. ∙

research

∙ 05/31/2021

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

We present SegFormer, a simple, efficient yet powerful semantic segmenta...

6 Enze Xie, et al. ∙

research

∙ 05/13/2021

DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision

We introduce DiscoBox, a novel framework that jointly learns instance se...

8 Shiyi Lan, et al. ∙

research

∙ 04/12/2021

Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection

Training on datasets with long-tailed distributions has been challenging...

27 Nadine Chang, et al. ∙

research

∙ 04/06/2021

Contrastive Syn-to-Real Generalization

Training on synthetic data can be beneficial for label or data-scarce sc...

7 Wuyang Chen, et al. ∙

research

∙ 10/21/2020

UFO^2: A Unified Framework towards Omni-supervised Object Detection

Existing work on object detection often relies on a single form of annot...

0 Zhongzheng Ren, et al. ∙

research

∙ 10/08/2020

Distributionally Robust Learning for Unsupervised Domain Adaptation

We propose a distributionally robust learning (DRL) method for unsupervi...

2 Haoxuan Wang, et al. ∙

research

∙ 10/02/2020

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning

Humans have an inherent ability to learn novel concepts from only a few ...

10 Weili Nie, et al. ∙

research

∙ 08/21/2020

Delving Deeper into Anti-aliasing in ConvNets

Aliasing refers to the phenomenon that high frequency signals degenerate...

41 Xueyan Zou, et al. ∙

research

∙ 07/20/2020

Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification

Although significant progress has been witnessed in supervised person ...

11 Yang Zou, et al. ∙

research

∙ 07/17/2020

Unsupervised Controllable Generation with Self-Training

Recent generative adversarial networks (GANs) are able to generate impre...

23 Grigorios G. Chrysos, et al. ∙

research

∙ 07/17/2020

Neural Networks with Recurrent Generative Feedback

Neural networks are vulnerable to input perturbations such as additive n...

22 Yujia Huang, et al. ∙

research

∙ 07/14/2020

Transposer: Universal Texture Synthesis Using Feature Maps as Transposed Convolution Filter

Conventional CNNs for texture synthesis consist of a sequence of (de)-co...

2 Guilin Liu, et al. ∙

research

∙ 07/14/2020

Automated Synthetic-to-Real Generalization

Models trained on synthetic images often face degraded generalization to...

9 Wuyang Chen, et al. ∙

research

∙ 06/28/2020

Uncertainty-aware multi-view co-training for semi-supervised medical image segmentation and domain adaptation

Although having achieved great success in medical image segmentation, de...

3 Yingda Xia, et al. ∙

research

∙ 04/09/2020

Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection

Weakly supervised learning has emerged as a compelling tool for object d...

2 Zhongzheng Ren, et al. ∙

research

∙ 12/04/2019

Angular Visual Hardness

Although convolutional neural networks (CNNs) are inspired by the mechan...

66 Beidi Chen, et al. ∙

research

∙ 08/26/2019

Confidence Regularized Self-Training

Recent advances in domain adaptation show that deep self-training presen...

13 Yang Zou, et al. ∙

research

∙ 06/12/2019

Compressive Hyperspherical Energy Minimization

Recent work on minimum hyperspherical energy (MHE) has demonstrated its ...

0 Rongmei Lin, et al. ∙

research

∙ 04/15/2019

Joint Discriminative and Generative Learning for Person Re-identification

Person re-identification (re-id) remains challenging due to significant ...

20 Zhedong Zheng, et al. ∙

research

∙ 11/28/2018

Partial Convolution based Padding

In this paper, we present a simple yet effective padding scheme that can...

6 Guilin Liu, et al. ∙

research

∙ 10/18/2018

Domain Adaptation for Semantic Segmentation via Class-Balanced Self-Training

Recent deep networks achieved state of the art performance on a variety ...

0 Yang Zou, et al. ∙

research

∙ 08/06/2018

Simultaneous Edge Alignment and Learning

Edge detection is among the most fundamental vision problems for its rol...

6 Zhiding Yu, et al. ∙

research

∙ 05/23/2018

Learning towards Minimum Hyperspherical Energy

Neural networks are a powerful class of nonlinear functions that can be ...

0 Weiyang Liu, et al. ∙

research

∙ 04/22/2018

Decoupled Networks

Inner product-based convolution has been a central component of convolut...

0 Weiyang Liu, et al. ∙

research

∙ 04/05/2018

Learning Strict Identity Mappings in Deep Residual Networks

A family of super deep networks, referred to as residual networks or Res...

0 Xin Yu, et al. ∙

research

∙ 11/08/2017

Deep Hyperspherical Learning

Convolution as inner product has been the founding basis of convolutiona...

0 Weiyang Liu, et al. ∙
