Zilong Huang

research

∙ 07/17/2023

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

LLMs have demonstrated remarkable abilities at interacting with humans t...

0 fcq, et al. ∙

research

∙ 04/03/2023

Disentangled Pre-training for Image Matting

Image matting requires high-quality pixel-level human annotations to sup...

0 Yanda Li, et al. ∙

research

∙ 01/30/2023

SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation

Since the introduction of Vision Transformers, the landscape of many com...

0 Qiang Wan, et al. ∙

research

∙ 12/08/2022

Executing your Commands via Motion Diffusion in Latent Space

We study a challenging task, conditional human motion generation, which ...

0 Xin Chen, et al. ∙

research

∙ 10/20/2022

Coordinates Are NOT Lonely – Codebook Prior Helps Implicit Neural 3D Representations

Implicit neural 3D representation has achieved impressive results in sur...

3 Fukun Yin, et al. ∙

research

∙ 04/12/2022

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

Although vision transformers (ViTs) have achieved great success in compu...

3 Wenqiang Zhang, et al. ∙

research

∙ 06/07/2021

Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer

Very recently, Window-based Transformers, which computed self-attention ...

0 Zilong Huang, et al. ∙

research

∙ 04/02/2021

Half-Real Half-Fake Distillation for Class-Incremental Semantic Segmentation

Despite their success for semantic segmentation, convolutional neural ne...

0 Zilong Huang, et al. ∙

research

∙ 03/22/2021

Human De-occlusion: Invisible Perception and Recovery for Humans

In this paper, we tackle the problem of human de-occlusion which reasons...

0 Qiang Zhou, et al. ∙

research

∙ 09/14/2020

High-Resolution Deep Image Matting

Image matting is a key technique for image and video editing and composi...

0 Haichao Yu, et al. ∙

research

∙ 05/21/2020

Deep learning-based automated image segmentation for concrete petrographic analysis

The standard petrography test method for measuring air voids in concrete...

16 Yu Song, et al. ∙

research

∙ 04/21/2020

The 1st Agriculture-Vision Challenge: Methods and Results

The first Agriculture-Vision Challenge aims to encourage research in dev...

18 Mang Tik Chiu, et al. ∙

research

∙ 01/05/2020

Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis

The success of deep learning in visual recognition tasks has driven adva...

11 Mang Tik Chiu, et al. ∙

research

∙ 08/26/2019

SPGNet: Semantic Prediction Guidance for Scene Parsing

Multi-scale context module and single-stage encoder-decoder structure ar...

16 Bowen Cheng, et al. ∙

research

∙ 07/02/2019

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Video object segmentation (VOS) aims at pixel-level object tracking give...

4 Qiang Zhou, et al. ∙

research

∙ 11/28/2018

CCNet: Criss-Cross Attention for Semantic Segmentation

Long-range dependencies can capture useful contextual information to ben...

6 Zilong Huang, et al. ∙

research

∙ 09/17/2018

Devil in the Details: Towards Accurate Single and Multiple Human Parsing

Human parsing has received considerable interest due to its wide applica...

8 Ting Liu, et al. ∙

research

∙ 06/12/2017

Point Linking Network for Object Detection

Object detection is a core problem in computer vision. With the developm...

0 Xinggang Wang, et al. ∙

research

∙ 05/06/2017

Deep Patch Learning for Weakly Supervised Object Classification and Discovery

Patch-level image representation is very important for object classifica...

0 Peng Tang, et al. ∙

Zilong Huang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro