Dahun Kim

research

∙ 09/02/2023

Contrastive Feature Masking Open-Vocabulary Vision Transformer

We present Contrastive Feature Masking Vision Transformer (CFM-ViT) - an...

0 Dahun Kim, et al. ∙

research

∙ 05/11/2023

Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers

We present Region-aware Open-vocabulary Vision Transformers (RO-ViT) - a...

0 Dahun Kim, et al. ∙

research

∙ 04/12/2023

RECLIP: Resource-efficient CLIP by Training with Small Images

We present RECLIP (Resource-efficient CLIP), a simple method that minimi...

0 Runze Li, et al. ∙

research

∙ 04/10/2023

Neural Image-based Avatars: Generalizable Radiance Fields for Human Avatar Modeling

We present a method that enables synthesizing novel views and novel pose...

0 Youngjoong Kwon, et al. ∙

research

∙ 04/10/2023

Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation

Video Panoptic Segmentation (VPS) aims to achieve comprehensive pixel-le...

0 Inkyu Shin, et al. ∙

research

∙ 03/29/2023

MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks

The development of language models has moved from encoder-decoder to de...

0 Weicheng Kuo, et al. ∙

research

∙ 06/17/2022

CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-base...

0 Qihang Yu, et al. ∙

research

∙ 05/30/2022

TubeFormer-DeepLab: Video Mask Transformer

We present TubeFormer-DeepLab, the first attempt to tackle multiple core...

0 Dahun Kim, et al. ∙

research

∙ 09/15/2021

Neural Human Performer: Learning Generalizable Radiance Fields for Human Performance Rendering

In this paper, we aim at synthesizing a free-viewpoint video of an arbit...

8 Youngjoong Kwon, et al. ∙

research

∙ 08/15/2021

Learning Open-World Object Proposals without Learning to Classify

Object proposals have become an integral preprocessing step of many vis...

11 Dahun Kim, et al. ∙

research

∙ 06/17/2021

DeepLab2: A TensorFlow Library for Deep Labeling

DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a ...

0 Mark Weber, et al. ∙

research

∙ 06/17/2021

Learning to Associate Every Segment for Video Panoptic Segmentation

Temporal correspondence - linking pixels or objects across frames - is a...

0 Sanghyun Woo, et al. ∙

research

∙ 11/26/2020

The Devil is in the Boundary: Exploiting Boundary Representation for Basis-based Instance Segmentation

Pursuing a more coherent scene understanding towards real-time vision ap...

0 Myungchul Kim, et al. ∙

research

∙ 06/19/2020

Video Panoptic Segmentation

Panoptic segmentation has become a new standard of visual recognition ta...

0 Dahun Kim, et al. ∙

research

∙ 02/03/2020

Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling

Visual storytelling is a task of creating a short story based on photo s...

9 Yunjae Jung, et al. ∙

research

∙ 08/21/2019

Preserving Semantic and Temporal Consistency for Unpaired Video-to-Video Translation

In this paper, we investigate the problem of unpaired video-to-video tra...

22 KwanYong Park, et al. ∙

research

∙ 05/30/2019

Align-and-Attend Network for Globally and Locally Coherent Video Inpainting

We propose a novel feed-forward network for video inpainting. We use a s...

0 Sanghyun Woo, et al. ∙

research

∙ 05/08/2019

Deep Blind Video Decaptioning by Temporal Aggregation and Recurrence

Blind video decaptioning is a problem of automatically removing text ove...

0 Dahun Kim, et al. ∙

research

∙ 05/05/2019

Deep Video Inpainting

Video inpainting aims to fill spatio-temporal holes with plausible conte...

0 Dahun Kim, et al. ∙

research

∙ 11/24/2018

Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles

Self-supervised tasks such as colorization, inpainting and jigsaw puzzle...

0 Dahun Kim, et al. ∙

research

∙ 11/24/2018

Discriminative Feature Learning for Unsupervised Video Summarization

In this paper, we address the problem of unsupervised video summarizatio...

0 Yunjae Jung, et al. ∙

research

∙ 11/15/2018

LinkNet: Relational Embedding for Scene Graph

Objects and their relationships are critical contents for image understa...

0 Sanghyun Woo, et al. ∙

research

∙ 02/06/2018

Learning Image Representations by Completing Damaged Jigsaw Puzzles

In this paper, we explore methods of complicating self-supervised tasks ...

0 Dahun Kim, et al. ∙

research

∙ 08/07/2017

Two-Phase Learning for Weakly Supervised Object Localization

Weakly supervised semantic segmentation and localization have a proble...

0 Dahun Kim, et al. ∙

