b'Kevin J. Shih'

research

∙ 01/24/2023

Multilingual Multiaccented Multispeaker TTS with RADTTS

We work to create a multilingual speech synthesis system which can gener...

0 Rohan Badlani, et al. ∙

research

∙ 10/04/2022

Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures

Human pose transfer aims to synthesize a new view of a person under a gi...

0 Nannan Li, et al. ∙

research

∙ 03/03/2022

Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows

Despite recent advances in generative modeling for text-to-speech synthe...

0 Kevin J. Shih, et al. ∙

research

∙ 08/23/2021

One TTS Alignment To Rule Them All

Speech-to-text alignment is a critical component of neural textto-speech...

0 Rohan Badlani, et al. ∙

research

∙ 01/26/2020

Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos

Unsupervised landmark learning is the task of learning semantic keypoint...

25 Aysegul Dundar, et al. ∙

research

∙ 09/06/2019

Video Interpolation and Prediction with Unsupervised Landmarks

Prediction and interpolation for long-range video data involves the comp...

1 Kevin J. Shih, et al. ∙

research

∙ 06/13/2019

Unsupervised Video Interpolation Using Cycle Consistency

Learning to synthesize high frame rate videos via interpolation requires...

1 Fitsum A. Reda, et al. ∙

research

∙ 03/07/2019

Graphical Contrastive Losses for Scene Graph Generation

Most scene graph generators use a two-stage pipeline to detect visual re...

10 Ji Zhang, et al. ∙

research

∙ 12/04/2018

Improving Semantic Segmentation via Video Propagation and Label Relaxation

Semantic segmentation requires large amounts of pixel-wise annotations t...

0 Yi Zhu, et al. ∙

research

∙ 11/28/2018

Partial Convolution based Padding

In this paper, we present a simple yet effective padding scheme that can...

6 Guilin Liu, et al. ∙

research

∙ 11/17/2018

Open-vocabulary Phrase Detection

Most existing work that grounds natural language phrases in images start...

0 Bryan A. Plummer, et al. ∙

research

∙ 11/02/2018

SDCNet: Video Prediction Using Spatially-Displaced Convolution

We present an approach for high-resolution video frame prediction by con...

6 Fitsum A. Reda, et al. ∙

research

∙ 04/20/2018

Image Inpainting for Irregular Holes Using Partial Convolutions

Existing deep learning based image inpainting methods use a standard con...

0 Guilin Liu, et al. ∙

research

∙ 12/10/2017

Learning Interpretable Spatial Operations in a Rich 3D Blocks World

In this paper, we study the problem of mapping natural language instruct...

0 Yonatan Bisk, et al. ∙

research

∙ 07/22/2015

Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization

We present a simple deep learning framework to simultaneously predict ke...

0 Kevin J. Shih, et al. ∙

Kevin J. Shih

Featured Co-authors

Sign in with Google

Consider DeepAI Pro