Weining Wang

research

∙ 04/17/2023

VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

In this paper, we propose a Vision-Audio-Language Omni-peRception pretra...

0 Sihan Chen, et al. ∙

research

∙ 03/29/2023

Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation

As a combination of visual and audio signals, video is inherently multi-...

0 Jiawei Liu, et al. ∙

research

∙ 03/07/2023

MOSO: Decomposing MOtion, Scene and Object for Video Prediction

Motion, scene and object are three primary visual components of a video....

0 Mingzhen Sun, et al. ∙

research

∙ 08/27/2022

ℓ^2 Inference for Change Points in High-Dimensional Time Series via a Two-Way MOSUM

We propose a new inference method for multiple change-point detection in...

0 Jiaqi Li, et al. ∙

research

∙ 03/24/2022

Learning Disentangled Representation for One-shot Progressive Face Swapping

Although face swapping has attracted much attention in recent years, it ...

5 Qi Li, et al. ∙

research

∙ 01/27/2022

A projection based approach for interactive fixed effects panel data models

This paper presents a new approach to estimation and inference in panel ...

0 Georg Keilbar, et al. ∙

research

∙ 09/06/2021

Exploiting Spatial-Temporal Semantic Consistency for Video Scene Parsing

Compared with image scene parsing, video scene parsing introduces tempor...

0 Xingjian He, et al. ∙

research

∙ 07/01/2021

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation

In this paper, we propose an Omni-perception Pre-Trainer (OPT) for cross...

0 Jing Liu, et al. ∙

research

∙ 06/29/2021

Face Sketch Synthesis via Semantic-Driven Generative Adversarial Network

Face sketch synthesis has made significant progress with the development...

4 Xingqun Qi, et al. ∙

research

∙ 05/16/2021

Learning Financial Network with Focally Sparse Structure

This paper studies the estimation of network connectedness with focally ...

0 Victor Chernozhukov, et al. ∙

research

∙ 02/17/2021

Temporal Memory Attention for Video Semantic Segmentation

Video semantic segmentation requires to utilize the complex temporal rel...

17 Hao Wang, et al. ∙

research

∙ 12/16/2020

AutoCaption: Image Captioning with Neural Architecture Search

Image captioning transforms complex visual information into abstract nat...

0 Xinxin Zhu, et al. ∙

research

∙ 05/20/2019

Non-Parametric Estimation of Spot Covariance Matrix with High-Frequency Data

Estimating spot covariance is an important issue to study, especially wi...

0 Konul Mustafayeva, et al. ∙

research

∙ 06/13/2018

LASSO-Driven Inference in Time and Space

We consider the estimation and inference in a system of high-dimensional...

0 Victor Chernozhukov, et al. ∙

Weining Wang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro