Anyi Rao

research

∙ 08/28/2023

Automated Conversion of Music Videos into Lyric Videos

Musicians and fans often produce lyric videos, a form of music videos th...

0 Jiaju Ma, et al. ∙

research

∙ 08/07/2023

Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization

Zero-shot skeleton-based action recognition aims to recognize actions of...

0 Yujie Zhou, et al. ∙

research

∙ 07/10/2023

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

With the advance of text-to-image models (e.g., Stable Diffusion) and co...

0 Yuwei Guo, et al. ∙

research

∙ 06/05/2023

HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and Regime-Switch VAE

Factor model is a fundamental investment tool in quantitative investment...

12 Zikai Wei, et al. ∙

research

∙ 05/27/2023

CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers

Vision-language models have achieved tremendous progress far beyond what...

0 Dachuan Shi, et al. ∙

research

∙ 02/17/2023

Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences

Self-supervised learning has demonstrated remarkable capability in repre...

0 Yujie Zhou, et al. ∙

research

∙ 01/30/2023

Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production

Amateurs working on mini-films and short-form videos usually spend lots ...

1 Anyi Rao, et al. ∙

research

∙ 10/17/2022

Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows

The ability to choose an appropriate camera view among multiple cameras ...

4 Anyi Rao, et al. ∙

research

∙ 03/13/2022

AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation

Training a generalizable 3D part segmentation network is quite challengi...

0 Xueyi Liu, et al. ∙

research

∙ 12/10/2021

CityNeRF: Building NeRF at City Scale

Neural Radiance Field (NeRF) has achieved outstanding performance in mod...

27 Yuanbo Xiangli, et al. ∙

research

∙ 08/08/2020

A Unified Framework for Shot Type Classification Based on Subject Centric Lens

Shots are key narrative elements of various videos, e.g. movies, TV seri...

2 Anyi Rao, et al. ∙

research

∙ 08/08/2020

Online Multi-modal Person Search in Videos

The task of searching certain people in videos has seen increasing poten...

3 Jiangyue Xia, et al. ∙

research

∙ 07/21/2020

MovieNet: A Holistic Dataset for Movie Understanding

Recent years have seen remarkable advances in visual understanding. Howe...

6 Qingqiu Huang, et al. ∙

research

∙ 04/06/2020

A Local-to-Global Approach to Multi-modal Movie Scene Segmentation

Scene, as the crucial unit of storytelling in movies, contains complex a...

3 Anyi Rao, et al. ∙

research

∙ 03/24/2018

Automatic Music Accompanist

Automatic musical accompaniment is where a human musician is accompanied...

0 Anyi Rao, et al. ∙

research

∙ 12/19/2017

HotFlip: White-Box Adversarial Examples for NLP

Adversarial examples expose vulnerabilities of machine learning models. ...

0 Javid Ebrahimi, et al. ∙

Anyi Rao

Featured Co-authors

Sign in with Google

Consider DeepAI Pro