Matt Feiszli

research

∙ 08/29/2023

NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation

Until recently, the Video Instance Segmentation (VIS) community operated...

0 Tim Meinhardt, et al. ∙

research

∙ 04/12/2023

SiLK – Simple Learned Keypoints

Keypoint detection & descriptors are foundational technologies for co...

0 Pierre Gleize, et al. ∙

research

∙ 02/16/2023

MINOTAUR: Multi-task Video Grounding From Multimodal Queries

Video understanding tasks take many forms, from action detection to visu...

0 Raghav Goyal, et al. ∙

research

∙ 01/09/2023

EgoTracks: A Long-term Egocentric Visual Object Tracking Dataset

Visual object tracking is a key component to many egocentric vision prob...

0 Hao Tang, et al. ∙

research

∙ 04/12/2022

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity

Open-world instance segmentation is the task of grouping pixels into obj...

2 Weiyao Wang, et al. ∙

research

∙ 04/01/2022

Generic Event Boundary Captioning: A Benchmark for Status Changes Understanding

Cognitive science has shown that humans perceive videos in terms of even...

0 Yuxuan Wang, et al. ∙

research

∙ 11/18/2021

PyTorchVideo: A Deep Learning Library for Video Understanding

We introduce PyTorchVideo, an open-source deep-learning library that pro...

295 Haoqi Fan, et al. ∙

research

∙ 08/30/2021

Searching for Two-Stream Models in Multivariate Space for Video Recognition

Conventional video models rely on a single stream to capture the complex...

0 Xinyu Gong, et al. ∙

research

∙ 04/10/2021

Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation

Current state-of-the-art object detection and segmentation methods work ...

0 Weiyao Wang, et al. ∙

research

∙ 01/26/2021

Generic Event Boundary Detection: A Benchmark for Event Segmentation

This paper presents a novel task together with a new benchmark for detec...

11 Mike Zheng Shou, et al. ∙

research

∙ 11/22/2020

FP-NAS: Fast Probabilistic Neural Architecture Search

Differential Neural Architecture Search (NAS) requires all layer choices...

4 Zhicheng Yan, et al. ∙

research

∙ 03/15/2020

SF-Net: Single-Frame Supervision for Temporal Action Localization

In this paper, we study an intermediate form of supervision, i.e., singl...

7 Fan Ma, et al. ∙

research

∙ 01/09/2020

Don't Judge an Object by Its Context: Learning to Overcome Contextual Bias

Existing models often leverage co-occurrences between objects and their ...

32 Krishna Kumar Singh, et al. ∙

research

∙ 07/19/2019

Only Time Can Tell: Discovering Temporal Data for Temporal Modeling

Understanding temporal information and how the visual world changes over...

1 Laura Sevilla-Lara, et al. ∙

research

∙ 06/10/2019

FASTER Recurrent Networks for Video Classification

Video classification methods often divide the video into short clips, do...

0 Linchao Zhu, et al. ∙

research

∙ 06/07/2019

Video Modeling with Correlation Networks

Motion is a salient cue to recognize actions in video. Modern action rec...

0 Heng Wang, et al. ∙

research

∙ 05/29/2019

What Makes Training Multi-Modal Networks Hard?

Consider end-to-end training of a multi-modal vs. a single-modal network...

0 Weiyao Wang, et al. ∙

research

∙ 05/02/2019

Large-scale weakly-supervised pre-training for video action recognition

Current fully-supervised video datasets consist of only a few hundred th...

0 Deepti Ghadiyaram, et al. ∙

research

∙ 04/04/2019

Video Classification with Channel-Separated Convolutional Networks

Group convolution has been shown to offer great computational savings in...

0 Du Tran, et al. ∙

research

∙ 05/25/2017

Latent Geometry and Memorization in Generative Models

It can be difficult to tell whether a trained generative model has learn...

0 Matt Feiszli, et al. ∙
