Paul Voigtlaender
HW Engineer at Google
Generating video stories from text prompts is a complex task. In additio...
We propose Video Localized Narratives, a new form of multimodal video
an...
Multiple existing benchmarks involve tracking and segmenting objects in ...
In this paper, we tackle video panoptic segmentation, a task that requir...
For further progress in video object segmentation (VOS), larger, more
di...
We present Siam R-CNN, a Siamese re-detection architecture which unleash...
We approach video object segmentation (VOS) by splitting the task into t...
This paper addresses the problem of object discovery from unlabeled driv...
Many of the recent successful methods for video object segmentation (VOS...
This paper extends the popular task of multi-object tracking to multi-ob...
Many high-level video understanding methods require input in the form of...
We propose to leverage a generic object tracker in order to perform obje...
We address semi-supervised video object segmentation, the task of
automa...
Deep learning requires large amounts of training data to be effective. F...
We explore object discovery and detector adaptation based on unlabeled v...
The most common paradigm for vision-based multi-object tracking is
track...
We tackle the task of semi-supervised video object segmentation, i.e.
se...
In this work we release our extensible and easily configurable neural ne...
We present a comprehensive study of deep bidirectional long short-term m...