We present a simple yet effective self-supervised pre-training method fo...
Effective modeling of complex spatiotemporal dependencies in long-form v...
To help customers make better-informed viewing choices, video-streaming
...
The learning objective of vision-language approach of CLIP does not
effe...
Existing approaches for Structure from Motion (SfM) produce impressive 3...
Automatic understanding of movie-scenes is an important problem with mul...
Scenes play a crucial role in breaking the storyline of movies and TV
ep...
We identify a novel instance of the background subtraction problem that
...
Kernel approximation using randomized feature maps has recently gained a...