Recent image enhancement methods have shown the advantages of using a pa...
Denoising diffusion models have enabled high-quality image generation an...
Instructional videos are an important resource to learn procedural tasks...
Novel view synthesis from a single input image is a challenging task, wh...
The popularity of Neural Radiance Fields (NeRFs) for view synthesis has ...
Neural Radiance Fields (NeRFs) have emerged as a popular approach for no...
There is limited understanding of the information captured by deep spati...
Data augmentation is a key element for training accurate models by reduc...
Deep spatiotemporal models are used in a variety of computer vision task...
In this paper, we study the problem of procedure planning in instruction...
Probabilistic embeddings have proven useful for capturing polysemous wor...
This paper presents an approach to estimating the continuous 6-DoF pose ...
Few-shot video object segmentation (FS-VOS) aims at segmenting video fra...
Existing weakly or semi-supervised semantic segmentation methods utilize...
In this work, we consider the problem of sequence-to-sequence alignment ...
In this paper, we present a strategy for training convolutional neural n...
In this paper, we challenge the common assumption that collapsing the sp...
We introduce a weakly supervised method for representation learning base...
Video understanding calls for a model to learn the characteristic interp...
In contrast to fully connected networks, Convolutional Neural Networks (...
Contrasting the previous evidence that neurons in the later layers of a ...
For humans, visual understanding is inherently generative: given a 3D sh...
Normalizing flows are a class of probabilistic generative models which a...
In this paper, we present a strategy for training convolutional neural n...
Capturing photographs with wrong exposures remains a major source of err...
Event cameras are vision sensors that record asynchronous streams of per...
We propose a new architecture that learns to attend to different Convolu...
Recently, much progress has been made building systems that can capture ...
An intelligent observer looks at the world and sees not only what is, bu...
We introduce a two-stream model for dynamic texture synthesis. Our model...
This paper presents a novel approach to estimating the continuous six de...
To improve software quality, one needs to build test scenarios resemblin...
This paper addresses the challenge of 3D human pose estimation from a si...
Recently, convolutional networks (convnets) have proven useful for predi...
This paper proposes a new deep convolutional neural network (DCNN) archi...
This paper presents a new state-of-the-art for document image classifica...