There is no settled universal 3D representation for geometry with many a...
A diffusion model learns to predict a vector field of gradients. We prop...
We propose Fast text2StyleGAN, a natural language interface that adapts ...
Modern 3D computer vision leverages learning to boost geometric reasonin...
Existing work on sign language translation, that is, translation from sig...
Supervised or weakly supervised methods for phrase localization (textual...
Natural language processing for sign language video, including tasks li...
We propose Neural Neighbor Style Transfer (NNST), a pipeline that offers...
Camera calibration is integral to robotics and computer vision algorithm...
Fingerspelling, in which words are signed letter by letter, is an import...
Self-supervised monocular depth and ego-motion estimation is a promising...
We study image segmentation from an information-theoretic perspective, p...
Self-supervised learning has emerged as a powerful tool for depth and eg...
We develop and evaluate captioning models that allow control of caption ...
The core of our approach, Pixel Consensus Voting, is a framework for ins...
This paper presents a framework for the analysis of changes in visual st...
Sign language recognition is a challenging gesture sequence recognition ...
Style transfer algorithms strive to render the content of one image usin...
We propose a novel architecture for the problem of video super-resoluti...
We address the problem of American Sign Language fingerspelling recognit...
We consider how image super-resolution (SR) can contribute to an object ...
The feed-forward architectures of recently proposed deep super-resolutio...
As an agent moves through the world, the apparent motion of scene elemen...