This paper introduces MVDiffusion, a simple yet effective multi-view ima...
This paper presents PolyDiffuse, a novel structured reconstruction algor...
Learning effective motion features is an essential pursuit of video
repr...
This paper presents a novel attention-based neural network for structure...
Few-shot semantic segmentation aims to segment novel-class objects in a ...
We propose an approach to generate images of people given a desired
appe...
Few-shot semantic segmentation aims to segment novel-class objects in a ...
Visual Semantic Embedding (VSE) is a dominant approach for vision-langua...
Learning to follow instructions is of fundamental importance to autonomo...
This paper proposes a new approach for automated floorplan reconstructio...
We propose an approach for forecasting video of complex human activity
i...