We introduce Text2Cinemagraph, a fully automated method for creating
cin...
Existing 3D-from-2D generators are typically designed for well-curated
s...
We propose a novel approach for unsupervised 3D animation of non-rigid
d...
Modern 3D-GANs synthesize geometry and texture by training on large-scal...
Existing 3D-aware image synthesis approaches mainly focus on generating ...
The two popular datasets ScanRefer [16] and ReferIt3D [3] connect natura...
In this work, we present a novel framework built to simplify 3D asset
ge...
There has been a recent explosion of impressive generative models that c...
While recent large-scale video-language pre-training made great progress...
Point cloud registration is a crucial problem in computer vision and
rob...
To achieve accurate 3D object detection at a low cost for autonomous dri...
Robotic peg-in-hole assembly remains a challenging task due to its high
...
Generating images from hand-drawings is a crucial and fundamental task i...
Current image-to-image translation methods formulate the task with
condi...
Creating and editing the shape and color of 3D objects require tremendou...
Most methods for conditional video synthesis use a single modality as th...
In the field of domain adaptation, a trade-off exists between the model
...
Recently, StyleGAN has enabled various image manipulation and editing ta...
Despite the success of deep learning on supervised point cloud semantic
...
We tackle a new problem of semantic view synthesis – generating
free-vie...
In recent years, few-shot learning problems have received a lot of atten...
We study explainable AI (XAI) on the face recognition task, particul...
Few-shot classification aims to recognize novel categories with only a few...
Graphic design is essential for visual communication with layouts being
...
Dancing to music is an instinctive move by humans. Learning to model the...
Spatial audio is an essential medium to audiences for 3D visual and audi...
Image-to-image translation aims to learn the mapping between two visual
...
Most conditional generation tasks expect diverse outputs given a single
...
Image-to-image translation aims to learn the mapping between two visual
...
We present an unsupervised representation learning approach using videos...
Numerous embedding models have been recently explored to incorporate sem...