Existing text-to-image diffusion models struggle to synthesize realistic...
Portrait stylization, which translates a real human face image into an
a...
Token-based masked generative models are gaining popularity for their fa...
Text-to-image generation and image captioning are recently emerged as a ...
Unsupervised fine-grained class clustering is practical yet challenging ...
We propose a deep video prediction model conditioned on a single image a...
This paper addresses the problem of manipulating images using natural
la...