Generating 3D faces from textual descriptions has a multitude of applica...
Cross-modal garment synthesis and manipulation will significantly benefi...
Recently, large-scale diffusion models, e.g., Stable Diffusion and DallE...
Recent advances in text-to-image diffusion models have achieved remarkab...
Existing text-guided image manipulation methods aim to modify the appear...
This paper presents a large-scale Chinese cross-modal dataset for benchm...
Unsupervised large-scale vision-language pre-training has shown promisin...
Wasserstein GANs (WGANs), built upon the Kantorovich-Rubinstein (KR) dua...
Recent advances in large-scale optimal transport have greatly extended i...
CycleGAN is capable of learning a one-to-one mapping between two data di...
Adam has been shown to fail to converge to the optimal solution in cert...
Human body part parsing, or human semantic part segmentation, is fundame...