While recent advancements in vision-language models have revolutionized
...
Recent text-to-image generative models can generate high-fidelity images...
Recent years have witnessed a rapid growth of deep generative models, wi...
Open-vocabulary detection (OVD) is an object detection task aiming at
de...
Biological intelligence systems of animals perceive the world by integra...
The abundance and richness of Internet photos of landmarks and cities ha...