The field of text-to-image (T2I) generation has garnered significant att...
Recent advances in text-to-image synthesis make it possible to visualize...
Language planning aims to implement complex high-level goals by decompos...
Human brains integrate linguistic and perceptual information simultaneou...
Automatic evaluations for natural language generation (NLG) conventional...
Vision-and-language navigation (VLN) is a multimodal task where an agent...
Iterative Language-Based Image Editing (IL-BIE) tasks follow iterative i...
Vision-and-Language Navigation (VLN) is a task where agents must decide ...
Given the recent successes of deep learning applied to style transfer an...