Humans exploit prior knowledge to describe images, and are able to adapt...
In this paper, we present a framework for Multilingual Scene Text Visual...
In this work, we propose Text-Degradation Invariant Auto Encoder (Text-D...
Pretraining has proven successful in Document Intelligence tasks where d...
We propose a novel multimodal architecture for Scene Text Visual Questio...
The task of image-text matching aims to map representations from differe...
Explaining an image with missing or non-existent objects is known as obj...
This work investigates the problem of sketch-guided object localization
...
Low resource Handwritten Text Recognition (HTR) is a hard problem due to...
Scene text instances found in natural images carry explicit semantic
inf...
This paper presents a new model for the task of scene text visual questi...
Text contained in an image carries high-level semantics that can be expl...
This paper presents final results of ICDAR 2019 Scene Text Visual Questi...
This paper explores the possibilities of image style transfer applied to...
Current visual question answering datasets do not consider the rich sema...
Current image captioning systems perform at a merely descriptive level,
...