Ali Furkan Biten

research

∙ 09/21/2022

Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia

Humans exploit prior knowledge to describe images, and are able to adapt...

0 Khanh Nguyen, et al. ∙

research

∙ 09/14/2022

MUST-VQA: MUltilingual Scene-text VQA

In this paper, we present a framework for Multilingual Scene Text Visual...

0 Emanuele Vivoli, et al. ∙

research

∙ 03/09/2022

Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

In this work, we propose Text-Degradation Invariant Auto Encoder (Text-D...

0 Mohamed Ali Souibgui, et al. ∙

research

∙ 02/25/2022

OCR-IDL: OCR Annotations for Industry Document Library Dataset

Pretraining has proven successful in Document Intelligence tasks where d...

0 Ali Furkan Biten, et al. ∙

research

∙ 12/23/2021

LaTr: Layout-Aware Transformer for Scene-Text VQA

We propose a novel multimodal architecture for Scene Text Visual Questio...

5 Ali Furkan Biten, et al. ∙

research

∙ 10/06/2021

Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

The task of image-text matching aims to map representations from differe...

0 Ali Furkan Biten, et al. ∙

research

∙ 10/04/2021

Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning

Explaining an image with missing or non-existent objects is known as obj...

0 Ali Furkan Biten, et al. ∙

research

∙ 09/24/2021

Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild

This work investigates the problem of sketch-guided object localization ...

0 Pau Riba, et al. ∙

research

∙ 05/11/2021

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

Low resource Handwritten Text Recognition (HTR) is a hard problem due to...

0 Mohamed Ali Souibgui, et al. ∙

research

∙ 09/21/2020

Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval

Scene text instances found in natural images carry explicit semantic inf...

12 Andrés Mafla, et al. ∙

research

∙ 06/01/2020

Multimodal grid features and cell pointers for Scene Text Visual Question Answering

This paper presents a new model for the task of scene text visual questi...

7 Lluis Gómez, et al. ∙

research

∙ 01/14/2020

Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features

Text contained in an image carries high-level semantics that can be expl...

6 Andrés Mafla, et al. ∙

research

∙ 06/30/2019

ICDAR 2019 Competition on Scene Text Visual Question Answering

This paper presents final results of ICDAR 2019 Scene Text Visual Questi...

0 Ali Furkan Biten, et al. ∙

research

∙ 06/04/2019

Selective Style Transfer for Text

This paper explores the possibilities of image style transfer applied to...

0 Raul Gomez, et al. ∙

research

∙ 05/31/2019

Scene Text Visual Question Answering

Current visual question answering datasets do not consider the rich sema...

0 Ali Furkan Biten, et al. ∙

research

∙ 04/02/2019

Good News, Everyone! Context driven entity-aware captioning for news images

Current image captioning systems perform at a merely descriptive level, ...

0 Ali Furkan Biten, et al. ∙

Ali Furkan Biten

Featured Co-authors

Sign in with Google

Consider DeepAI Pro