Srikar Appalaraju

research

∙ 06/02/2023

DocFormerv2: Local Features for Document Understanding

We propose DocFormerv2, a multi-modal transformer for Visual Document Un...

0 Srikar Appalaraju, et al. ∙

research

∙ 11/15/2022

YORO – Lightweight End to End Visual Grounding

We present YORO - a multi-modal transformer encoder-only architecture fo...

0 Chih-Hui Ho, et al. ∙

research

∙ 06/16/2022

MixGen: A New Multi-Modal Data Augmentation

Data augmentation is a necessity to enhance data efficiency in deep lear...

36 Xiaoshuai Hao, et al. ∙

research

∙ 03/30/2022

Towards Differential Relational Privacy and its use in Question Answering

Memorization of the relation between entities in a dataset can lead to p...

0 Simone Bombari, et al. ∙

research

∙ 12/23/2021

LaTr: Layout-Aware Transformer for Scene-Text VQA

We propose a novel multimodal architecture for Scene Text Visual Questio...

5 Ali Furkan Biten, et al. ∙

research

∙ 06/22/2021

DocFormer: End-to-End Transformer for Document Understanding

We present DocFormer – a multi-modal transformer based architecture for ...

1 Srikar Appalaraju, et al. ∙

research

∙ 12/01/2020

Towards Good Practices in Self-supervised Representation Learning

Self-supervised representation learning has seen remarkable progress in ...

8 Srikar Appalaraju, et al. ∙

research

∙ 02/12/2020

Hierarchical Auto-Regressive Model for Image Compression Incorporating Object Saliency and a Deep Perceptual Loss

We propose a new end-to-end trainable model for lossy image compression ...

0 Yash Patel, et al. ∙

research

∙ 11/28/2019

Unbiased Evaluation of Deep Metric Learning Algorithms

Deep metric learning (DML) is a popular approach for image retrieval, s...

0 Istvan Fehervari, et al. ∙

research

∙ 08/09/2019

Human Perceptual Evaluations for Image Compression

Recently, there has been much interest in deep learning techniques to do...

0 Yash Patel, et al. ∙

research

∙ 07/18/2019

Deep Perceptual Compression

Several deep learned lossy compression techniques have been proposed in ...

5 Yash Patel, et al. ∙

research

∙ 11/19/2018

Scalable Logo Recognition using Proxies

Logo recognition is the task of identifying and classifying logos. Logo ...

16 Istvan Fehervari, et al. ∙

research

∙ 09/26/2017

Image similarity using Deep CNN and Curriculum Learning

Image similarity involves fetching similar looking images given a refere...

0 Srikar Appalaraju, et al. ∙
