Tejas Srinivasan

research

∙ 04/04/2023

I2I: Initializing Adapters with Improvised Knowledge

Adapters present a promising solution to the catastrophic forgetting pro...

5 Tejas Srinivasan, et al. ∙

research

∙ 02/27/2023

Multimodal Speech Recognition for Language-Guided Embodied Agents

Benchmarks for language-guided embodied agents typically assume text-bas...

0 Allen Chang, et al. ∙

research

∙ 08/18/2022

VAuLT: Augmenting the Vision-and-Language Transformer with the Propagation of Deep Language Representations

We propose the Vision-and-Augmented-Language Transformer (VAuLT). VAuLT ...

30 Georgios Chochlakis, et al. ∙

research

∙ 07/29/2022

Curriculum Learning for Data-Efficient Vision-Language Alignment

Aligning image and text encoders from scratch using contrastive learning...

10 Tejas Srinivasan, et al. ∙

research

∙ 06/18/2022

CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks

Current state-of-the-art vision-and-language models are evaluated on tas...

7 Tejas Srinivasan, et al. ∙

research

∙ 04/18/2021

Worst of Both Worlds: Biases Compound in Pre-trained Vision-and-Language Models

Numerous works have analyzed biases in vision and pre-trained language m...

13 Tejas Srinivasan, et al. ∙

research

∙ 11/02/2020

Reasoning Over History: Context Aware Visual Dialog

While neural models have been shown to exhibit strong performance on sin...

1 Muhammad A. Shah, et al. ∙

research

∙ 10/16/2020

Multimodal Speech Recognition with Unstructured Audio Masking

Visual context has been shown to be useful for automatic speech recognit...

16 Tejas Srinivasan, et al. ∙

research

∙ 10/05/2020

Fine-Grained Grounding for Multimodal Speech Recognition

Multimodal automatic speech recognition systems integrate information fr...

0 Tejas Srinivasan, et al. ∙

research

∙ 02/13/2020

Looking Enhances Listening: Recovering Missing Speech Using Images

Speech is understood better by using visual context; for this reason, th...

0 Tejas Srinivasan, et al. ∙

research

∙ 10/27/2019

Multitask Learning For Different Subword Segmentations In Neural Machine Translation

In Neural Machine Translation (NMT) the usage of subwords and characters...

0 Tejas Srinivasan, et al. ∙

research

∙ 07/23/2019

Structured Fusion Networks for Dialog

Neural dialog models have exhibited strong performance, however their en...

0 Shikib Mehri, et al. ∙

research

∙ 06/30/2019

Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions

Multimodal learning allows us to leverage information from multiple sour...

0 Tejas Srinivasan, et al. ∙

