Shruti Palaskar

research

∙ 05/24/2022

On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization

Integrating vision and language has gained notable attention following t...

0 Shruti Palaskar, et al. ∙

research

∙ 10/12/2021

Speech Summarization using Restricted Self-Attention

Speech summarization is typically performed by using a cascade of speech...

0 Roshan Sharma, et al. ∙

research

∙ 08/18/2020

How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language

Sign Language is the primary means of communication for the majority of ...

20 Amanda Duarte, et al. ∙

research

∙ 03/13/2020

ASR Error Correction and Domain Adaptation Using Machine Translation

Off-the-shelf pre-trained Automatic Speech Recognition (ASR) systems are...

7 Anirudh Mani, et al. ∙

research

∙ 06/19/2019

Multimodal Abstractive Summarization for How2 Videos

In this paper, we study abstractive summarization for open-domain videos...

0 Shruti Palaskar, et al. ∙

research

∙ 02/18/2019

Learned In Speech Recognition: Contextual Acoustic Word Embeddings

End-to-end acoustic-to-word speech recognition models have recently gain...

0 Shruti Palaskar, et al. ∙

research

∙ 11/21/2018

Learning from Multiview Correlations in Open-Domain Videos

An increasing number of datasets contain multiple views, such as video, ...

0 Nils Holzenberger, et al. ∙

research

∙ 11/09/2018

Multimodal Grounding for Sequence-to-Sequence Speech Recognition

Humans are capable of processing speech by making use of multiple sensor...

0 Ozan Caglayan, et al. ∙

research

∙ 11/01/2018

How2: A Large-scale Dataset for Multimodal Language Understanding

In this paper, we introduce How2, a multimodal collection of instruction...

0 Ramon Sanabria, et al. ∙

research

∙ 07/23/2018

Acoustic-to-Word Recognition with Sequence-to-Sequence Models

Acoustic-to-Word recognition provides a straightforward solution to end-...

0 Shruti Palaskar, et al. ∙

research

∙ 04/25/2018

End-to-End Multimodal Speech Recognition

Transcription or sub-titling of open-domain videos is still a challengin...

0 Shruti Palaskar, et al. ∙

research

∙ 02/14/2018

Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop

We summarize the accomplishments of a multi-disciplinary workshop explor...

0 Odette Scharenborg, et al. ∙

research

∙ 09/08/2017

Combining LSTM and Latent Topic Modeling for Mortality Prediction

There is a great need for technologies that can predict the mortality of...

0 Yohan Jo, et al. ∙

Shruti Palaskar

Featured Co-authors

Sign in with Google

Consider DeepAI Pro