Frank Keller

research

∙ 08/11/2023

Dynamic Planning with a LLM

While Large Language Models (LLMs) can solve many NLP tasks in zero-shot...

0 Gautier Dagan, et al. ∙

research

∙ 05/24/2023

Meta-Learning For Vision-and-Language Cross-lingual Transfer

Current pre-trained vison-language models (PVLMs) achieve excellent perf...

0 Hanxu Hu, et al. ∙

research

∙ 03/30/2023

Detecting and Grounding Important Characters in Visual Stories

Characters are essential to the plot of any story. Establishing the char...

0 Danyang Liu, et al. ∙

research

∙ 03/27/2023

Learning Action Changes by Measuring Verb-Adverb Textual Relationships

The goal of this work is to understand the way actions are performed in ...

0 Davide Moltisanti, et al. ∙

research

∙ 01/27/2023

Learning the Effects of Physical Actions in a Multi-modal Environment

Large Language Models (LLMs) handle physical commonsense information ina...

10 Gautier Dagan, et al. ∙

research

∙ 11/26/2022

Who are you referring to? Weakly supervised coreference resolution with multimodal grounding

Coreference resolution aims at identifying words and phrases which refer...

0 Arushi Goel, et al. ∙

research

∙ 09/30/2022

A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos

Understanding the steps required to perform a task is an important skill...

0 Anil Batra, et al. ∙

research

∙ 06/09/2022

Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition

We address the problem of data augmentation for video action recognition...

0 Shreyank N Gowda, et al. ∙

research

∙ 11/26/2021

Not All Relations are Equal: Mining Informative Labels for Scene Graph Generation

Scene graph generation (SGG) aims to capture a wide variety of interacti...

0 Arushi Goel, et al. ∙

research

∙ 11/16/2021

Film Trailer Generation via Task Decomposition

Movie trailers perform multiple functions: they introduce viewers to the...

0 Pinelopi Papalampidi, et al. ∙

research

∙ 09/14/2021

A Temporal Variational Model for Story Generation

Recent language models can generate interesting and grammatically correc...

0 David Wilmot, et al. ∙

research

∙ 09/08/2021

Memory and Knowledge Augmented Language Models for Inferring Salience in Long-Form Stories

Measuring event salience is essential in the understanding of stories. T...

0 David Wilmot, et al. ∙

research

∙ 07/27/2021

A New Split for Evaluating True Zero-Shot Action Recognition

Zero-shot action recognition is the task of classifying action categorie...

0 Shreyank N Gowda, et al. ∙

research

∙ 01/18/2021

CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition

Zero-shot action recognition is the task of recognizing action classes w...

0 Shreyank N Gowda, et al. ∙

research

∙ 12/14/2020

Movie Summarization via Sparse Graph Construction

We summarize full-length movies by creating shorter videos containing th...

0 Pinelopi Papalampidi, et al. ∙

research

∙ 10/19/2020

Heads-up! Unsupervised Constituency Parsing via Self-Attention Heads

Transformer-based pre-trained language models (PLMs) have dramatically i...

7 Bowen Li, et al. ∙

research

∙ 04/30/2020

Modelling Suspense in Short Stories as Uncertainty Reduction over Neural Representation

Suspense is a crucial ingredient of narrative fiction, engaging readers ...

0 David Wilmot, et al. ∙

research

∙ 04/27/2020

Screenplay Summarization Using Latent Narrative Structure

Most general-purpose extractive summarization models are trained on news...

0 Pinelopi Papalampidi, et al. ∙

research

∙ 08/27/2019

Movie Plot Analysis via Turning Point Identification

According to screenwriting theory, turning points (e.g., change of plans...

0 Pinelopi Papalampidi, et al. ∙

research

∙ 06/05/2019

An Imitation Learning Approach to Unsupervised Parsing

Recently, there has been an increasing interest in unsupervised parsers ...

0 Bowen Li, et al. ∙

research

∙ 04/10/2019

Cross-lingual Visual Verb Sense Disambiguation

Recent work has shown that visual context improves cross-lingual sense d...

0 Spandana Gella, et al. ∙

research

∙ 02/02/2019

Character-based Surprisal as a Model of Human Reading in the Presence of Errors

Intuitively, human readers cope easily with errors in text; typos, missp...

0 Michael Hahn, et al. ∙

research

∙ 11/14/2018

Dependency Grammar Induction with a Neural Variational Transition-based Parser

Dependency grammar induction is the task of learning dependency syntax w...

0 Bowen Li, et al. ∙

research

∙ 07/31/2018

Modeling Task Effects in Human Reading with Neural Attention

Humans read by making a sequence of fixations and saccades. They often s...

0 Michael Hahn, et al. ∙

research

∙ 08/09/2017

Extreme clicking for efficient object annotation

Manually annotating object bounding boxes is central to building compute...

0 Dim P. Papadopoulos, et al. ∙

research

∙ 07/24/2017

Image Pivoting for Learning Multilingual Multimodal Representations

In this paper we propose a model to learn multimodal multilingual repres...

0 Spandana Gella, et al. ∙

research

∙ 04/24/2017

An Analysis of Action Recognition Datasets for Language and Vision Tasks

A large amount of recent research has focused on tasks that combine lang...

0 Spandana Gella, et al. ∙

research

∙ 08/19/2016

Modeling Human Reading with Neural Attention

When humans read text, they fixate some words and skip others. However, ...

0 Michael Hahn, et al. ∙

research

∙ 03/30/2016

Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings

We introduce a new task, visual sense disambiguation for verbs: given an...

0 Spandana Gella, et al. ∙

research

∙ 02/26/2016

We don't need no bounding-boxes: Training object class detectors using only human verification

Training object class detectors typically requires a large set of images...

0 Dim P. Papadopoulos, et al. ∙

research

∙ 01/15/2016

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures

Automatic description generation from natural images is a challenging pr...

0 Raffaella Bernardi, et al. ∙

Frank Keller

Featured Co-authors

Sign in with Google

Consider DeepAI Pro