The development of recommender systems that optimize multi-turn interact...
Vision-and-Language Navigation wayfinding agents can be enhanced by
expl...
Identifying a short segment in a long video that semantically matches a ...
Summarization is the task of compressing source document(s) into coheren...
We introduce Room-Across-Room (RxR), a new Vision-and-Language Navigatio...
Learning to fuse vision and language information and representing them i...
We present a multi-level geocoding model (MLG) that learns to associate ...
Uncertainty quantification is an important research area in machine lear...
Learning to follow instructions is of fundamental importance to autonomo...
Recent research efforts enable study for natural language grounded navig...
The Touchdown dataset (Chen et al., 2019) provides instructions by human...
VALAN is a lightweight and scalable software framework for deep reinforc...
We show that it is feasible to perform entity linking by training a dual...
We propose RecSim, a configurable platform for authoring simulation
envi...
Vision-and-Language Navigation (VLN) tasks such as Room-to-Room (R2R) re...
In instruction conditioned navigation, agents interpret natural language...
Vision-and-Language Navigation (VLN) is a natural language grounding tas...
Most practical recommender systems focus on estimating immediate user
en...
Advances in learning and representations have reinvigorated work that
co...
Object recognition and localization are important tasks in computer visi...