We train a language model (LM) to robustly answer multistep questions by...
Adapters present a promising solution to the catastrophic forgetting pro...
The size and the computational load of fine-tuning large-scale pre-train...
Benchmarks for language-guided embodied agents typically assume text-bas...
We use insights from research on American Sign Language (ASL) phonology ...
Household robots operate in the same space for years. Such robots
increm...
Household environments are visually diverse. Embodied agents performing
...
For vision-and-language reasoning tasks, both fully connectionist, end-t...
Task planning can require defining myriad domain knowledge about the wor...
We propose the Vision-and-Augmented-Language Transformer (VAuLT). VAuLT ...
Aligning image and text encoders from scratch using contrastive learning...
Natural language is an intuitive way for humans to communicate tasks to ...
Current state-of-the-art vision-and-language models are evaluated on tas...
A long-term goal of AI research is to build intelligent agents that can
...
Learning-based methods for training embodied agents typically require a ...
Robots operating in human spaces must be able to engage in natural langu...
Language-guided robots performing home and office tasks must navigate in...
Seemingly simple natural language requests to a robot are generally
unde...
Autonomous robot systems for applications from search and rescue to assi...
Fluent communication requires understanding your audience. In the new
co...
Successful linguistic communication relies on a shared experience of the...
We present ALFRED (Action Learning From Realistic Environments and
Direc...
Robots navigating in human environments should use language to ask for
a...
Some robots can interact with humans using natural language, and identif...
We use static object data to improve success detection for stacking obje...
While many methods for interpreting machine learning models have been
pr...
High-level human instructions often correspond to behaviors with multipl...
Natural language understanding for robotics can require substantial doma...
Language-and-vision navigation and question answering (QA) are exciting ...
Efforts are underway at UT Austin to build autonomous robot systems that...
We consider the problem of estimating a regression function in the commo...