We study the problem of learning a range of vision-based manipulation ta...
Conventional approaches to vision-and-language navigation (VLN) are trai...
Object navigation is defined as navigating to an object of a given label...
Inspired by research in psychology, we introduce a behavioral approach f...
We propose an end-to-end deep learning model for translating free-form
n...
We present a method for generating colored 3D shapes from natural langua...
Human actions captured in video sequences are three-dimensional signals
...
Inspired by the recent success of methods that employ shape priors to ac...