Understanding which concepts models can and cannot represent has been fu...
We present the Pathways Autoregressive Text-to-Image (Parti) model, whic...
Pretraining language models with next-token prediction on massive text c...
PanGEA, the Panoramic Graph Environment Annotation toolkit, is a lightwe...
Vision-and-Language Navigation wayfinding agents can be enhanced by expl...
We introduce Room-Across-Room (RxR), a new Vision-and-Language Navigatio...
Vision-and-Language Navigation (VLN) tasks such as Room-to-Room (R2R) re...
In instruction conditioned navigation, agents interpret natural language...
Image generation has been successfully cast as an autoregressive sequenc...