Recent progress in using machine learning models for reasoning tasks has...
We present BlenderBot 3x, an update on the conversational model BlenderB...
In recent years, large pre-trained language models (LLMs) have demonstra...
Large language models have been shown to struggle with limited context m...
The success of transformer models trained with a language modeling objec...
Video understanding tasks take many forms, from action detection to visu...
Developing agents that can execute multiple skills by learning from
pre-...
Standard language model training employs gold human documents or human-h...
Learning a diverse set of skills by interacting with an environment with...
Current language models achieve low perplexity but their resulting
gener...
In reinforcement learning, the graph Laplacian has proved to be a valuab...
We investigate the training of sparse layers that use different paramete...
Attention mechanisms have become a standard tool for sequence modeling t...
Attention mechanisms have shown promising results in sequence modeling t...
In this work, we address the problem of image-goal navigation in the con...
Learning to navigate in a realistic setting where an agent must rely sol...
Transformers are feedforward networks that can process input tokens in
p...
Transformer networks have lead to important progress in language modelin...
We propose a novel self-attention mechanism that can learn its optimal
a...
Learning when to communicate and doing that effectively is essential in
...
In hierarchical reinforcement learning a major challenge is determining
...
A desirable property of an intelligent agent is its ability to understan...
The tasks that an agent will need to solve often are not known during
tr...
We describe a very simple bag-of-words baseline for visual question
answ...
This paper introduces MazeBase: an environment for simple 2D games, desi...
We introduce a neural network with a recurrent attention model over a
po...
The availability of large labeled datasets has allowed Convolutional Net...
Learning invariant representations from images is one of the hardest
cha...