Sainbayar Sukhbaatar

research

∙ 09/14/2023

A Data Source for Reasoning Embodied Agents

Recent progress in using machine learning models for reasoning tasks has...

0 Jack Lanchantin, et al. ∙

research

∙ 06/07/2023

Improving Open Language Models by Learning from Organic Interactions

We present BlenderBot 3x, an update on the conversational model BlenderB...

0 Jing Xu, et al. ∙

research

∙ 05/09/2023

Large Language Model Programs

In recent years, large pre-trained language models (LLMs) have demonstra...

0 Imanol Schlag, et al. ∙

research

∙ 05/01/2023

Learning to Reason and Memorize with Self-Notes

Large language models have been shown to struggle with limited context m...

4 Jack Lanchantin, et al. ∙

research

∙ 04/18/2023

Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions

The success of transformer models trained with a language modeling objec...

0 Lina Mezghani, et al. ∙

research

∙ 02/16/2023

MINOTAUR: Multi-task Video Grounding From Multimodal Queries

Video understanding tasks take many forms, from action detection to visu...

0 Raghav Goyal, et al. ∙

research

∙ 01/05/2023

Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

Developing agents that can execute multiple skills by learning from pre-...

0 Lina Mezghani, et al. ∙

research

∙ 11/10/2022

The CRINGE Loss: Learning what language not to model

Standard language model training employs gold human documents or human-h...

10 Leonard Adolphs, et al. ∙

research

∙ 06/23/2022

Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision

Learning a diverse set of skills by interacting with an environment with...

0 Lina Mezghani, et al. ∙

research

∙ 06/15/2022

DIRECTOR: Generator-Classifiers For Supervised Language Modeling

Current language models achieve low perplexity but their resulting gener...

11 Kushal Arora, et al. ∙

research

∙ 03/21/2022

Temporal Abstractions-Augmented Temporally Contrastive Learning: An Alternative to the Laplacian in RL

In reinforcement learning, the graph Laplacian has proved to be a valuab...

0 Akram Erraqabi, et al. ∙

research

∙ 06/08/2021

Hash Layers For Large Sparse Models

We investigate the training of sparse layers that use different paramete...

0 Stephen Roller, et al. ∙

research

∙ 06/08/2021

Staircase Attention for Recurrent Processing of Sequences

Attention mechanisms have become a standard tool for sequence modeling t...

0 Da Ju, et al. ∙

research

∙ 05/13/2021

Not All Memories are Created Equal: Learning to Forget by Expiring

Attention mechanisms have shown promising results in sequence modeling t...

0 Sainbayar Sukhbaatar, et al. ∙

research

∙ 01/13/2021

Memory-Augmented Reinforcement Learning for Image-Goal Navigation

In this work, we address the problem of image-goal navigation in the con...

15 Lina Mezghani, et al. ∙

research

∙ 04/10/2020

Learning to Visually Navigate in Photorealistic Environments Without any Supervision

Learning to navigate in a realistic setting where an agent must rely sol...

3 Lina Mezghani, et al. ∙

research

∙ 02/21/2020

Accessing Higher-level Representations in Sequential Transformers with Feedback Memory

Transformers are feedforward networks that can process input tokens in p...

5 Angela Fan, et al. ∙

research

∙ 07/02/2019

Augmenting Self-attention with Persistent Memory

Transformer networks have led to important progress in language modelin...

6 Sainbayar Sukhbaatar, et al. ∙

research

∙ 05/19/2019

Adaptive Attention Span in Transformers

We propose a novel self-attention mechanism that can learn its optimal a...

0 Sainbayar Sukhbaatar, et al. ∙

research

∙ 12/23/2018

Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks

Learning when to communicate and doing that effectively is essential in ...

0 Amanpreet Singh, et al. ∙

research

∙ 11/22/2018

Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement Learning

In hierarchical reinforcement learning a major challenge is determining ...

0 Sainbayar Sukhbaatar, et al. ∙

research

∙ 09/06/2018

Planning with Arithmetic and Geometric Attributes

A desirable property of an intelligent agent is its ability to understan...

10 David Folqué, et al. ∙

research

∙ 03/01/2018

Composable Planning with Attributes

The tasks that an agent will need to solve often are not known during tr...

0 Amy Zhang, et al. ∙

research

∙ 12/07/2015

Simple Baseline for Visual Question Answering

We describe a very simple bag-of-words baseline for visual question answ...

0 Bolei Zhou, et al. ∙

research

∙ 11/23/2015

MazeBase: A Sandbox for Learning from Games

This paper introduces MazeBase: an environment for simple 2D games, desi...

0 Sainbayar Sukhbaatar, et al. ∙

research

∙ 03/31/2015

End-To-End Memory Networks

We introduce a neural network with a recurrent attention model over a po...

0 Sainbayar Sukhbaatar, et al. ∙

research

∙ 06/09/2014

Training Convolutional Networks with Noisy Labels

The availability of large labeled datasets has allowed Convolutional Net...

0 Sainbayar Sukhbaatar, et al. ∙

research

∙ 01/15/2013

Auto-pooling: Learning to Improve Invariance of Image Features from Image Sequences

Learning invariant representations from images is one of the hardest cha...

0 Sainbayar Sukhbaatar, et al. ∙
