To learn good joint policies for multi-agent collaboration with imperfec...
The combination of deep reinforcement learning and search at both traini...
Since DeepMind's AlphaZero, Zero learning quickly became the state-of-th...
We explore using latent natural language instructions as an expressive a...
We analyze the dynamics of training deep ReLU networks and their implica...
The AlphaGo, AlphaGo Zero, and AlphaZero series of algorithms are a
rema...
In this paper, we propose ELF, an Extensive, Lightweight and Flexible
pl...