The cornerstone of neural algorithmic reasoning is the ability to solve
...
In this report, we summarize the takeaways from the first NeurIPS 2021
N...
Proximal Policy Optimization (PPO) methods learn a policy by iteratively...
Recent multi-task learning research argues against unitary scalarization...
The progress in deep reinforcement learning (RL) is heavily driven by th...
Recent research has shown that Graph Neural Networks (GNNs) can learn
po...
Multitask Reinforcement Learning is a promising way to obtain models wit...
This paper introduces the deep coordination graph (DCG) for collaborativ...
We present GQSAT, a branching heuristic in a Boolean SAT solver trained ...
The performance of policy gradient methods is sensitive to hyperparamete...
Learning from demonstration (LfD) is useful in settings where hand-codin...
We propose CAML, a meta-learning method for fast adaptation that partiti...
Recent progress in Reinforcement Learning (RL), fueled by its combinatio...
With the rise of end-to-end learning through deep learning, person detec...