Learning a universal policy across different robot morphologies can
sign...
While deep reinforcement learning (RL) has fueled multiple high-profile
...
Training a reinforcement learning (RL) agent on a real-world robotics ta...
Meta-gradients provide a general approach for optimizing the meta-parame...
Consistency is the theoretical property of a meta learning algorithm tha...
Mutually beneficial behavior in repeated games can be enforced via the t...
In autonomous vehicle (AV) control, allowing mistakes can be quite dange...
Neural networks are based on a simplified model of the brain. In this
pr...