In many applications of machine learning, like drug discovery and materi...
The mixing time of the Markov chain induced by a policy limits performan...
Multi-head, key-value attention is the backbone of the widely successful...
Gradient-based meta-learners such as Model-Agnostic Meta-Learning (MAML)...
Goal-directed Reinforcement Learning (RL) traditionally considers an age...