Autonomous robots deployed in the real world will need control policies ...
Vizier is the de-facto blackbox and hyperparameter optimization service
...
Meta-learning hyperparameter optimization (HPO) algorithms from prior
ex...
The combination of Reinforcement Learning (RL) with deep learning has le...
Approximate bi-level optimization (ABLO) consists of (outer-level)
optim...
We introduce RL-DARTS, one of the first applications of Differentiable
A...
There has recently been significant interest in training reinforcement
l...
We introduce ES-ENAS, a simple neural architecture search (NAS) algorith...
The Transformer architecture has revolutionized deep learning on sequent...
We introduce Performers, Transformer architectures which can estimate re...
We present a new paradigm for Neural ODE algorithms, calledODEtoODE, whe...
Bilevel optimization (BLO) is a popular approach with many applications
...
Transformer models have achieved state-of-the-art results across a diver...
We propose a model-free algorithm for learning efficient policies capabl...
We present a new class of stochastic, geometrically-driven optimization
...
Learning adaptable policies is crucial for robots to operate autonomousl...
A major component of overfitting in model-free reinforcement learning (R...
Zeroth-order optimization is the process of minimizing an objective f(x)...
We introduce ES-MAML, a new framework for solving the model agnostic met...
We present a new algorithm for finding compact neural networks encoding
...
Recent results in Reinforcement Learning (RL) have shown that agents wit...
Several recent papers have examined generalization in reinforcement lear...
In this work, we present our findings and experiments for stock-market
p...