The Lyapunov Neural Network: Adaptive Stability Certification for Safe Learning of Dynamic Systems

08/02/2018
by Spencer M. Richards, et al.

Learning algorithms have shown considerable prowess in simulation by allowing robots to adapt to uncertain environments and improve their performance. However, such algorithms are rarely used in practice on safety-critical systems, since the learned policy typically does not yield any safety guarantees and thus the required exploration may cause physical harm to the robot or its environment. In this paper, we present a method to learn accurate safety certificates for nonlinear, closed-loop dynamic systems. Specifically, we construct a neural network Lyapunov function and a training algorithm that adapts it to the shape of the largest safe region in the state space. The algorithm relies only on knowledge of inputs and outputs of the dynamics, rather than on any specific model structure. We demonstrate our method by learning the safe region of attraction for a simulated inverted pendulum. Furthermore, we discuss how our method can be used in safe learning algorithms together with statistical models of dynamic systems.
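
As a rough illustration of what "the largest safe region" means here, the sketch below estimates a safe level set for a simulated inverted pendulum. It is not the authors' algorithm: a fixed quadratic candidate V(x) = x^T P x stands in for the neural network Lyapunov function, the decrease condition is checked in continuous time on a grid, and all pendulum parameters, controller gains, and function names are hypothetical. Unlike the method in the abstract, the candidate is built from the linearized model, although the decrease check itself only evaluates the dynamics.

```python
import math

# Hypothetical pendulum and controller parameters (illustrative, not from the paper).
MASS, LENGTH, GRAVITY, FRICTION = 0.15, 0.5, 9.81, 0.1
K_THETA, K_OMEGA, U_MAX = 3.0, 0.6, 0.5
INERTIA = MASS * LENGTH ** 2


def dynamics(theta, omega):
    """Closed-loop dynamics; theta = 0 is upright and the torque is saturated."""
    torque = max(-U_MAX, min(U_MAX, -K_THETA * theta - K_OMEGA * omega))
    accel = (MASS * GRAVITY * LENGTH * math.sin(theta)
             - FRICTION * omega + torque) / INERTIA
    return omega, accel


def quadratic_candidate():
    """Solve A^T P + P A = -I for the linearization A = [[0, 1], [-a, -d]]."""
    a = (K_THETA - MASS * GRAVITY * LENGTH) / INERTIA
    d = (FRICTION + K_OMEGA) / INERTIA
    p12 = 1.0 / (2.0 * a)
    p22 = (0.5 + p12) / d
    p11 = d * p12 + a * p22
    return p11, p12, p22


P11, P12, P22 = quadratic_candidate()


def V(theta, omega):
    """Quadratic Lyapunov candidate (stand-in for a learned function)."""
    return P11 * theta ** 2 + 2.0 * P12 * theta * omega + P22 * omega ** 2


def V_dot(theta, omega):
    """Time derivative of V along the closed-loop dynamics."""
    d_theta, d_omega = dynamics(theta, omega)
    return (2.0 * (P11 * theta + P12 * omega) * d_theta
            + 2.0 * (P12 * theta + P22 * omega) * d_omega)


def largest_safe_level(n=50, theta_max=1.0, omega_max=4.0):
    """Largest c such that every grid point with V < c satisfies V_dot < 0."""
    c = math.inf
    for i in range(-n, n + 1):
        for j in range(-n, n + 1):
            if i == 0 and j == 0:
                continue  # V_dot is zero at the equilibrium itself
            theta, omega = theta_max * i / n, omega_max * j / n
            # Boundary points cap c so the level set stays inside the grid.
            on_boundary = abs(i) == n or abs(j) == n
            if on_boundary or V_dot(theta, omega) >= 0.0:
                c = min(c, V(theta, omega))
    return c


c_safe = largest_safe_level()
print(f"certified level on the grid: V(x) < {c_safe:.4f}")
```

With a fixed quadratic shape, the certified level set is limited by the first grid point where the decrease condition fails, even if the true region of attraction extends much further in other directions. That gap is the motivation for adapting the shape of the Lyapunov function, which is what the abstract describes.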

research
05/23/2017

Safe Model-based Reinforcement Learning with Stability Guarantees

Reinforcement learning is a powerful paradigm for learning optimal polic...
research
11/10/2019

Synthesis of Feedback Controller for Nonlinear Control Systems with Optimal Region of Attraction

The problem of computing and characterizing Region of Attraction (ROA) w...
research
04/08/2022

Barrier Bayesian Linear Regression: Online Learning of Control Barrier Conditions for Safety-Critical Control of Uncertain Systems

In this work, we consider the problem of designing a safety filter for a...
research
02/24/2022

Data-Driven Safety Verification for Legged Robots

Planning safe motions for legged robots requires sophisticated safety ve...
research
01/15/2021

Scalable Learning of Safety Guarantees for Autonomous Systems using Hamilton-Jacobi Reachability

Autonomous systems like aircraft and assistive robots often operate in s...
research
04/24/2023

Quality-Diversity Optimisation on a Physical Robot Through Dynamics-Aware and Reset-Free Learning

Learning algorithms, like Quality-Diversity (QD), can be used to acquire...
research
01/16/2016

Engineering Safety in Machine Learning

Machine learning algorithms are increasingly influencing our decisions a...
