DeepAI

AI Chat AI Image Generator AI Video AI Music Generator

Nonsmooth optimal value and policy functions for mechanical systems subject to unilateral constraints

10/18/2017

∙

by Bora S. Banjanin, et al.

∙

∙

State-of-the-art approaches to optimal control of contact-rich robot dynamics use smooth approximations of value and policy functions and gradient-based algorithms for improving approximator parameters. Unfortunately, the dynamics of mechanical systems subject to unilateral constraints--i.e. robot locomotion and manipulation--are generally nonsmooth. We show that value and policy functions generally inherit regularity properties like (non)smoothness from the underlying system's dynamics, and demonstrate this effect in a simple mechanical system. We conclude with a discussion of implications for the use of gradient-based algorithms for optimal control of contact-rich robot dynamics.

Bora S. Banjanin
1 publication
Sam A. Burden
1 publication

research

∙ 09/11/2021

Bundled Gradients through Contact via Randomized Smoothing

The empirical success of derivative-free methods in reinforcement learni...

0 H. J. Terry Suh, et al. ∙

research

∙ 01/11/2022

ValueNetQP: Learned one-step optimal control for legged locomotion

Optimal control is a successful approach to generate motions for complex...

0 Julian Viereck, et al. ∙

research

∙ 03/29/2021

Fundamental Challenges in Deep Learning for Stiff Contact Dynamics

Frictional contact has been extensively studied as the core underlying b...

0 Mihir Parmar, et al. ∙

research

∙ 01/02/2020

Thresholds of descending algorithms in inference problems

We review recent works on analyzing the dynamics of gradient-based algor...

27 Stefano Sarao Mannelli, et al. ∙

research

∙ 10/10/2020

Online Optimal Control with Affine Constraints

This paper considers online optimal control with affine constraints on t...

0 Yingying Li, et al. ∙

research

∙ 09/25/2018

Sampling-based Polytopic Trees for Approximate Optimal Control of Piecewise Affine Systems

Piecewise affine (PWA) systems are widely used to model highly nonlinear...

0 Sadra Sadraddini, et al. ∙

research

∙ 07/09/2019

Control of Painlevé Paradox in a Robotic System

The Painlevé paradox is a phenomenon that causes instability in mechanic...

0 Davide Marchese, et al. ∙