Learning Min-norm Stabilizing Control Laws for Systems with Unknown Dynamics

被引:0
|
作者
Westenbroek, Tyler [1 ]
Castaneda, Fernando [2 ]
Agrawal, Ayush [2 ]
Sastry, S. Shankar [1 ]
Sreenath, Koushil [2 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA
来源
2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC) | 2020年
基金
美国国家科学基金会;
关键词
TO-STATE STABILITY; ADAPTIVE-CONTROL; LYAPUNOV;
D O I
10.1109/cdc42340.2020.9304118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a framework for learning a minimum-norm stabilizing controller for a system with unknown dynamics using model-free policy optimization methods. The approach begins by first designing a Control Lyapunov Function (CLF) for a (possibly inaccurate) dynamics model for the system, along with a function which specifies a minimum acceptable rate of energy dissipation for the CLF at different points in the state-space. Treating the energy dissipation condition as a constraint on the desired closed-loop behavior of the real-world system, we use penalty methods to formulate an unconstrained optimization problem over the parameters of a learned controller, which can be solved using model-free policy optimization algorithms using data collected from the plant. We discuss when the optimization learns a stabilizing controller for the real world system and derive conditions on the structure of the learned controller which ensure that the optimization is strongly convex, meaning the globally optimal solution can be found reliably. We validate the approach in simulation, first for a double pendulum, and then generalize the framework to learn stable walking controllers for underactuated bipedal robots using the Hybrid Zero Dynamics framework. By encoding a large amount of structure into the learning problem, we are able to learn stabilizing controllers for both systems with only minutes or even seconds of training data.
引用
收藏
页码:737 / 744
页数:8
相关论文
共 50 条
  • [1] Gaussian Process-based Min-norm Stabilizing Controller for Control-Affine Systems with Uncertain Input Effects and Dynamics
    Castaneda, Fernando
    Choi, Jason J.
    Zhang, Bike
    Tomlin, Claire J.
    Sreenath, Koushil
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3683 - 3690
  • [2] Robust Pointwise Min-Norm Control of Distributed Systems with Fluid Flow
    Igreja, Jose M.
    Lemos, Joao M.
    Costa, Sergio J.
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 2662 - 2667
  • [3] Generalized point-wise min-norm control of affine nonlinear systems
    Robotics Laboratory, Shenyang Institute of Automation, Chinese Acad. of Sci., Shenyang 110016, China
    不详
    Kong Zhi Li Lun Yu Ying Yong, 2008, 5 (962-965): : 962 - 965
  • [4] Generalized point wise min-norm control based on Control Lyapunov Functions
    He Yuqing
    Han Jianda
    PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 2, 2007, : 404 - +
  • [5] Power system stabilizer based on pointwise min-norm control law
    Erceg, Igor
    Sumina, Damir
    Tusun, Stjepan
    Kutija, Martina
    ELECTRIC POWER SYSTEMS RESEARCH, 2017, 143 : 215 - 224
  • [6] Nonlinear model predictive control enhanced by generalized pointwise min-norm scheme
    He, Yuqing
    Han, Jianda
    PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007, : 3343 - +
  • [7] Nonlinear Predictive Control of Input Constrained System Based on Generalized Pointwise Min-Norm Scheme
    He, Yuqing
    Han, Jianda
    2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 341 - 346
  • [8] Learning Conservation Laws in Unknown Quantum Dynamics
    Zhan, Yongtao
    Elben, Andreas
    Huang, Hsin-Yuan
    Tong, Yu
    PRX QUANTUM, 2024, 5 (01):
  • [9] Reinforcement Learning of Structured Stabilizing Control for Linear Systems With Unknown State Matrix
    Mukherjee, Sayak
    Vu, Thanh Long
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (03) : 1746 - 1752
  • [10] On Stabilizing Control of Gaussian Processes for Unknown Nonlinear Systems
    Ito, Yuji
    Fujimoto, Kenji
    Tadokoro, Yukihiro
    Yoshimura, Takayoshi
    IFAC PAPERSONLINE, 2017, 50 (01): : 15385 - 15390