Learning Min-norm Stabilizing Control Laws for Systems with Unknown Dynamics

被引：0

作者：

Westenbroek, Tyler ^{[1
]}

Castaneda, Fernando ^{[2
]}

Agrawal, Ayush ^{[2
]}

Sastry, S. Shankar ^{[1
]}

Sreenath, Koushil ^{[2
]}

机构：

[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

[2] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA

来源：

2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC) | 2020年

基金：

美国国家科学基金会;

关键词：

TO-STATE STABILITY; ADAPTIVE-CONTROL; LYAPUNOV;

D O I：

10.1109/cdc42340.2020.9304118

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper introduces a framework for learning a minimum-norm stabilizing controller for a system with unknown dynamics using model-free policy optimization methods. The approach begins by first designing a Control Lyapunov Function (CLF) for a (possibly inaccurate) dynamics model for the system, along with a function which specifies a minimum acceptable rate of energy dissipation for the CLF at different points in the state-space. Treating the energy dissipation condition as a constraint on the desired closed-loop behavior of the real-world system, we use penalty methods to formulate an unconstrained optimization problem over the parameters of a learned controller, which can be solved using model-free policy optimization algorithms using data collected from the plant. We discuss when the optimization learns a stabilizing controller for the real world system and derive conditions on the structure of the learned controller which ensure that the optimization is strongly convex, meaning the globally optimal solution can be found reliably. We validate the approach in simulation, first for a double pendulum, and then generalize the framework to learn stable walking controllers for underactuated bipedal robots using the Hybrid Zero Dynamics framework. By encoding a large amount of structure into the learning problem, we are able to learn stabilizing controllers for both systems with only minutes or even seconds of training data.

引用

页码：737 / 744

页数：8

共 50 条

[1] Gaussian Process-based Min-norm Stabilizing Controller for Control-Affine Systems with Uncertain Input Effects and Dynamics
Castaneda, Fernando
Choi, Jason J.
Zhang, Bike
Tomlin, Claire J.
Sreenath, Koushil
2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3683 - 3690
[2] Robust Pointwise Min-Norm Control of Distributed Systems with Fluid Flow
Igreja, Jose M.
Lemos, Joao M.
Costa, Sergio J.
2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 2662 - 2667
[3] Generalized point-wise min-norm control of affine nonlinear systems
Robotics Laboratory, Shenyang Institute of Automation, Chinese Acad. of Sci., Shenyang 110016, China
不详
Kong Zhi Li Lun Yu Ying Yong, 2008, 5 (962-965): : 962 - 965
[4] Generalized point wise min-norm control based on Control Lyapunov Functions
He Yuqing
Han Jianda
PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 2, 2007, : 404 - +
[5] Power system stabilizer based on pointwise min-norm control law
Erceg, Igor
Sumina, Damir
Tusun, Stjepan
Kutija, Martina
ELECTRIC POWER SYSTEMS RESEARCH, 2017, 143 : 215 - 224
[6] Nonlinear model predictive control enhanced by generalized pointwise min-norm scheme
He, Yuqing
Han, Jianda
PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007, : 3343 - +
[7] Nonlinear Predictive Control of Input Constrained System Based on Generalized Pointwise Min-Norm Scheme
He, Yuqing
Han, Jianda
2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 341 - 346
[8] Learning Conservation Laws in Unknown Quantum Dynamics
Zhan, Yongtao
Elben, Andreas
Huang, Hsin-Yuan
Tong, Yu
PRX QUANTUM, 2024, 5 (01):
[9] Reinforcement Learning of Structured Stabilizing Control for Linear Systems With Unknown State Matrix
Mukherjee, Sayak
Vu, Thanh Long
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (03) : 1746 - 1752
[10] On Stabilizing Control of Gaussian Processes for Unknown Nonlinear Systems
Ito, Yuji
Fujimoto, Kenji
Tadokoro, Yukihiro
Yoshimura, Takayoshi
IFAC PAPERSONLINE, 2017, 50 (01): : 15385 - 15390

← 1 2 3 4 5 →