Decoupled Data-Based Approach for Learning to Control Nonlinear Dynamical Systems

被引：2

作者：

Wang, Ran ^{[1
]}

Parunandi, Karthikeya S. ^{[1
]}

Yu, Dan ^{[2
]}

Kalathil, Dileep ^{[3
]}

Chakravorty, Suman ^{[1
]}

机构：

[1] Texas A&M Univ, Dept Aerosp Engn, College Stn, TX 77840 USA

[2] Nanjing Univ Aeronaut & Astronaut, Coll Astronaut, Nanjing 210016, Peoples R China

[3] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77840 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2022年 / 67卷 / 07期

基金：

美国国家科学基金会;

关键词：

Heuristic algorithms; Trajectory; Approximation algorithms; Stochastic processes; Dynamic programming; Data models; Computational modeling; Reinforcement learning; stochastic control; nonlinear systems;

D O I：

10.1109/TAC.2021.3108552

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article addresses the problem of learning the optimal control policy for a nonlinear stochastic dynamical. This problem is subject to the "curse of dimensionality" associated with the dynamic programming method. This article proposes a novel decoupled data-based control (D2C) algorithm that addresses this problem using a decoupled, "open-loop-closed-loop," approach. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system. Then, closed-loop control is developed around this open-loop trajectory by linearization of the dynamics about this nominal trajectory. By virtue of linearization, a linear quadratic regulator based algorithm can be used for this closed-loop control. We show that the performance of D2C algorithm is approximately optimal. Moreover, simulation performance suggests a significant reduction in training time compared to other state-of-the-art algorithms.

引用

页码：3582 / 3589

页数：8

共 50 条

[31] Data-based Predictive Control for Networked Control Systems
Wang, Yan
Ji, Zhicheng
PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 2302 - 2305
[32] Data-Based Optimal Control of Multiagent System A Reinforcement Learning Design Approach
Zhang, Jilie
Wang, Zhanshan
Zhang, Hongwei
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (12) : 4441 - 4449
[33] Data-Based Iterative Learning Control: A Nonconservative Approach via LMI Techniques
Wang, Chenchao
Meng, Deyuan
Cheng, Long
2024 IEEE 18TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA 2024, 2024, : 653 - 658
[34] Data-based stochastic models of uncertain nonlinear systems
Hernandez-Garcia, M.
Masri, S. F.
Ghanem, R.
Arrate, F.
IUTAM SYMPOSIUM ON DYNAMICS AND CONTROL OF NONLINEAR SYSTEMS WITH UNCERTAINTY, 2007, 2 : 11 - +
[35] Data-Based Control of Feedback Linearizable Systems
Alsalti, Mohammad
Lopez, Victor G.
Berberich, Julian
Allgoewer, Frank
Mueller, Matthias A.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (11) : 7014 - 7021
[36] Input-Output Data-Based Output Antisynchronization Control of Multiagent Systems Using Reinforcement Learning Approach
Peng, Zhinan
Zhao, Yiyi
Hu, Jiangping
Luo, Rui
Ghosh, Bijoy Kumar
Nguang, Sing Kiong
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (11) : 7359 - 7367
[37] Data-based control design for nonlinear systems with recurrent neural network-based controllers
D'Amico, William
La Bella, Alessio
Dercole, Fabio
Farina, Marcello
IFAC PAPERSONLINE, 2023, 56 (02): : 6235 - 6240
[38] A data driven approach to learning dynamical systems
Brugarolas, PB
Safonov, MG
PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 4670 - 4675
[39] Data-Based Optimal Tracking Control of Nonaffine Nonlinear Discrete-Time Systems
Luo, Biao
Liu, Derong
Huang, Tingwen
Li, Chao
NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 573 - 581
[40] Data-based predictive control for networked nonlinear systems with packet dropout and measurement noise
Zhonghua Pang
Guoping Liu
Donghua Zhou
Dehui Sun
Journal of Systems Science and Complexity, 2017, 30 : 1072 - 1083

← 1 2 3 4 5 →