Decoupled Data-Based Approach for Learning to Control Nonlinear Dynamical Systems

Cited by: 2
Authors
Wang, Ran [1]
Parunandi, Karthikeya S. [1]
Yu, Dan [2]
Kalathil, Dileep [3]
Chakravorty, Suman [1]
Affiliations
[1] Texas A&M Univ, Dept Aerosp Engn, College Stn, TX 77840 USA
[2] Nanjing Univ Aeronaut & Astronaut, Coll Astronaut, Nanjing 210016, Peoples R China
[3] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77840 USA
Funding
National Science Foundation (USA);
Keywords
Heuristic algorithms; Trajectory; Approximation algorithms; Stochastic processes; Dynamic programming; Data models; Computational modeling; Reinforcement learning; stochastic control; nonlinear systems;
DOI
10.1109/TAC.2021.3108552
Chinese Library Classification
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
This article addresses the problem of learning the optimal control policy for a nonlinear stochastic dynamical system. This problem is subject to the "curse of dimensionality" associated with the dynamic programming method. This article proposes a novel decoupled data-based control (D2C) algorithm that addresses this problem using a decoupled, "open-loop-closed-loop," approach. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system. Then, closed-loop control is developed around this open-loop trajectory by linearization of the dynamics about this nominal trajectory. By virtue of linearization, a linear quadratic regulator based algorithm can be used for this closed-loop control. We show that the performance of the D2C algorithm is approximately optimal. Moreover, simulation performance suggests a significant reduction in training time compared to other state-of-the-art algorithms.
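As a rough illustration of the closed-loop step described in the abstract, the following Python sketch linearizes a black-box simulator about a nominal trajectory by finite differences and computes time-varying LQR gains with a backward Riccati recursion. It is not the authors' implementation: the pendulum model `step`, the cost weights, the horizon, and the constant nominal control `U_nom` (a placeholder standing in for the result of the open-loop trajectory optimization) are all assumptions made for this example.

```python
import numpy as np

def step(x, u, dt=0.05):
    """Hypothetical black-box simulator: damped pendulum, x = [angle, rate]."""
    theta, omega = x
    omega_next = omega + dt * (-np.sin(theta) - 0.1 * omega + u[0])
    theta_next = theta + dt * omega
    return np.array([theta_next, omega_next])

def rollout(x0, U):
    """Simulate the noise-free system under the control sequence U."""
    X = [x0]
    for u in U:
        X.append(step(X[-1], u))
    return np.array(X)

def linearize(x, u, eps=1e-5):
    """Central finite-difference Jacobians (A, B) of the black-box step."""
    n, m = x.size, u.size
    A, B = np.zeros((n, n)), np.zeros((n, m))
    for i in range(n):
        d = np.zeros(n)
        d[i] = eps
        A[:, i] = (step(x + d, u) - step(x - d, u)) / (2 * eps)
    for j in range(m):
        d = np.zeros(m)
        d[j] = eps
        B[:, j] = (step(x, u + d) - step(x, u - d)) / (2 * eps)
    return A, B

def tvlqr_gains(X, U, Q, R, Qf):
    """Backward Riccati recursion along the nominal trajectory (X, U)."""
    P = Qf
    K = [None] * len(U)
    for t in reversed(range(len(U))):
        A, B = linearize(X[t], U[t])
        K[t] = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K[t])
    return K

# Assumed setup: horizon, start state, and a placeholder nominal control.
T = 60
x0 = np.array([0.0, 0.0])
U_nom = np.full((T, 1), 0.2)  # stand-in for the open-loop optimization result
X_nom = rollout(x0, U_nom)
K = tvlqr_gains(X_nom, U_nom, Q=np.eye(2), R=0.1 * np.eye(1), Qf=10 * np.eye(2))

def track(x_init, gains=None):
    """Terminal deviation from the nominal, with or without feedback."""
    x = x_init
    for t in range(T):
        u = U_nom[t] if gains is None else U_nom[t] - gains[t] @ (x - X_nom[t])
        x = step(x, u)
    return np.linalg.norm(x - X_nom[-1])

x_pert = x0 + np.array([0.2, 0.0])  # perturbed initial state
print("open-loop terminal error:  ", track(x_pert))
print("closed-loop terminal error:", track(x_pert, K))
```

The feedback law here is `u_t = U_nom[t] - K[t] @ (x_t - X_nom[t])`, so on the nominal trajectory the controller applies exactly the open-loop control and only acts on deviations from it.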
Pages: 3582 - 3589
Number of pages: 8
Related papers
50 in total
  • [31] Data-based Predictive Control for Networked Control Systems
    Wang, Yan
    Ji, Zhicheng
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 2302 - 2305
  • [32] Data-Based Optimal Control of Multiagent System: A Reinforcement Learning Design Approach
    Zhang, Jilie
    Wang, Zhanshan
    Zhang, Hongwei
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (12) : 4441 - 4449
  • [33] Data-Based Iterative Learning Control: A Nonconservative Approach via LMI Techniques
    Wang, Chenchao
    Meng, Deyuan
    Cheng, Long
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA 2024, 2024, : 653 - 658
  • [34] Data-based stochastic models of uncertain nonlinear systems
    Hernandez-Garcia, M.
    Masri, S. F.
    Ghanem, R.
    Arrate, F.
    IUTAM SYMPOSIUM ON DYNAMICS AND CONTROL OF NONLINEAR SYSTEMS WITH UNCERTAINTY, 2007, 2 : 11 - +
  • [35] Data-Based Control of Feedback Linearizable Systems
    Alsalti, Mohammad
    Lopez, Victor G.
    Berberich, Julian
    Allgoewer, Frank
    Mueller, Matthias A.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (11) : 7014 - 7021
  • [36] Input-Output Data-Based Output Antisynchronization Control of Multiagent Systems Using Reinforcement Learning Approach
    Peng, Zhinan
    Zhao, Yiyi
    Hu, Jiangping
    Luo, Rui
    Ghosh, Bijoy Kumar
    Nguang, Sing Kiong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (11) : 7359 - 7367
  • [37] Data-based control design for nonlinear systems with recurrent neural network-based controllers
    D'Amico, William
    La Bella, Alessio
    Dercole, Fabio
    Farina, Marcello
    IFAC PAPERSONLINE, 2023, 56 (02) : 6235 - 6240
  • [38] A data driven approach to learning dynamical systems
    Brugarolas, PB
    Safonov, MG
    PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 4670 - 4675
  • [39] Data-Based Optimal Tracking Control of Nonaffine Nonlinear Discrete-Time Systems
    Luo, Biao
    Liu, Derong
    Huang, Tingwen
    Li, Chao
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 573 - 581
  • [40] Data-based predictive control for networked nonlinear systems with packet dropout and measurement noise
    Pang, Zhonghua
    Liu, Guoping
    Zhou, Donghua
    Sun, Dehui
    Journal of Systems Science and Complexity, 2017, 30 : 1072 - 1083