Model-Free Inverse Optimal Control for Completely Unknown Nonlinear Systems by Adaptive Dynamic Programming

被引:0
作者
Ahmadi, Peyman [1 ]
Rahmani, Mehdi [1 ]
Shahmansoorian, Aref [1 ]
机构
[1] Imam Khomeini Int Univ, Elect Engn Dept, Qazvin 3414916818, Iran
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2025年
关键词
Optimal control; Polynomials; Linear programming; Mathematical models; Computational modeling; Nonlinear systems; Vectors; Trajectory; Numerical stability; Numerical models; Adaptive dynamic programming (ADP); inverse optimal control (IOC); sum-of-squares programming; unknown nonlinear systems; TRACKING CONTROL; OUTPUT-FEEDBACK;
D O I
10.1109/TSMC.2025.3526576
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article presents an inverse optimal control (IOC) approach for nonlinear polynomial systems based on adaptive dynamic programming (ADP). First, a novel model-based algorithm is presented which provides a control policy, a suboptimal Lyapunov function, and an objective function, that is, minimized by applying the control policy. It is then extended to a model-free approach for systems with a completely unknown model using only the measured input/output data. Compared with existing ADP-based algorithms for nonlinear continuous-time systems, the proposed algorithm is data-based and does not rely on numerical solutions for model approximation. Additionally, it is an off-policy algorithm and avoids the repeat of experiments for control design. Instead, an ADP-based sum-of-squares programming is presented which is computationally tractable. The theoretical guarantee for the stability of the proposed IOC is established using the Lyapunov technique. The performance and efficacy of the proposed approach are investigated through three simulation examples.
引用
收藏
页数:12
相关论文
共 49 条
  • [1] From inverse optimal control to inverse reinforcement learning: A historical review
    Ab Azar, Nematollah
    Shahmansoorian, Aref
    Davoudi, Mohsen
    [J]. ANNUAL REVIEWS IN CONTROL, 2020, 50 : 119 - 138
  • [2] LQR based optimal co-design for linear control systems with input and state constraints
    Ahmadi, Peyman
    Rahmani, Mehdi
    Shahmansoorian, Aref
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2023, 54 (05) : 1136 - 1149
  • [3] Inverse-Optimal Consensus Control of Fractional-Order Multiagent Systems
    An, Chunlan
    Su, Housheng
    Chen, Shiming
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08): : 5320 - 5331
  • [4] [Anonymous], 1964, Journal of Fluids Engineering, Transactions of the ASME, DOI [10.1115/1.3653115, DOI 10.1115/1.3653115]
  • [5] Online estimation of objective function for continuous-time deterministic systems
    Asl, Hamed Jabbari
    Uchibe, Eiji
    [J]. NEURAL NETWORKS, 2024, 172
  • [6] Online Data-Driven Inverse Reinforcement Learning for Deterministic Systems
    Asl, Hamed Jabbari
    Uchibe, Eiji
    [J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 884 - 889
  • [7] Event-triggered-based adaptive dynamic programming for distributed formation control of multi-UAV
    Dou, Liqian
    Cai, Siyuan
    Zhang, Xiuyun
    Su, Xiaotong
    Zhang, Ruilong
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2022, 359 (08): : 3671 - 3691
  • [8] Data-Driven Cooperative Output Regulation of Multi-Agent Systems via Robust Adaptive Dynamic Programming
    Gao, Weinan
    Jiang, Yu
    Davari, Masoud
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2019, 66 (03) : 447 - 451
  • [9] Output-feedback adaptive optimal control of interconnected systems based on robust adaptive dynamic programming
    Gao, Weinan
    Jiang, Yu
    Jiang, Zhong-Ping
    Chai, Tianyou
    [J]. AUTOMATICA, 2016, 72 : 37 - 45
  • [10] Adaptive Inverse Control of a Vibrating Coupled Vessel-Riser System With Input Backlash
    He, Xiuyu
    Zhao, Zhijia
    Su, Jinya
    Yang, Qinmin
    Zhu, Dachang
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (08): : 4706 - 4715