Model-Free Inverse Optimal Control for Completely Unknown Nonlinear Systems by Adaptive Dynamic Programming

被引：0

作者：

Ahmadi, Peyman ^{[1
]}

Rahmani, Mehdi ^{[1
]}

Shahmansoorian, Aref ^{[1
]}

机构：

[1] Imam Khomeini Int Univ, Elect Engn Dept, Qazvin 3414916818, Iran

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2025年

关键词：

Optimal control; Polynomials; Linear programming; Mathematical models; Computational modeling; Nonlinear systems; Vectors; Trajectory; Numerical stability; Numerical models; Adaptive dynamic programming (ADP); inverse optimal control (IOC); sum-of-squares programming; unknown nonlinear systems; TRACKING CONTROL; OUTPUT-FEEDBACK;

D O I：

10.1109/TSMC.2025.3526576

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article presents an inverse optimal control (IOC) approach for nonlinear polynomial systems based on adaptive dynamic programming (ADP). First, a novel model-based algorithm is presented which provides a control policy, a suboptimal Lyapunov function, and an objective function, that is, minimized by applying the control policy. It is then extended to a model-free approach for systems with a completely unknown model using only the measured input/output data. Compared with existing ADP-based algorithms for nonlinear continuous-time systems, the proposed algorithm is data-based and does not rely on numerical solutions for model approximation. Additionally, it is an off-policy algorithm and avoids the repeat of experiments for control design. Instead, an ADP-based sum-of-squares programming is presented which is computationally tractable. The theoretical guarantee for the stability of the proposed IOC is established using the Lyapunov technique. The performance and efficacy of the proposed approach are investigated through three simulation examples.

引用

页数：12

共 49 条

[1] From inverse optimal control to inverse reinforcement learning: A historical review
Ab Azar, Nematollah
Shahmansoorian, Aref
Davoudi, Mohsen
[J]. ANNUAL REVIEWS IN CONTROL, 2020, 50 : 119 - 138
[2] LQR based optimal co-design for linear control systems with input and state constraints
Ahmadi, Peyman
Rahmani, Mehdi
Shahmansoorian, Aref
[J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2023, 54 (05) : 1136 - 1149
[3] Inverse-Optimal Consensus Control of Fractional-Order Multiagent Systems
An, Chunlan
Su, Housheng
Chen, Shiming
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08): : 5320 - 5331
[4] [Anonymous], 1964, Journal of Fluids Engineering, Transactions of the ASME, DOI [10.1115/1.3653115, DOI 10.1115/1.3653115]
[5] Online estimation of objective function for continuous-time deterministic systems
Asl, Hamed Jabbari
Uchibe, Eiji
[J]. NEURAL NETWORKS, 2024, 172
[6] Online Data-Driven Inverse Reinforcement Learning for Deterministic Systems
Asl, Hamed Jabbari
Uchibe, Eiji
[J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 884 - 889
[7] Event-triggered-based adaptive dynamic programming for distributed formation control of multi-UAV
Dou, Liqian
Cai, Siyuan
Zhang, Xiuyun
Su, Xiaotong
Zhang, Ruilong
[J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2022, 359 (08): : 3671 - 3691
[8] Data-Driven Cooperative Output Regulation of Multi-Agent Systems via Robust Adaptive Dynamic Programming
Gao, Weinan
Jiang, Yu
Davari, Masoud
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2019, 66 (03) : 447 - 451
[9] Output-feedback adaptive optimal control of interconnected systems based on robust adaptive dynamic programming
Gao, Weinan
Jiang, Yu
Jiang, Zhong-Ping
Chai, Tianyou
[J]. AUTOMATICA, 2016, 72 : 37 - 45
[10] Adaptive Inverse Control of a Vibrating Coupled Vessel-Riser System With Input Backlash
He, Xiuyu
Zhao, Zhijia
Su, Jinya
Yang, Qinmin
Zhu, Dachang
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (08): : 4706 - 4715

← 1 2 3 4 5 →