Adaptive Q-Learning Based Model-Free H∞ Control of Continuous-Time Nonlinear Systems: Theory and Application

Cited by: 4
Authors
Zhao, Jun [1 ]
Lv, Yongfeng [2 ]
Wang, Zhangu [1 ]
Zhao, Ziliang [1 ]
Affiliations
[1] Shandong Univ Sci & Technol, Coll Transportat, Shandong Key Lab Hydrogen Elect Hybrid Power Syst, Qingdao 266590, Peoples R China
[2] Taiyuan Univ Technol, Coll Elect & Power Engn, Taiyuan 030024, Peoples R China
Source
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2025, Vol. 9, No. 2
Funding
National Natural Science Foundation of China;
Keywords
Q-learning; Artificial neural networks; Nonlinear systems; Cost function; Control systems; Heuristic algorithms; System dynamics; Reinforcement learning; H-infinity control; Learning law; Linear systems;
DOI
10.1109/TETCI.2024.3449870
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Although model-based H-infinity control of nonlinear continuous-time (CT) systems has been extensively studied, model-free H-infinity control of nonlinear CT systems with unknown dynamics via Q-learning remains a challenging problem. This paper develops a novel Q-learning-based model-free H-infinity control scheme for nonlinear CT systems in which the adaptive critic and actor update each other continuously and simultaneously, eliminating the need for iterative steps. As a result, a hybrid structure is avoided and no initial stabilizing control policy is required. To obtain the H-infinity control of the nonlinear CT system, a Q-learning strategy is introduced to solve the H-infinity control problem online in a non-iterative manner, without knowledge of the system dynamics. In addition, a new learning law is developed that uses a sliding-mode scheme to update the critic neural network (NN) weights online. Owing to the strong convergence of the critic NN weights, the actor NN used in most H-infinity control algorithms is removed. Finally, numerical simulations and experimental results on an adaptive cruise control (ACC) system of a real vehicle demonstrate the feasibility of the presented control method and learning algorithm.
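As a rough illustration of the critic-only, non-iterative learning idea described in the abstract, the sketch below runs online zero-sum-game (H-infinity) learning on a simple linear benchmark: a single critic approximates the value function, the control and worst-case disturbance are derived from it in closed form, and the weights are driven by a sign-based update reminiscent of a sliding-mode learning law. This is not the paper's algorithm; the dynamics (A, B, D), basis phi, gains (Q, R, gamma, alpha), probing noise, and the specific update rule are all illustrative assumptions.

```python
# Minimal sketch (illustrative, not the paper's exact method): online
# critic-only learning for the H-infinity / zero-sum game problem on a
# linear benchmark, with a sign-based ("sliding-mode-style") weight update
# so that no separate actor network is needed.
import numpy as np

# Example dynamics: dx/dt = A x + B u + D w  (control u, disturbance w)
A = np.array([[-1.0, 1.0], [0.0, -2.0]])
B = np.array([[0.0], [1.0]])
D = np.array([[0.1], [0.1]])

Q = np.eye(2)      # state penalty
R = np.eye(1)      # control penalty
gamma = 2.0        # prescribed L2-gain bound
alpha = 5.0        # critic learning rate

def phi(x):
    """Quadratic basis for the value function V(x) ~ W^T phi(x)."""
    x1, x2 = x
    return np.array([x1 * x1, x1 * x2, x2 * x2])

def dphi(x):
    """Jacobian of the basis, shape (3, 2)."""
    x1, x2 = x
    return np.array([[2 * x1, 0.0],
                     [x2, x1],
                     [0.0, 2 * x2]])

W = np.zeros(3)              # critic weights; no initial stabilizing policy
x = np.array([1.0, -1.0])    # initial state
dt = 1e-3

for k in range(20000):
    grad_V = dphi(x).T @ W   # dV/dx under the current critic
    # Min-max policies derived in closed form from the critic:
    u = -0.5 * np.linalg.solve(R, B.T @ grad_V)        # control (minimizer)
    w = (1.0 / (2.0 * gamma**2)) * D.T @ grad_V        # worst-case disturbance
    # Probing noise keeps the regressor exciting (persistence of excitation).
    u_applied = u + 0.1 * np.sin(50 * k * dt)
    xdot = A @ x + B @ u_applied + D @ w
    # Continuous-time Bellman (HJI) residual:
    sigma = dphi(x) @ xdot
    e = W @ sigma + x @ Q @ x + u @ R @ u - gamma**2 * float(w @ w)
    # Sliding-mode-style law: drive the residual to zero via its sign.
    W = W - dt * alpha * sigma / (1.0 + sigma @ sigma) * np.sign(e)
    x = x + dt * xdot

print("learned critic weights:", W)
```

The general design point this sketch tries to convey: a discontinuous, sign-based update trades the smooth asymptotic convergence of a gradient law for faster, more robust convergence of the Bellman residual, which is what makes it plausible to drop the actor network in critic-only schemes.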
Pages: 1143-1152
Page count: 10