H∞ Tracking Control for Linear Discrete-Time Systems: Model-Free Q-Learning Designs

被引：40

作者：

Yang, Yunjie ^{[1
]}

Wan, Yan ^{[2
]}

Zhu, Jihong ^{[1
]}

Lewis, Frank L. ^{[3
]}

机构：

[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China

[2] Univ Texas Arlington, Dept Elect Engn, Arlington, TX 76019 USA

[3] Univ Texas Arlington, UTA Res Inst, Ft Worth, TX 75052 USA

来源：

IEEE CONTROL SYSTEMS LETTERS | 2021年 / 5卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Linear discrete-time systems; H-infinity tracking control; Q-learning; ZERO-SUM GAMES;

D O I：

10.1109/LCSYS.2020.3001241

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this letter, a novel model-free Q-learning based approach is developed to solve the H-infinity tracking problem for linear discrete-time systems. A new exponential discounted value function is introduced that includes the cost of the whole control input and tracking error. The tracking Bellman equation and the game algebraic Riccati equation (GARE) are derived. The solution to the GARE leads to the feedback and feedforward parts of the control input. A Q-learning algorithm is then developed to learn the solution of the GARE online without requiring any knowledge of the system dynamics. Convergence of the algorithm is analyzed, and it is also proved that probing noises in maintaining the persistence of excitation (PE) condition do not result in any bias. An example of the F-16 aircraft short period dynamics is developed to validate the proposed algorithm.

引用

页码：175 / 180

页数：6

共 50 条

[21] Adjustable Iterative Q-Learning Schemes for Model-Free Optimal Tracking Control
Qiao, Junfei
Zhao, Mingming
Wang, Ding
Ha, Mingming
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (02): : 1202 - 1213
[22] Optimal tracking control for discrete-time modal persistent dwell time switched systems based on Q-learning
Zhang, Xuewen
Wang, Yun
Xia, Jianwei
Li, Feng
Shen, Hao
OPTIMAL CONTROL APPLICATIONS & METHODS, 2023, 44 (06) : 3327 - 3341
[23] The Adaptive Optimal Output Feedback Tracking Control of Unknown Discrete-Time Linear Systems Using a Multistep Q-Learning Approach
Dong, Xunde
Lin, Yuxin
Suo, Xudong
Wang, Xihao
Sun, Weijie
MATHEMATICS, 2024, 12 (04)
[24] Model-free extended Q-learning method for H∞, output tracking control of networked control systems with network delays and packet loss
Hao, Longyan
Wang, Chaoli
Liang, Dong
Li, Shihua
NEUROCOMPUTING, 2025, 634
[25] Explainable data-driven Q-learning control for a class of discrete-time linear autonomous systems
Perrusquia, Adolfo
Zou, Mengbang
Guo, Weisi
INFORMATION SCIENCES, 2024, 682
[26] Adaptive Q-Learning Based Model-Free H∞ Control of Continuous-Time Nonlinear Systems: Theory and Application
Zhao, Jun
Lv, Yongfeng
Wang, Zhangu
Zhao, Ziliang
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
[27] Model-Free Optimal Output Regulation for Linear Discrete-Time Lossy Networked Control Systems
Fan, Jialu
Wu, Qian
Jiang, Yi
Chai, Tianyou
Lewis, Frank L.
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4033 - 4042
[28] Adaptive optimal output feedback tracking control for unknown discrete-time linear systems using a combined reinforcement Q-learning and internal model method
Sun, Weijie
Zhao, Guangyue
Peng, Yunjian
IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (18) : 3075 - 3086
[29] Experience replay-based output feedback Q-learning scheme for optimal output tracking control of discrete-time linear systems
Rizvi, Syed Ali Asad
Lin, Zongli
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2019, 33 (12) : 1825 - 1842
[30] Model-free Q-learning over Finite Horizon for Uncertain Linear Continuous-time Systems
Xu, Hao
Jagannathan, S.
2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 164 - 169

← 1 2 3 4 5 →