Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method

被引：32

作者：

Jiang, He ^{[1
]}

Zhang, Huaguang ^{[1
]}

Luo, Yanhong ^{[1
]}

Wang, Junyi ^{[1
]}

机构：

[1] Northeastern Univ, Coll Informat Sci & Engn, Box 134, Shenyang 110819, Peoples R China

来源：

NEUROCOMPUTING | 2016年 / 194卷

基金：

中国国家自然科学基金;

关键词：

Optimal tracking control; Markov jump systems; Data-based; Reinforcement learning; Adaptive dynamic programming; Neural networks; SYNCHRONIZATION CONTROL; GRAPHICAL GAMES; CONTROL SCHEME; STABILITY; ALGORITHM;

D O I：

10.1016/j.neucom.2016.02.029

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we develop a novel optimal tracking control scheme for a class of nonlinear discrete-time Markov jump systems (MJSs) by utilizing a data-based reinforcement learning method. It is not practical to obtain accurate system models of the real-world MJSs due to the existence of abrupt variations in their system structures. Consequently, most traditional model-based methods for MJSs are invalid for the practical engineering applications. In order to overcome the difficulties without any identification scheme which would cause estimation errors, a model-free adaptive dynamic programming (ADP) algorithm will be designed by using system data rather than accurate system functions. Firstly, we combine the tracking error dynamics and reference system dynamics to form an augmented system. Then, based on the augmented system, a new performance index function with discount factor is formulated for the optimal tracking control problem via Markov chain and weighted sum technique. Neural networks are employed to implement the on-line ADP learning algorithm. Finally, a simulation example is given to demonstrate the effectiveness of our proposed approach. (C) 2016 Elsevier B.V. All rights reserved.

引用

页码：176 / 182

页数：7

共 50 条

[1] Data-Based Optimal Tracking Control of Nonaffine Nonlinear Discrete-Time Systems
Luo, Biao
Liu, Derong
Huang, Tingwen
Li, Chao
NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 573 - 581
[2] Optimal Control for Unknown Discrete-Time Nonlinear Markov Jump Systems Using Adaptive Dynamic Programming
Zhong, Xiangnan
He, Haibo
Zhang, Huaguang
Wang, Zhanshan
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (12) : 2141 - 2155
[3] Reinforcement learning-based optimal control for Markov jump systems with completely unknown dynamics
Shi, Xiongtao
Li, Yanjie
Du, Chenglong
Chen, Chaoyang
Zong, Guangdeng
Gui, Weihua
AUTOMATICA, 2025, 171
[4] Reinforcement Q-learning algorithm for H∞ tracking control of discrete-time Markov jump systems
Shi, Jiahui
He, Dakuo
Zhang, Qiang
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2025, 56 (03) : 502 - 523
[5] Online optimal and adaptive integral tracking control for varying discrete-time systems using reinforcement learning
Sanusi, Ibrahim
Mills, Andrew
Dodd, Tony
Konstantopoulos, George
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2020, 34 (08) : 971 - 991
[6] Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure
Luo, Biao
Liu, Derong
Wu, Huai-Ning
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2099 - 2111
[7] H∞ control with constrained input for completely unknown nonlinear systems using data-driven reinforcement learning method
Jiang, He
Zhang, Huaguang
Luo, Yanhong
Cui, Xiaohong
NEUROCOMPUTING, 2017, 237 : 226 - 234
[8] Robust control scheme for a class of uncertain nonlinear systems with completely unknown dynamics using data-driven reinforcement learning method
Jiang, He
Zhang, Huaguang
Cui, Yang
Xiao, Geyang
NEUROCOMPUTING, 2018, 273 : 68 - 77
[9] Optimal Tracking Control for Linear Discrete-time Systems Using Reinforcement Learning
Kiumarsi-Khomartash, Bahare
Lewis, Frank L.
Naghibi-Sistani, Mohammad-Bagher
Karimpour, Ali
2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3845 - 3850
[10] Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning
Yang, Xiong
Liu, Derong
Wang, Ding
Wei, Qinglai
NEURAL NETWORKS, 2014, 55 : 30 - 41

← 1 2 3 4 5 →