Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method

被引:32
作者
Jiang, He [1 ]
Zhang, Huaguang [1 ]
Luo, Yanhong [1 ]
Wang, Junyi [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Box 134, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal tracking control; Markov jump systems; Data-based; Reinforcement learning; Adaptive dynamic programming; Neural networks; SYNCHRONIZATION CONTROL; GRAPHICAL GAMES; CONTROL SCHEME; STABILITY; ALGORITHM;
D O I
10.1016/j.neucom.2016.02.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we develop a novel optimal tracking control scheme for a class of nonlinear discrete-time Markov jump systems (MJSs) by utilizing a data-based reinforcement learning method. It is not practical to obtain accurate system models of the real-world MJSs due to the existence of abrupt variations in their system structures. Consequently, most traditional model-based methods for MJSs are invalid for the practical engineering applications. In order to overcome the difficulties without any identification scheme which would cause estimation errors, a model-free adaptive dynamic programming (ADP) algorithm will be designed by using system data rather than accurate system functions. Firstly, we combine the tracking error dynamics and reference system dynamics to form an augmented system. Then, based on the augmented system, a new performance index function with discount factor is formulated for the optimal tracking control problem via Markov chain and weighted sum technique. Neural networks are employed to implement the on-line ADP learning algorithm. Finally, a simulation example is given to demonstrate the effectiveness of our proposed approach. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:176 / 182
页数:7
相关论文
共 50 条
  • [31] Adaptive optimal control of unknown discrete-time linear systems with guaranteed prescribed degree of stability using reinforcement learning
    Razavi, Seyed Ehsan
    Moradi, Mohammad Amin
    Shamaghdari, Saeed
    Menhaj, Mohammad Bagher
    [J]. INTERNATIONAL JOURNAL OF DYNAMICS AND CONTROL, 2022, 10 (03) : 870 - 878
  • [32] FINITE-HORIZON OPTIMAL CONTROL OF DISCRETE-TIME LINEAR SYSTEMS WITH COMPLETELY UNKNOWN DYNAMICS USING Q-LEARNING
    Zhao, Jingang
    Zhang, Chi
    [J]. JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2021, 17 (03) : 1471 - 1483
  • [33] Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning
    Yang, Xiong
    Liu, Derong
    Luo, Biao
    Li, Chao
    [J]. INFORMATION SCIENCES, 2016, 369 : 731 - 747
  • [34] Dynamic event-triggered control for discrete-time nonlinear Markov jump systems using policy iteration-based adaptive dynamic programming
    Tang, Fanghua
    Wang, Huanqing
    Chang, Xiao-Heng
    Zhang, Liang
    Alharbi, Khalid H.
    [J]. NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, 49
  • [35] Error-based adaptive optimal tracking control of nonlinear discrete-time systems
    Li, Chun
    Ding, Jinliang
    Lewis, Frank L.
    Chai, Tianyou
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (01)
  • [36] Data-based optimal control design with reinforcement learning for nonlinear PDE systems
    Zheng, Yuqing
    Zhang, Guoshan
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 345 - 350
  • [37] Learning Optimal Control Policy for Unknown Discrete-Time Systems
    Lai, Jing
    Xiong, Junlin
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (11) : 4191 - 4195
  • [38] Optimal Control of Unknown Discrete-Time Nonlinear Systems with Constrained Inputs Using GDHP Technique
    Liu Derong
    Wang Ding
    Li Hongliang
    [J]. PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 2926 - 2931
  • [39] Stable Iterative Optimal Control for Discrete-Time Nonlinear Systems Using Numerical Controller
    Wei, Qinglai
    Liu, Derong
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON VEHICULAR ELECTRONICS AND SAFETY (ICVES), 2013, : 185 - 188
  • [40] Reinforcement Learning Design-Based Adaptive Tracking Control With Less Learning Parameters for Nonlinear Discrete-Time MIMO Systems
    Liu, Yan-Jun
    Tang, Li
    Tong, Shaocheng
    Chen, C. L. Philip
    Li, Dong-Juan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (01) : 165 - 176