Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method

被引:32
作者
Jiang, He [1 ]
Zhang, Huaguang [1 ]
Luo, Yanhong [1 ]
Wang, Junyi [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Box 134, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal tracking control; Markov jump systems; Data-based; Reinforcement learning; Adaptive dynamic programming; Neural networks; SYNCHRONIZATION CONTROL; GRAPHICAL GAMES; CONTROL SCHEME; STABILITY; ALGORITHM;
D O I
10.1016/j.neucom.2016.02.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we develop a novel optimal tracking control scheme for a class of nonlinear discrete-time Markov jump systems (MJSs) by utilizing a data-based reinforcement learning method. It is not practical to obtain accurate system models of the real-world MJSs due to the existence of abrupt variations in their system structures. Consequently, most traditional model-based methods for MJSs are invalid for the practical engineering applications. In order to overcome the difficulties without any identification scheme which would cause estimation errors, a model-free adaptive dynamic programming (ADP) algorithm will be designed by using system data rather than accurate system functions. Firstly, we combine the tracking error dynamics and reference system dynamics to form an augmented system. Then, based on the augmented system, a new performance index function with discount factor is formulated for the optimal tracking control problem via Markov chain and weighted sum technique. Neural networks are employed to implement the on-line ADP learning algorithm. Finally, a simulation example is given to demonstrate the effectiveness of our proposed approach. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:176 / 182
页数:7
相关论文
共 50 条
  • [1] Data-Based Optimal Tracking Control of Nonaffine Nonlinear Discrete-Time Systems
    Luo, Biao
    Liu, Derong
    Huang, Tingwen
    Li, Chao
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 573 - 581
  • [2] Optimal Control for Unknown Discrete-Time Nonlinear Markov Jump Systems Using Adaptive Dynamic Programming
    Zhong, Xiangnan
    He, Haibo
    Zhang, Huaguang
    Wang, Zhanshan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (12) : 2141 - 2155
  • [3] Reinforcement learning-based optimal control for Markov jump systems with completely unknown dynamics
    Shi, Xiongtao
    Li, Yanjie
    Du, Chenglong
    Chen, Chaoyang
    Zong, Guangdeng
    Gui, Weihua
    AUTOMATICA, 2025, 171
  • [4] Reinforcement Q-learning algorithm for H∞ tracking control of discrete-time Markov jump systems
    Shi, Jiahui
    He, Dakuo
    Zhang, Qiang
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2025, 56 (03) : 502 - 523
  • [5] Online optimal and adaptive integral tracking control for varying discrete-time systems using reinforcement learning
    Sanusi, Ibrahim
    Mills, Andrew
    Dodd, Tony
    Konstantopoulos, George
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2020, 34 (08) : 971 - 991
  • [6] Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure
    Luo, Biao
    Liu, Derong
    Wu, Huai-Ning
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2099 - 2111
  • [7] H∞ control with constrained input for completely unknown nonlinear systems using data-driven reinforcement learning method
    Jiang, He
    Zhang, Huaguang
    Luo, Yanhong
    Cui, Xiaohong
    NEUROCOMPUTING, 2017, 237 : 226 - 234
  • [8] Robust control scheme for a class of uncertain nonlinear systems with completely unknown dynamics using data-driven reinforcement learning method
    Jiang, He
    Zhang, Huaguang
    Cui, Yang
    Xiao, Geyang
    NEUROCOMPUTING, 2018, 273 : 68 - 77
  • [9] Optimal Tracking Control for Linear Discrete-time Systems Using Reinforcement Learning
    Kiumarsi-Khomartash, Bahare
    Lewis, Frank L.
    Naghibi-Sistani, Mohammad-Bagher
    Karimpour, Ali
    2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3845 - 3850
  • [10] Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning
    Yang, Xiong
    Liu, Derong
    Wang, Ding
    Wei, Qinglai
    NEURAL NETWORKS, 2014, 55 : 30 - 41