A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture

Cited by: 0
Authors
SONG RuiZhuo [1]
XIAO WenDong [1]
SUN ChangYin [1]
Affiliations
[1] School of Automation and Electrical Engineering, University of Science and Technology Beijing
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
adaptive dynamic programming; discrete-time; optimal control; ESN; costate function;
DOI
Not available
Chinese Library Classification (CLC) number
TP13 [Automatic control theory];
Discipline classification codes
0711 ; 071102 ; 0811 ; 081101 ; 081103 ;
Abstract
A novel self-learning optimal control method for a class of discrete-time nonlinear systems is proposed based on an iterative adaptive dynamic programming (ADP) algorithm. It is proven that the iterative costate functions converge to the optimal one, and a detailed convergence analysis of the iterative ADP algorithm is given. Furthermore, an echo state network (ESN) architecture is used as the approximator of the costate function at each iteration. To ensure the reliability of the ESN approximator, the ESN mean square training error is constrained to a satisfactory range. Two simulation examples are given to demonstrate that the proposed control method has a fast response speed due to the special structure and the fast training process.
Pages: 284 - 293
Number of pages: 10
Related papers
  • [1] A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture
    RuiZhuo Song
    WenDong Xiao
    ChangYin Sun
    Science China Information Sciences, 2014, 57 : 1 - 10
  • [2] A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture
    Song RuiZhuo
    Xiao WenDong
    Sun ChangYin
    SCIENCE CHINA-INFORMATION SCIENCES, 2014, 57 (06) : 1 - 10
  • [3] Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Liu, Derong
    Yang, Xiong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (04) : 866 - 879
  • [4] Optimal Self-Learning Control Scheme for Discrete-Time Nonlinear Systems Using Local Value Iteration
    Wei, Qinglai
    Liu, Derong
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3544 - 3549
  • [5] Discrete-Time Self-Learning Parallel Control
    Wei, Qinglai
    Wang, Lingxiao
    Lu, Jingwei
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (01) : 192 - 204
  • [6] Learning-based T-sHDP(λ) for optimal control of a class of nonlinear discrete-time systems
    Yu, Luyang
    Liu, Weibo
    Liu, Yurong
    Alsaadi, Fawaz E.
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) : 2624 - 2643
  • [7] Self-Learning Optimal Regulation for Discrete-Time Nonlinear Systems Under Event-Driven Formulation
    Wang, Ding
    Ha, Mingming
    Qiao, Junfei
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (03) : 1272 - 1279
  • [8] Optimal Iterative Learning Control for Nonlinear Discrete-time Systems
    Xu Hong-wei
    KEY ENGINEERING MATERIALS AND COMPUTER SCIENCE, 2011, 320 : 605 - 609
  • [9] Optimal Iterative Learning Control for Nonlinear Discrete-Time Systems
    Xu, Hong-wei
    SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING: THEORY AND PRACTICE, VOL 2, 2012, 115 : 69 - 75
  • [10] Relaxed Optimal Control With Self-Learning Horizon for Discrete-Time Stochastic Dynamics
    Wang, Ding
    Wang, Jiangyu
    Liu, Ao
    Liu, Derong
    Qiao, Junfei
    IEEE TRANSACTIONS ON CYBERNETICS, 2025, 55 (03) : 1183 - 1196