Neural-Network-Based Control for Discrete-Time Nonlinear Systems with Input Saturation Under Stochastic Communication Protocol

被引:82
作者
Wang, Xueli [1 ]
Ding, Derui [2 ]
Dong, Hongli [3 ]
Zhang, Xian-Ming [2 ]
机构
[1] Univ Shanghai Sci & Technol, Dept Control Sci & Engn, Shanghai 200093, Peoples R China
[2] Swinburne Univ Technol, Sch Software & Elect Engn, Melbourne, Vic 3122, Australia
[3] Northeast Petr Univ, Inst Complex Syst & Adv Control, Daqing 163318, Peoples R China
基金
澳大利亚研究理事会;
关键词
Adaptive dynamic programming (ADP); constrained inputs; neural network (NN); stochastic communication protocols (SCPs); suboptimal control; DYNAMIC-PROGRAMMING ALGORITHM; OPTIMAL ADAPTIVE-CONTROL; STABILITY ANALYSIS;
D O I
10.1109/JAS.2021.1003922
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an adaptive dynamic programming (ADP) strategy is investigated for discrete-time nonlinear systems with unknown nonlinear dynamics subject to input saturation. To save the communication resources between the controller and the actuators, stochastic communication protocols (SCPs) are adopted to schedule the control signal, and therefore the closed-loop system is essentially a protocol-induced switching system. A neural network (NN)-based identifier with a robust term is exploited for approximating the unknown nonlinear system, and a set of switch-based updating rules with an additional tunable parameter of NN weights are developed with the help of the gradient descent. By virtue of a novel Lyapunov function, a sufficient condition is proposed to achieve the stability of both system identification errors and the update dynamics of NN weights. Then, a value iterative ADP algorithm in an offline way is proposed to solve the optimal control of protocol-induced switching systems with saturation constraints, and the convergence is profoundly discussed in light of mathematical induction. Furthermore, an actor-critic NN scheme is developed to approximate the control law and the proposed performance index function in the framework of ADP, and the stability of the closed-loop system is analyzed in view of the Lyapunov theory. Finally, the numerical simulation results are presented to demonstrate the effectiveness of the proposed control scheme.
引用
收藏
页码:766 / 778
页数:13
相关论文
共 44 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].
Al-Tamimi, Asma ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949
[3]   Dynamic programming for a Markov-switching jump-diffusion [J].
Azevedo, N. ;
Pinheiro, D. ;
Weber, G. -W. .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2014, 267 :1-19
[4]  
Bertsekas D., 1996, Neuro-Dynamic Pro-gramming, V27
[5]   Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence [J].
Dierks, Travis ;
Thumati, Balaje T. ;
Jagannathan, S. .
NEURAL NETWORKS, 2009, 22 (5-6) :851-860
[6]  
Ding D., 2019, IEEE T CYBERN, V50, P1
[7]   Secure State Estimation and Control of Cyber-Physical Systems: A Survey [J].
Ding, Derui ;
Han, Qing-Long ;
Ge, Xiaohua ;
Wang, Jun .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (01) :176-190
[8]   Neural-network-based output-feedback control with stochastic communication protocols [J].
Ding, Derui ;
Wang, Zidong ;
Han, Qing-Long .
AUTOMATICA, 2019, 106 :221-229
[9]   Stability analysis of stochastic networked control systems [J].
Donkers, M. C. F. ;
Heemels, W. P. M. H. ;
Bernardini, D. ;
Bemporad, A. ;
Shneer, V. .
AUTOMATICA, 2012, 48 (05) :917-925
[10]   Stability Analysis of Optimal Adaptive Control Under Value Iteration Using a Stabilizing Initial Policy [J].
Heydari, Ali .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) :4522-4527