Neural-Network-Based Output-Feedback Control Under Round-Robin Scheduling Protocols

Cited by: 198
Authors
Ding, Derui [1]
Wang, Zidong [2, 3]
Han, Qing-Long [1]
Wei, Guoliang [4]
Affiliations
[1] Swinburne Univ Technol, Sch Software & Elect Engn, Melbourne, Vic 3122, Australia
[2] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Shandong, Peoples R China
[3] Brunel Univ London, Dept Comp Sci, Uxbridge UB8 3PH, Middx, England
[4] Univ Shanghai Sci & Technol, Dept Control Sci & Engn, Shanghai 200093, Peoples R China
Funding
Australian Research Council; National Natural Science Foundation of China
Keywords
Actor-critic structures; neural networks (NNs); output-feedback control; periodic systems; round-robin (RR) scheduling protocol; TIME-VARYING SYSTEMS; NONLINEAR-SYSTEMS; MULTIAGENT SYSTEMS; CONSENSUS CONTROL; DESIGN; STABILIZATION; ARCHITECTURE; STABILITY
DOI
10.1109/TCYB.2018.2827037
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Neural-network (NN)-based output-feedback control is considered for a class of stochastic nonlinear systems under round-robin (RR) scheduling protocols. To mitigate data congestion and save energy, the RR protocols are implemented, and the resulting nonlinear systems become the so-called protocol-induced periodic ones. Taking this periodic characteristic into account, an NN-based observer is first proposed to reconstruct the system states, where a novel adaptive tuning law on the NN weights is adopted to cater to the requirement of performance analysis. In addition, with the established boundedness of the periodic systems in the mean-square sense, the desired observer gain is obtained by solving a set of matrix inequalities. Then, an actor-critic NN scheme with a time-varying step length in the adaptive law is developed to handle the considered control problem with terminal constraints over a finite horizon. Sufficient conditions are derived to guarantee the boundedness of the estimation errors of the critic and actor NN weights. In view of these conditions, some key parameters in the adaptive tuning laws are easily determined via elementary algebraic operations. Furthermore, stability in the mean-square sense is investigated for the same problem over the infinite horizon. Finally, a simulation example is utilized to illustrate the applicability of the proposed control scheme.
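As a rough illustration of why RR scheduling makes the closed loop periodic, the sketch below cycles network access through the sensor nodes in a fixed order, so the access pattern repeats every `num_nodes` steps. It is not taken from the paper: the function names, the 0-based node index `k % num_nodes`, the zero-order hold at the receiver, and the zero initial held values are all assumptions made for illustration.

```python
def rr_active_node(k, num_nodes):
    """Index (0-based, assumed convention) of the node granted access at step k."""
    return k % num_nodes


def rr_received(measurements, num_nodes):
    """Apply round-robin access with an assumed zero-order hold.

    measurements: list of per-step lists, one value per sensor node.
    Returns the sequence of measurement vectors seen at the receiver.
    """
    held = [0.0] * num_nodes  # assumed initial values at the receiver
    received = []
    for k, y in enumerate(measurements):
        i = rr_active_node(k, num_nodes)
        held[i] = y[i]  # only the active node's value is refreshed
        received.append(list(held))  # other entries keep their last value
    return received


# Access pattern for three nodes over seven steps: it repeats with period 3.
schedule = [rr_active_node(k, 3) for k in range(7)]
```

Because the active node depends only on `k % num_nodes`, any dynamics driven by the received measurements inherit that period, which is the sense in which the abstract calls the system protocol-induced periodic.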
Pages: 2372-2384
Number of pages: 13