Reinforcement Learning for Fuzzy Structured Adaptive Optimal Control of Discrete-Time Nonlinear Complex Networks

被引：2

作者：

Wu, Tao ^{[1
]}

Cao, Jinde ^{[1
,2
,3
]}

Xiong, Lianglin ^{[4
]}

Park, Ju H. ^{[5
]}

Lam, Hak-Keung ^{[6
]}

机构：

[1] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China

[2] Southeast Univ, Frontiers Sci Ctr Mobile Informat Commun & Secur, Nanjing 210096, Peoples R China

[3] Purple Mt Labs, Nanjing 211111, Peoples R China

[4] Yunnan Open Univ, Sch Media & Informat Engn, Kunming 650504, Peoples R China

[5] Yeungnam Univ, Dept Elect Engn, Gyongsan 38541, South Korea

[6] Kings Coll London, Dept Engn, London WC2R 2LS, England

来源：

IEEE TRANSACTIONS ON FUZZY SYSTEMS | 2024年 / 32卷 / 11期

基金：

新加坡国家研究基金会;

关键词：

Adaptive optimal control; discrete-time nonlinear complex networks; fuzzy coupled algebraic Riccati equations (CAREs); reinforcement learning (RL); structural learning iteration; PINNING SYNCHRONIZATION; ROBUST STABILIZATION; TRACKING CONTROL; CONTROL-SYSTEMS;

D O I：

10.1109/TFUZZ.2024.3434690

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This article focuses on fuzzy structural adaptive optimal control issue of discrete-time nonlinear complex networks (CNs) via adopting the reinforcement learning (RL) and Takagi-Sugeno fuzzy modeling approaches, where the control gains are subjected to structured constraints. In accordance with the Bellman optimality theory, the modified fuzzy coupled algebraic Riccati equations (CAREs) are constructed for discrete-time fuzzy CNs, while the modified fuzzy CAREs are difficult to solve directly through mathematical approaches. Then, a model-based offline learning iteration algorithm is developed to solve the modified fuzzy CAREs, where the network dynamics information is needed. Moreover, a novel data-driven off-policy RL algorithm is given to compute the modified fuzzy CAREs, and the structural optimal solutions can be obtained directly by using the collected state and input data in the absence of the network dynamics information. Furthermore, the convergence proofs of the presented learning algorithms are provided. In the end, the validity and practicability of the theoretical results are explicated via two numerical simulations.

引用

页码：6035 / 6043

页数：9

共 50 条

[31] Reinforcement learning-based online adaptive controller design for a class of unknown nonlinear discrete-time systems with time delays [J].

Liang, Yuling ;

Zhang, Huaguang ;

Xiao, Geyang ;

Jiang, He .

NEURAL COMPUTING & APPLICATIONS, 2018, 30 (06) :1733-1745

[32] Inverse Reinforcement Learning for Discrete-Time Systems With Data Dropouts [J].

Fan, Jialu ;

Shi, Pengfei ;

Xue, Wenqian ;

Lian, Bosen ;

Cui, Yunfang ;

Lewis, Frank L. .

IEEE TRANSACTIONS ON CYBERNETICS, 2025, 55 (04) :1744-1757

[33] A Novel Adaptive Control Design for a Class of Nonstrict-Feedback Discrete-Time Systems via Reinforcement Learning [J].

Bai, Weiwei ;

Li, Tieshan ;

Long, Yue ;

Chen, C. L. Philip ;

Xiao, Yang ;

Li, Wenjiang ;

Li, Ronghui .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (02) :1250-1262

[34] Design of an adaptive fuzzy sliding mode control for uncertain discrete-time nonlinear systems based on noisy measurements [J].

Yoshimura, Toshio .

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2016, 47 (03) :617-630

[35] Learning-based T-sHDP(λ) for optimal control of a class of nonlinear discrete-time systems [J].

Yu, Luyang ;

Liu, Weibo ;

Liu, Yurong ;

Alsaadi, Fawaz E. .

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) :2624-2643

[36] Reinforcement Learning-Based Model Predictive Control for Discrete-Time Systems [J].

Lin, Min ;

Sun, Zhongqi ;

Xia, Yuanqing ;

Zhang, Jinhui .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) :3312-3324

[37] Adaptive Optimal Control for Discrete-Time Linear Systems via Hybrid Iteration [J].

Qasem, Omar ;

Gao, Weinan ;

Gutierrez, Hector .

2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, :1141-1146

[38] Reinforcement Learning and Adaptive Optimal Control for Continuous-Time Nonlinear Systems: A Value Iteration Approach [J].

Bian, Tao ;

Jiang, Zhong-Ping .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) :2781-2790

[39] Adaptive control of a class of switched nonlinear discrete-time systems with unknown parameter [J].

Wang, Hao ;

Liu, Yan-Jun ;

Tong, Shaocheng .

NEUROCOMPUTING, 2016, 214 :1-6

[40] A CO-OPERATIVE CONTROL APPROACH TO THE REGULATION OF NONLINEAR DISCRETE-TIME STRUCTURED SYSTEMS [J].

Shams, N. A. ;

Davison, D. E. .

2011 24TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2011, :611-616

← 1 2 3 4 5 →