Optimal Containment Control for Unknown Active Heterogeneous MASs via Model-Free Recursive Reinforcement Learning

被引:0
作者
Xia, Lina [1 ,2 ]
Li, Qing [2 ]
Song, Ruizhuo [1 ]
Yang, Gaofu [1 ]
机构
[1] Univ Sci & Technol Beijing, Beijing Engn Res Ctr Ind Spectrum Imaging, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Key Lab Knowledge Automat Ind Proc, Minist Educ, Beijing 100083, Peoples R China
来源
IEEE ACCESS | 2025年 / 13卷
基金
中国国家自然科学基金;
关键词
Observers; Protocols; Heuristic algorithms; Optimal control; Reinforcement learning; Mathematical models; Directed graphs; Convergence; Laplace equations; Convex hulls; Optimal containment control; active leaders; fully distributed observers; model-free recursive reinforcement learning; HOMOGENEOUS MULTIAGENT SYSTEMS; CONTINUOUS-TIME SYSTEMS; TRACKING CONTROL; OUTPUT REGULATION; CONSENSUS; LEADER; SYNCHRONIZATION;
D O I
10.1109/ACCESS.2025.3526871
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The distributed optimal output containment control problem for multi-agent systems (MASs) involves coordinating a group of autonomous agents to drive the outputs of all followers into the convex hull spanned by the outputs of the leaders while optimizing system performance, which has numerous applications. In this paper, a fully distributed optimal containment tracking control protocol is established for unknown active heterogeneous MASs with external disturbances. Firstly, a fully distributed observer is designed to ensure its trajectory stays within the convex hull established by active leaders without requiring global network topology information. Subsequently, an augmented system is constructed using the dynamics of the followers and the observers to design $H_{\infty }$ optimal containment control protocol. Then, a model-free recursive reinforcement learning (RRL) algorithm is devised to learn the optimal control protocol, which demonstrates that the weight iteration error asymptotically converges to zero, and the algorithm exhibits favorable convergence speed. Finally, the effectiveness of the proposed improved algorithm is validated using a heterogeneous nonlinear multi-agent model.
引用
收藏
页码:7603 / 7613
页数:11
相关论文
共 50 条
  • [41] On Model-Free Reinforcement Learning of Reduced-order Optimal Control for Singularly Perturbed Systems
    Mukherjee, Sayak
    Bai, He
    Chakrabortty, Aranya
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 5288 - 5293
  • [42] An Improved Reinforcement Learning Based Heuristic Dynamic Programming Algorithm for Model-Free Optimal Control
    Li, Jia
    Yuan, Zhaolin
    Ban, Xiaojuan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 282 - 294
  • [43] On Distributed Model-Free Reinforcement Learning Control with Stability Guarantee
    Mukherjee, Sayak
    Thanh Long Vu
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 2175 - 2180
  • [44] Input-constrained optimal output synchronization of heterogeneous multiagent systems via observer-based model-free reinforcement learning
    Zhang, Tengfei
    Jia, Yingmin
    ASIAN JOURNAL OF CONTROL, 2024, 26 (01) : 98 - 113
  • [45] Model-Free Emergency Frequency Control Based on Reinforcement Learning
    Chen, Chunyu
    Cui, Mingjian
    Li, Fangxing
    Yin, Shengfei
    Wang, Xinan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (04) : 2336 - 2346
  • [46] Model-free Control for Stratospheric Airship Based on Reinforcement Learning
    Nie, Chunyu
    Zhu, Ming
    Zheng, Zewei
    Wu, Zhe
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 10702 - 10707
  • [47] An Hybrid Model-Free Reinforcement Learning Approach for HVAC Control
    Solinas, Francesco M.
    Bellagarda, Andrea
    Macii, Enrico
    Patti, Edoardo
    Bottaccioli, Lorenzo
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2021 5TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2021,
  • [48] Model-free closed-loop wind farm control using reinforcement learning with recursive least squares
    Liew, Jaime
    Gocmen, Tuhfe
    Lio, Wai Hou
    Larsen, Gunner Chr.
    WIND ENERGY, 2024, 27 (11) : 1173 - 1187
  • [49] Model-Free Reinforcement Learning for Mean Field Games
    Mishra, Rajesh
    Vasal, Deepanshu
    Vishwanath, Sriram
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (04): : 2141 - 2151
  • [50] Model-free H control of Itô stochastic system via off-policy reinforcement learning
    Zhang, Weihai
    Guo, Jing
    Jiang, Xiushan
    AUTOMATICA, 2025, 174