A biologically-inspired reinforcement learning based intelligent distributed flocking control for Multi-Agent Systems in presence of uncertain system and dynamic environment

被引:23
作者
Jafari, Mohammad [1 ]
Xu, Hao [2 ]
Carrillo, Luis Rodolfo Garcia [3 ]
机构
[1] Univ Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USA
[2] Univ Nevada, Dept Elect & Biomed Engn, Reno, NV 89557 USA
[3] Texas A&M Univ, Sch Engn & Comp Sci, Dept Elect Engn, 6300 Ocean Dr,Unit 5797, Corpus Christi, TX 78412 USA
关键词
Biologically-inspired reinforcement learning based intelligent control; BELBIC; Flocking control; Multi-Agent Systems; COMPUTATIONAL MODEL; IMPLEMENTATION; COMMUNICATION; COORDINATION;
D O I
10.1016/j.ifacsc.2020.100096
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we investigate the real-time flocking control of Multi-Agent Systems (MAS) in the presence of system uncertainties and dynamic environment. To handle the impacts from system uncertainties and dynamic environment, a novel reinforcement learning technique, which is appropriate for real-time implementation, has been integrated with multi-agent flocking control in this paper. The Brain Emotional Learning Based Intelligent Controller (BELBIC) is a biologically-inspired reinforcement learning-based controller relying on a computational model of emotional learning in the mammalian limbic system. The learning capabilities, multi-objective properties, and low computational complexity of BELBIC make it a very promising learning technique for implementation in real-time applications. Firstly, a novel brain emotional learning-based flocking control structure is proposed. Then, the realtime update laws are developed to tune the emotional signals based on real-time operational data. It is important to note that this data-driven reinforcement learning approach relaxes the requirement for system dynamics and effectively handle the uncertain impacts of the environment. Using the tuned emotional signals, the optimal flocking control can be obtained. The Lyapunov analysis has been used to prove the convergence of the proposed design. The effectiveness of the proposed design is also demonstrated through numerical and experimental results based on the coordination of multiple Unmanned Aerial Vehicles (UAVs). (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:14
相关论文
共 44 条
[41]   Formation Control and Obstacle Avoidance of Multiple Rectangular Agents With Limited Communication Ranges [J].
Thang Nguyen ;
La, Hung Manh ;
Le, Tuan Dzung ;
Jafari, Mohammad .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2017, 4 (04) :680-691
[42]   Activated γδ T Cells Promote Dendritic Cell Maturation and Exacerbate the Development of Experimental Autoimmune Uveitis (EAU) in Mice [J].
Wang, Beibei ;
Tian, Qingmei ;
Guo, Dadong ;
Lin, Wei ;
Xie, Xiaofeng ;
Bi, Hongsheng .
IMMUNOLOGICAL INVESTIGATIONS, 2021, 50 (2-3) :164-183
[43]   Robust Global Coordination of Networked Systems With Input Saturation and External Disturbances [J].
Wang, Xiaoling ;
Jiang, Guo-Ping ;
Su, Housheng ;
Wang, Xiaofan .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (12) :7788-7800
[44]  
Zhiyuan Li, 2011, 2011 American Control Conference - ACC 2011, P2204