A biologically-inspired reinforcement learning based intelligent distributed flocking control for Multi-Agent Systems in presence of uncertain system and dynamic environment

被引:23
作者
Jafari, Mohammad [1 ]
Xu, Hao [2 ]
Carrillo, Luis Rodolfo Garcia [3 ]
机构
[1] Univ Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USA
[2] Univ Nevada, Dept Elect & Biomed Engn, Reno, NV 89557 USA
[3] Texas A&M Univ, Sch Engn & Comp Sci, Dept Elect Engn, 6300 Ocean Dr,Unit 5797, Corpus Christi, TX 78412 USA
关键词
Biologically-inspired reinforcement learning based intelligent control; BELBIC; Flocking control; Multi-Agent Systems; COMPUTATIONAL MODEL; IMPLEMENTATION; COMMUNICATION; COORDINATION;
D O I
10.1016/j.ifacsc.2020.100096
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we investigate the real-time flocking control of Multi-Agent Systems (MAS) in the presence of system uncertainties and dynamic environment. To handle the impacts from system uncertainties and dynamic environment, a novel reinforcement learning technique, which is appropriate for real-time implementation, has been integrated with multi-agent flocking control in this paper. The Brain Emotional Learning Based Intelligent Controller (BELBIC) is a biologically-inspired reinforcement learning-based controller relying on a computational model of emotional learning in the mammalian limbic system. The learning capabilities, multi-objective properties, and low computational complexity of BELBIC make it a very promising learning technique for implementation in real-time applications. Firstly, a novel brain emotional learning-based flocking control structure is proposed. Then, the realtime update laws are developed to tune the emotional signals based on real-time operational data. It is important to note that this data-driven reinforcement learning approach relaxes the requirement for system dynamics and effectively handle the uncertain impacts of the environment. Using the tuned emotional signals, the optimal flocking control can be obtained. The Lyapunov analysis has been used to prove the convergence of the proposed design. The effectiveness of the proposed design is also demonstrated through numerical and experimental results based on the coordination of multiple Unmanned Aerial Vehicles (UAVs). (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:14
相关论文
共 44 条
  • [1] [Anonymous], 2017, J INSTRUM
  • [2] Emotional learning:: A computational model of the amygdala
    Balkenius, C
    Morén, J
    [J]. CYBERNETICS AND SYSTEMS, 2001, 32 (06) : 611 - 636
  • [3] How Much Control is Enough for Network Connectivity Preservation and Collision Avoidance?
    Chen, Zhiyong
    Fan, Ming-Can
    Zhang, Hai-Tao
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (08) : 1647 - 1656
  • [4] Decentralized connectivity maintenance in mobile networks with bounded inputs
    Dimarogonas, Dimos V.
    Johansson, Karl H.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 1507 - 1512
  • [5] Robust consensus tracking for an integrator-type multi-agent system with disturbances and unmodelled dynamics
    Hu, Guoqiang
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2011, 84 (01) : 1 - 8
  • [6] Multirobot Cooperative Learning for Predator Avoidance
    Hung Manh La
    Lim, Ronny
    Sheng, Weihua
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2015, 23 (01) : 52 - 63
  • [7] A Q-Learning Approach to Flocking With UAVs in a Stochastic Environment
    Hung, Shao-Ming
    Givigi, Sidney N.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (01) : 186 - 197
  • [8] Optimization-Based Distributed Flocking Control for Multiple Rigid Bodies
    Ibuki, Tatsuya
    Wilson, Sean
    Yamauchi, Junya
    Fujita, Masayuki
    Egerstedt, Magnus
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) : 1891 - 1898
  • [9] Jafari M., 2015, On the cooperative control and obstacle avoidance of multivehicle systems
  • [10] Biologically inspired adaptive intelligent secondary control for MGs under cyber imperfections
    Jafari, Mohammad
    Ghasemkhani, Amir
    Sarfi, Vahid
    Livani, Hanif
    Yang, Lei
    Xu, Hao
    [J]. IET CYBER-PHYSICAL SYSTEMS: THEORY & APPLICATIONS, 2019, 4 (04) : 341 - 352