A biologically-inspired reinforcement learning based intelligent distributed flocking control for Multi-Agent Systems in presence of uncertain system and dynamic environment

Cited by: 23

Authors:

Jafari, Mohammad [1]

Xu, Hao [2]

Carrillo, Luis Rodolfo Garcia [3]

Affiliations:

[1] Univ Calif Santa Cruz, Jack Baskin Sch Engn, Dept Appl Math, 1156 High St, Santa Cruz, CA 95064 USA

[2] Univ Nevada, Dept Elect & Biomed Engn, Reno, NV 89557 USA

[3] Texas A&M Univ, Sch Engn & Comp Sci, Dept Elect Engn, 6300 Ocean Dr,Unit 5797, Corpus Christi, TX 78412 USA

Source:

IFAC JOURNAL OF SYSTEMS AND CONTROL | 2020 / Vol. 13

Keywords:

Biologically-inspired reinforcement learning based intelligent control; BELBIC; Flocking control; Multi-Agent Systems; COMPUTATIONAL MODEL; IMPLEMENTATION; COMMUNICATION; COORDINATION;

DOI:

10.1016/j.ifacsc.2020.100096

Chinese Library Classification number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

In this paper, we investigate the real-time flocking control of Multi-Agent Systems (MAS) in the presence of system uncertainties and a dynamic environment. To handle the impacts of system uncertainties and the dynamic environment, a novel reinforcement learning technique, which is appropriate for real-time implementation, has been integrated with multi-agent flocking control. The Brain Emotional Learning Based Intelligent Controller (BELBIC) is a biologically-inspired reinforcement learning-based controller relying on a computational model of emotional learning in the mammalian limbic system. The learning capabilities, multi-objective properties, and low computational complexity of BELBIC make it a very promising learning technique for implementation in real-time applications. Firstly, a novel brain emotional learning-based flocking control structure is proposed. Then, the real-time update laws are developed to tune the emotional signals based on real-time operational data. It is important to note that this data-driven reinforcement learning approach relaxes the requirement for system dynamics and effectively handles the uncertain impacts of the environment. Using the tuned emotional signals, the optimal flocking control can be obtained. The Lyapunov analysis has been used to prove the convergence of the proposed design. The effectiveness of the proposed design is also demonstrated through numerical and experimental results based on the coordination of multiple Unmanned Aerial Vehicles (UAVs). (C) 2020 Elsevier Ltd. All rights reserved.
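The abstract does not give the controller's update laws, so the following is only a minimal sketch of the generic BELBIC learning rule as it commonly appears in the literature (an amygdala path and an orbitofrontal path acting on a sensory input `SI` and an emotional signal `ES`). It is not the paper's flocking controller. The class name `Belbic`, the learning rates `alpha` and `beta`, and the omission of the thalamic input are all assumptions made for illustration.

```python
import numpy as np


class Belbic:
    """Illustrative generic BELBIC cell (hypothetical names, not the paper's design)."""

    def __init__(self, n_inputs, alpha=0.1, beta=0.05):
        self.v = np.zeros(n_inputs)  # amygdala gains
        self.w = np.zeros(n_inputs)  # orbitofrontal cortex gains
        self.alpha = alpha           # amygdala learning rate (assumed value)
        self.beta = beta             # orbitofrontal learning rate (assumed value)

    def step(self, si, es):
        """Return the model output for sensory input `si`, then adapt the gains."""
        si = np.asarray(si, dtype=float)
        a = self.v * si              # amygdala node outputs
        o = self.w * si              # orbitofrontal node outputs
        out = a.sum() - o.sum()      # excitation minus inhibition
        # Amygdala rule: the max(0, .) term means that, for non-negative
        # sensory inputs, these gains never decrease.
        self.v += self.alpha * si * max(0.0, es - a.sum())
        # Orbitofrontal rule: corrects the mismatch between the output and
        # the emotional signal, and may move in either direction.
        self.w += self.beta * si * (out - es)
        return out


def run(controller, si, es, steps):
    """Apply a constant (si, es) pair for `steps` iterations; return all outputs."""
    return [controller.step(si, es) for _ in range(steps)]
```

In a control loop, `si` and `es` would typically be built from tracking errors and control effort; how the paper constructs them for flocking is not stated in this record.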

References

Pages: 14

44 entries in total

[1]

[Anonymous], 2017, J INSTRUM

[2] Emotional learning: A computational model of the amygdala [J].

Balkenius, C ;

Morén, J .

CYBERNETICS AND SYSTEMS, 2001, 32 (06) :611-636

[3] How Much Control is Enough for Network Connectivity Preservation and Collision Avoidance? [J].

Chen, Zhiyong ;

Fan, Ming-Can ;

Zhang, Hai-Tao .

IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (08) :1647-1656

[4] Decentralized connectivity maintenance in mobile networks with bounded inputs [J].

Dimarogonas, Dimos V. ;

Johansson, Karl H. .

2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, :1507-1512

[5] Robust consensus tracking for an integrator-type multi-agent system with disturbances and unmodelled dynamics [J].

Hu, Guoqiang .

INTERNATIONAL JOURNAL OF CONTROL, 2011, 84 (01) :1-8

[6] Multirobot Cooperative Learning for Predator Avoidance [J].

Hung Manh La ;

Lim, Ronny ;

Sheng, Weihua .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2015, 23 (01) :52-63

[7] A Q-Learning Approach to Flocking With UAVs in a Stochastic Environment [J].

Hung, Shao-Ming ;

Givigi, Sidney N. .

IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (01) :186-197

[8] Optimization-Based Distributed Flocking Control for Multiple Rigid Bodies [J].

Ibuki, Tatsuya ;

Wilson, Sean ;

Yamauchi, Junya ;

Fujita, Masayuki ;

Egerstedt, Magnus .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :1891-1898

[9]

Jafari M., 2015, On the cooperative control and obstacle avoidance of multivehicle systems

[10] Biologically inspired adaptive intelligent secondary control for MGs under cyber imperfections [J].

Jafari, Mohammad ;

Ghasemkhani, Amir ;

Sarfi, Vahid ;

Livani, Hanif ;

Yang, Lei ;

Xu, Hao .

IET CYBER-PHYSICAL SYSTEMS: THEORY & APPLICATIONS, 2019, 4 (04) :341-352
