Reinforcement Learning-Based Formation Pinning and Shape Transformation for Swarms

Cited by: 2

Authors:

Dong, Zhaoqi ^{[1]}

Wu, Qizhen ^{[2]}

Chen, Lei ^{[1,3]}

Affiliations:

[1] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China

[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

[3] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314000, Peoples R China

Source:

DRONES | 2023 / Vol. 7 / Issue 11

Funding:

U.S. National Science Foundation;

Keywords:

reinforcement learning; Boids model; virtual leader; obstacle avoidance; drone swarms; SYSTEMS;

DOI:

10.3390/drones7110673

Chinese Library Classification number:

TP7 [Remote Sensing Technology];

Discipline classification codes:

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

Abstract:

Swarm models are important because they describe the collective behavior of self-organized systems. The Boids model is a fundamental framework for studying emergent behavior in swarm systems. It simulates the emergent behavior of autonomous agents through rules such as alignment, cohesion, and repulsion, imitating natural flocking movements. However, traditional Boids models often lack pinning and cannot adapt quickly to dynamic environments. To address this limitation, we introduce reinforcement learning into the Boids framework to solve the problems of disorder and the lack of pinning, so that drone swarms can adapt quickly and effectively to dynamic external environments. We propose a method based on the Q-learning network that adjusts the cohesion and repulsion parameters of the Boids model to achieve continuous obstacle avoidance and maximize spatial coverage in the simulation scenario. Additionally, we introduce a virtual leader to provide pinning and coordination stability, reflecting the leadership and coordination seen in drone swarms. To validate the method, we demonstrate the model's capabilities through empirical experiments with drone swarms and show the practicality of the RL-Boids framework.

References

Pages: 16

34 entries in total

[1] HAPS-UAV-Enabled Heterogeneous Networks: A Deep Reinforcement Learning Approach [J].

Arani, Atefeh Hajijamali ;

Hu, Peng ;

Zhu, Yeying .

IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2023, 4 :1745-1760

[2] Learning-Based Multi-Robot Formation Control With Obstacle Avoidance [J].

Bai, Chengchao ;

Yan, Peng ;

Pan, Wei ;

Guo, Jifeng .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) :11811-11822

[3] Interaction ruling animal collective behavior depends on topological rather than metric distance: Evidence from a field study [J].

Ballerini, M. ;

Cabibbo, N. ;

Candelier, R. ;

Cavagna, A. ;

Cisbani, E. ;

Giardina, I. ;

Lecomte, V. ;

Orlandi, A. ;

Parisi, G. ;

Procaccini, A. ;

Viale, M. ;

Zdravkovic, V. .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (04) :1232-1237

[4]

Busoniu L, 2010, STUD COMPUT INTELL, V310, P183

[5] Multi-Agent Reinforcement Learning: A Review of Challenges and Applications [J].

Canese, Lorenzo ;

Cardarilli, Gian Carlo ;

Di Nunzio, Luca ;

Fazzolari, Rocco ;

Giardino, Daniele ;

Re, Marco ;

Spano, Sergio .

APPLIED SCIENCES-BASEL, 2021, 11 (11)

[6] Behavior-based swarm robotic search and rescue using fuzzy controller [J].

Din, Ahmad ;

Jabeen, Meh ;

Zia, Kashif ;

Khalid, Abbas ;

Saini, Dinesh Kumar .

COMPUTERS & ELECTRICAL ENGINEERING, 2018, 70 :53-65

[7] Ant colony optimization: Artificial ants as a computational intelligence technique [J].

Dorigo, Marco ;

Birattari, Mauro ;

Stuetzle, Thomas .

IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2006, 1 (04) :28-39

[8]

Droge G, 2015, P AMER CONTR CONF, P2323, DOI 10.1109/ACC.2015.7171079

[9]

Greenwald A., 2003, P 20 INT C MACHINE L, P242

[10]

GUO JL, 2019, INT WIREL COMMUN, P1508, DOI 10.1109/IWCMC.2019.8766625
