Reinforcement learning-based aggregation for robot swarms

Cited by: 4

Authors:

Amjadi, Arash Sadeghi ^{[1,2]}

Bilaloglu, Cem ^{[1]}

Turgut, Ali Emre ^{[1]}

Na, Seongin ^{[3]}

Sahin, Erol ^{[1]}

Krajnik, Tomas ^{[3]}

Arvin, Farshad ^{[4,5]}

Affiliations:

[1] Middle East Tech Univ, Mech Engn Dept, Ankara, Turkiye

[2] Czech Tech Univ, Fac Elect Engn, Dept Comp Sci, Prague, Czech Republic

[3] Univ Manchester, Dept Elect & Elect Engn, Manchester, England

[4] Univ Durham, Dept Comp Sci, Durham, England

[5] Univ Durham, Dept Comp Sci, Durham M13 9PL, England

Source:

ADAPTIVE BEHAVIOR | 2024 / Vol. 32 / Issue 03

Keywords:

Swarm robotics; aggregation; reinforcement learning; bio-inspired; NAVIGATION; PERCEPTION; MODEL;

DOI:

10.1177/10597123231202593

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Aggregation, the gathering of individuals into a single group as observed in animals such as birds, bees, and amoebae, is known to provide protection against predators or resistance to adverse environmental conditions for the group as a whole. Cue-based aggregation, where environmental cues determine the location of aggregation, is known to be challenging when the swarm density is low. Here, we propose a novel aggregation method applicable to real robots in low-density swarms. Previously, the Landmark-Based Aggregation (LBA) method used odometric dead-reckoning coupled with visual landmarks and yielded better aggregation in low-density swarms. However, the method's performance was adversely affected by odometry drift, jeopardizing its application in real-world scenarios. In this article, a novel Reinforcement Learning-based Aggregation method, RLA, is proposed to increase aggregation robustness, thus making aggregation possible for real robots in low-density swarm settings. Systematic experiments conducted in a kinematic-based simulator and on real robots have shown that the RLA method yields larger aggregates, is more robust to odometry noise than the LBA method, and adapts better to environmental changes while not being sensitive to parameter tuning, making it better suited to deployment under real-world conditions.


Pages: 265-281

Page count: 17
