The PageRank algorithm as a method to optimize swarm behavior through local analysis

被引:4
作者
Coppola, M. [1 ,2 ]
Guo, J. [2 ]
Gill, E. [2 ]
de Croon, G. C. H. E. [1 ]
机构
[1] Delft Univ Technol, Dept Control & Simulat, Micro Air Vehicle Lab, Fac Aerosp Engn, Kluyverweg 1, NL-2629 HS Delft, Netherlands
[2] Delft Univ Technol, Dept Space Syst Engn, Fac Aerosp Engn, Kluyverweg 1, NL-2629 HS Delft, Netherlands
关键词
Swarm robotics; Microscopic; Micro-macro link; Evolutionary algorithm; Pattern formation; Consensus; Aggregation; Local; PageRank; Centrality; MICRO-MACRO LINK; SYSTEMS;
D O I
10.1007/s11721-019-00172-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work proposes PageRank as a tool to evaluate and optimize the global performance of a swarm based on the analysis of the local behavior of a single robot. PageRank is a graph centrality measure that assesses the importance of nodes based on how likely they are to be reached when traversing a graph. We relate this, using a microscopic model, to a random robot in a swarm that transitions through local states by executing local actions. The PageRank centrality then becomes a measure of how likely it is, given a local policy, for a robot in the swarm to visit each local state. This is used to optimize a stochastic policy such that the robot is most likely to reach the local states that are "desirable," based on the swarm's global goal. The optimization is performed by an evolutionary algorithm, whereby the fitness function maximizes the PageRank score of these local states. The calculation of the PageRank score only scales with the size of the local state space and demands much less computation than swarm simulations would. The approach is applied to a consensus task, a pattern formation task, and an aggregation task. For each task, when all robots in the swarm execute the evolved policy, the swarm significantly outperforms a swarm that uses the baseline policy. When compared to globally optimized policies, the final performance achieved by the swarm is also shown to be comparable. As this new approach is based on a local model, it natively produces controllers that are flexible and robust to global parameters such as the number of robots in the swarm, the environment, and the initial conditions. Furthermore, as the wall-clock time to evaluate the fitness function does not scale with the size of the swarm, it is possible to optimize for larger swarms at no additional computational expense.
引用
收藏
页码:277 / 319
页数:43
相关论文
共 47 条
[1]  
[Anonymous], 1998, P 7 WORLD WID WEB C
[2]  
[Anonymous], 2018, INTRO SWARM ROBOTICS, DOI DOI 10.1007/978-3-319-74528-21
[3]  
[Anonymous], 2008, P 7 INT JOINT C AAMA
[4]  
[Anonymous], 2018, Evolving behaviour trees for swarm robotics
[5]  
Berman S, 2007, LECT NOTES COMPUT SC, V4433, P56
[6]  
Berman S, 2011, IEEE INT C INT ROBOT, P3923, DOI 10.1109/IROS.2011.6048378
[7]   Optimized Stochastic Policies for Task Allocation in Swarms of Robots [J].
Berman, Spring ;
Halasz, Adam ;
Hsieh, M. Ani ;
Kumar, Vijay .
IEEE TRANSACTIONS ON ROBOTICS, 2009, 25 (04) :927-937
[8]  
Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f
[9]  
Campo A, 2007, LECT NOTES ARTIF INT, V4648, P696
[10]   Provable self-organizing pattern formation by a swarm of robots with limited knowledge [J].
Coppola, Mario ;
Guo, Jian ;
Gill, Eberhard ;
de Croon, Guido C. H. E. .
SWARM INTELLIGENCE, 2019, 13 (01) :59-94