End-to-end decentralized formation control using a graph neural network-based learning method

被引:4
作者
Jiang, Chao [1 ]
Huang, Xinchi [2 ]
Guo, Yi [2 ]
机构
[1] Univ Wyoming, Dept Elect Engn & Comp Sci, Laramie, WY 82071 USA
[2] Stevens Inst Technol, Dept Elect & Comp Engn, Hoboken, NJ USA
基金
美国国家科学基金会;
关键词
distributed multi-robot control; multi-robot learning; graph neural network; formation control and coordination; autonomous robots; NAVIGATION;
D O I
10.3389/frobt.2023.1285412
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Multi-robot cooperative control has been extensively studied using model-based distributed control methods. However, such control methods rely on sensing and perception modules in a sequential pipeline design, and the separation of perception and controls may cause processing latencies and compounding errors that affect control performance. End-to-end learning overcomes this limitation by implementing direct learning from onboard sensing data, with control commands output to the robots. Challenges exist in end-to-end learning for multi-robot cooperative control, and previous results are not scalable. We propose in this article a novel decentralized cooperative control method for multi-robot formations using deep neural networks, in which inter-robot communication is modeled by a graph neural network (GNN). Our method takes LiDAR sensor data as input, and the control policy is learned from demonstrations that are provided by an expert controller for decentralized formation control. Although it is trained with a fixed number of robots, the learned control policy is scalable. Evaluation in a robot simulator demonstrates the triangular formation behavior of multi-robot teams of different sizes under the learned control policy.
引用
收藏
页数:11
相关论文
共 38 条
[1]  
Agarwal A., 2020, P 19 INT C AUT AG MU, P1741
[2]   Robust Formation Control for Multiple Underwater Vehicles [J].
Bechlioulis, Charalampos P. ;
Giagkas, Fotis ;
Karras, George C. ;
Kyriakopoulos, Kostas J. .
FRONTIERS IN ROBOTICS AND AI, 2019, 6
[3]   A Framework for Real-World Multi-Robot Systems Running Decentralized GNN-Based Policies [J].
Blumenkamp, Jan ;
Morad, Steven ;
Gielis, Jennifer ;
Li, Qingbiao ;
Prorok, Amanda .
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, :8772-8778
[4]  
Chen Z, 2019, CHIN CONT DECIS CONF, P5501, DOI [10.1109/ccdc.2019.8833001, 10.1109/CCDC.2019.8833001]
[5]  
Cortes Jorge, 2017, SICE Journal of Control, Measurement, and System Integration, V10, P495
[6]   Towards Generalization in Target-Driven Visual Navigation by Using Deep Reinforcement Learning [J].
Devo, Alessandro ;
Mezzetti, Giacomo ;
Costante, Gabriele ;
Fravolini, Mario L. ;
Valigi, Paolo .
IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (05) :1546-1561
[7]   How Fast Is Too Fast? The Role of Perception Latency in High-Speed Sense and Avoid [J].
Falanga, Davide ;
Kim, Suseong ;
Scaramuzza, Davide .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) :1884-1891
[8]  
Foerster J, 2017, PR MACH LEARN RES, V70
[9]  
Foerster JN, 2016, ADV NEUR IN, V29
[10]  
Foerster JN, 2018, AAAI CONF ARTIF INTE, P2974