STR: Spatial-Temporal RetNet for Distributed Multi-Robot Navigation

被引:2
作者
Chen, Lin [1 ,2 ]
Wang, Yaonan [1 ,2 ]
Miao, Zhiqiang [1 ,2 ]
Feng, Mingtao [3 ]
Wang, Yuanzhe [4 ]
Mo, Yang [1 ,2 ]
He, Wei [5 ,6 ,7 ]
Wang, Hesheng [8 ]
Wang, Danwei [4 ]
机构
[1] Hunan Univ, Sch Elect & Informat Engn, Changsha 410082, Peoples R China
[2] Natl Engn Res Ctr Robot Visual Percept & Control T, Changsha 410082, Peoples R China
[3] Xidian Univ, Sch Comp Sci & Technol, Xian 710126, Peoples R China
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[5] Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing 100083, Peoples R China
[6] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100016, Peoples R China
[7] Beijing Informat Sci & Technol Univ, Inst Artificial Intelligence, Beijing 100016, Peoples R China
[8] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200030, Peoples R China
基金
中国国家自然科学基金; 新加坡国家研究基金会; 中国博士后科学基金;
关键词
Multi-agent systems; collision avoidance; reinforcement learning; SPACE;
D O I
10.1109/TASE.2024.3524000
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The core of multi-robot collision avoidance is to guide robots to avoid collisions with other robots and obstacles in a dynamic multi-robot environment, which has recently gained increasing interest among the main challenges of robotics. However, the current multi-robot navigation policy neural network exhibits weak position encoding capabilities for spatial environmental features in mapping environment states and robot actions, as well as an inability to recurrently infer information on dynamic environmental features in the temporal dimension, leading to insufficient safety and effectiveness in guiding robot motion. In this paper, we propose a novel spatial-temporal RetNet (STR) that encodes reciprocal collision avoidance states between robots in both spatial and temporal dimensions, aiming to enhance the safety and effectiveness of the policy neural network in guiding robots to accomplish specified tasks. The spatial state encoder module is developed based on parallel RetNet structure, which enhances the ability of the neural network in multi-robot navigation policies to extract reciprocal collision avoidance states between robots in spatial dimensions and overcomes the weak position encoding capability of advanced transformer-based multi-robot navigation policy neural networks. A temporal state encoder is designed by introducing the recurrent RetNet structure. This enhances the multi-robot navigation policy neural network's ability to encode features in the temporal dimension of multi-robot movements and overcomes the transformer-based multi-robot navigation policy neural network's inability to recurrently infer information in the time dimension. Simulation experiments were designed to demonstrate that the safety and effectiveness of our proposed method outperform the previous state-of-the-art approaches in guiding the robot to complete the task. Physical experiments illustrate that our policy can be effectively applied to real-world systems. Note to Practitioners-Multi-robot navigation has a wide range of real-world applications, such as multi-robot formation flying for search and rescue, autonomous warehouse operations, and robots navigating through human crowds. This paper introduces a novel Spatial-Temporal RetNet (STR) framework aimed at enhancing safety and effectiveness in multi-robot collision avoidance. STR addresses the limitations of existing methods by improving the neural network's ability to extract reciprocal collision avoidance states in both spatial and temporal dimensions. The spatial state encoder strengthens the extraction of spatial features, while the temporal state encoder improves the handling of time-dependent information. Simulation and physical experiments demonstrate that STR enhances robot navigation in dynamic environments, making it suitable for real-world applications such as multi-robot coordination.
引用
收藏
页码:10429 / 10441
页数:13
相关论文
共 32 条
[1]  
Alonso-Mora J, 2013, SPRINGER TRAC ADV RO, V83, P203
[2]  
[Anonymous], 2017, P IEEE INT C ROB AUT, P285
[3]  
Chen CG, 2019, IEEE INT CONF ROBOT, P6015, DOI [10.1109/ICRA.2019.8794134, 10.1109/icra.2019.8794134]
[4]   Toward Safe Distributed Multi-Robot Navigation Coupled With Variational Bayesian Model [J].
Chen, Lin ;
Wang, Yaonan ;
Miao, Zhiqiang ;
Feng, Mingtao ;
Zhou, Zhen ;
Wang, Hesheng ;
Wang, Danwei .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) :7583-7598
[5]   Transformer-Based Imitative Reinforcement Learning for Multirobot Path Planning [J].
Chen, Lin ;
Wang, Yaonan ;
Miao, Zhiqiang ;
Mo, Yang ;
Feng, Mingtao ;
Zhou, Zhen ;
Wang, Hesheng .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (10) :10233-10243
[6]   Multiagent Path Finding Using Deep Reinforcement Learning Coupled With Hot Supervision Contrastive Loss [J].
Chen, Lin ;
Wang, Yaonan ;
Mo, Yang ;
Miao, Zhiqiang ;
Wang, Hesheng ;
Feng, Mingtao ;
Wang, Sifei .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (07) :7032-7040
[7]   PLANNING COLLISION-FREE PATHS FOR ROBOTIC ARM AMONG OBSTACLES [J].
CHIEN, RT ;
ZHANG, L ;
ZHANG, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1984, 6 (01) :91-96
[8]  
Everett M, 2018, IEEE INT C INT ROBOT, P3052, DOI 10.1109/IROS.2018.8593871
[9]   Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios [J].
Fan, Tingxiang ;
Long, Pinxin ;
Liu, Wenxi ;
Pan, Jia .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (07) :856-892
[10]  
Gopalakrishnan B, 2017, IEEE INT C INT ROBOT, P1089, DOI 10.1109/IROS.2017.8202279