Reciprocal Velocity Obstacle Spatial-Temporal Network for Distributed Multirobot Navigation

Cited by: 1

Authors:

Chen, Lin [1,2]

Wang, Yaonan [1,2]

Miao, Zhiqiang [1,2]

Feng, Mingtao [3]

Zhou, Zhen [1,2]

Wang, Hesheng [4]

Wang, Danwei [5]

Affiliations:

[1] Hunan Univ, Sch Elect & Informat Engn, Changsha 410082, Peoples R China

[2] Natl Engn Res Ctr Robot Visual Percept & Control T, Changsha 410082, Peoples R China

[3] Xidian Univ, Sch Comp Sci & Technol, Xian 710126, Peoples R China

[4] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200030, Peoples R China

[5] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

Source:

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS | 2024 / Vol. 71 / No. 11

Funding:

National Natural Science Foundation of China;

Keywords:

Collision avoidance; multiagent systems; reinforcement learning; COLLISION-AVOIDANCE; ENVIRONMENT;

DOI:

10.1109/TIE.2024.3379630

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

The core of multirobot collision avoidance lies in developing a decentralized policy that guides robots from their initial positions to their target locations, based on the environment states they perceive, while avoiding collisions. However, current multirobot collision avoidance policy networks struggle to simultaneously extract the global spatial state, the temporal state, and the reciprocity among robots, which limits their performance. In this work, we develop a novel reciprocal velocity obstacle (RVO) spatial-temporal network and employ the proximal policy optimization algorithm to train the network parameters through interaction with a multirobot simulation environment. Specifically, we design a temporal state encoder module that represents the temporal characteristics of observation sequence data by combining a graph attention mechanism with a transformer encoding module. Furthermore, we design a reciprocal spatial state encoder module that uses a transformer encoding module to merge feature data from long short-term memory (LSTM), gated recurrent unit (GRU), and bidirectional gated recurrent unit (BiGRU) branches, in order to represent the spatial characteristics of RVO sequence data. Extensive simulation experiments demonstrate that the proposed method outperforms the state-of-the-art distributed policy, reinforcement learning (RL)-RVO. We further conducted physical experiments with three Crazyflie quadcopter drones, illustrating the method's ability to effectively guide agents' movements and avoid collisions.
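For readers unfamiliar with the reciprocal velocity obstacle concept that the abstract builds on, the following is a minimal sketch of the standard geometric RVO membership test for two disc-shaped agents in the plane. It is not the authors' network or code; the function names (`ray_hits_disc`, `in_rvo`), the 2-D tuple representation, and the infinite time horizon are assumptions made purely for illustration.

```python
def ray_hits_disc(rel_pos, rel_vel, radius):
    """Return True if the ray t * rel_vel (t >= 0) enters the disc of the
    given radius centred at rel_pos. Hypothetical helper for illustration."""
    px, py = rel_pos
    vx, vy = rel_vel
    if px * px + py * py <= radius * radius:
        return True  # the agents already overlap
    speed_sq = vx * vx + vy * vy
    if speed_sq == 0.0:
        return False  # no relative motion, so the gap never closes
    # Parameter of the point on the ray closest to the disc centre.
    t = (px * vx + py * vy) / speed_sq
    if t <= 0.0:
        return False  # moving away from (or perpendicular to) the other agent
    cx, cy = px - t * vx, py - t * vy
    return cx * cx + cy * cy < radius * radius


def in_rvo(p_a, v_a, r_a, p_b, v_b, r_b, v_new):
    """Return True if candidate velocity v_new for agent A lies inside the
    reciprocal velocity obstacle induced by agent B (infinite horizon).

    Standard definition assumed here: v_new is in the RVO exactly when
    2 * v_new - v_a lies in the velocity obstacle of B, i.e. when the
    relative velocity (2 * v_new - v_a) - v_b leads to a collision.
    """
    rel_pos = (p_b[0] - p_a[0], p_b[1] - p_a[1])
    rel_vel = (2 * v_new[0] - v_a[0] - v_b[0],
               2 * v_new[1] - v_a[1] - v_b[1])
    return ray_hits_disc(rel_pos, rel_vel, r_a + r_b)


# Two agents heading straight at each other along the x-axis.
p_a, v_a, r_a = (0.0, 0.0), (1.0, 0.0), 0.5
p_b, v_b, r_b = (4.0, 0.0), (-1.0, 0.0), 0.5
keeps_course = in_rvo(p_a, v_a, r_a, p_b, v_b, r_b, (1.0, 0.0))
sidesteps = in_rvo(p_a, v_a, r_a, p_b, v_b, r_b, (0.0, 1.0))
```

In this example, keeping the current velocity falls inside the RVO, while the sideways candidate falls outside it. A learned policy such as the one summarized in the abstract would presumably consume RVO-derived features rather than call a test like this directly.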


Pages: 14470-14480

Number of pages: 11

