Reciprocal Velocity Obstacle Spatial-Temporal Network for Distributed Multirobot Navigation

被引:1
|
作者
Chen, Lin [1 ,2 ]
Wang, Yaonan [1 ,2 ]
Miao, Zhiqiang [1 ,2 ]
Feng, Mingtao [3 ]
Zhou, Zhen [1 ,2 ]
Wang, Hesheng [4 ]
Wang, Danwei [5 ]
机构
[1] Hunan Univ, Sch Elect & Informat Engn, Changsha 410082, Peoples R China
[2] Natl Engn Res Ctr Robot Visual Percept & Control T, Changsha 410082, Peoples R China
[3] Xidian Univ, Sch Comp Sci & Technol, Xian 710126, Peoples R China
[4] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200030, Peoples R China
[5] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金;
关键词
Collision avoidance; multiagent systems; reinforcement learning; COLLISION-AVOIDANCE; ENVIRONMENT;
D O I
10.1109/TIE.2024.3379630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The core of multirobot collision avoidance lies in developing a decentralized policy that can guide robots from their initial positions to target locations based on the environment states perceived by the robots and ensure collision avoidance. However, the current multirobot collision avoidance policy network is challenging to simultaneously extract the global spatial state, temporal state, and reciprocity among robots, which limits its performance. In this work, we have developed a novel reciprocal velocity obstacle (RVO) spatial-temporal network and employed the proximal policy optimization algorithm to train the network parameters during interactions with amultirobot simulation environment. Specifically, a temporal state encoder module, utilized to represent the temporal characteristics of observation sequence data, is designed and achieved through the combination of the graph attention mechanism and the transformer encoding module. Furthermore, we design a reciprocal spatial state encoder module achieved through the use of a transformer encoding module to merge feature data from long short-term memory (LSTM), GRU, and bidirectional gated recurrent units (BiGRUs) branches, serving the purpose of representing spatial characteristics in RVO sequence data. Extensive simulation experiments demonstrate that our proposed method outperforms the state-of-the-art distributed policy reinforcement learning (RL)-RVO. We further conducted physical experiments using three Crazyflie quadcopter drones, illustrating its ability to effectively guide agents' movements and avoid collisions.
引用
收藏
页码:14470 / 14480
页数:11
相关论文
共 50 条
  • [1] STR: Spatial-Temporal RetNet for Distributed Multi-Robot Navigation
    Chen, Lin
    Wang, Yaonan
    Miao, Zhiqiang
    Feng, Mingtao
    Wang, Yuanzhe
    Mo, Yang
    He, Wei
    Wang, Hesheng
    Wang, Danwei
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025,
  • [2] Reinforcement Learned Distributed Multi-Robot Navigation With Reciprocal Velocity Obstacle Shaped Rewards
    Han, Ruihua
    Chen, Shengduo
    Wang, Shuaijun
    Zhang, Zeqing
    Gao, Rui
    Hao, Qi
    Pan, Jia
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 5896 - 5903
  • [3] Spatial-temporal Graph Transformer Network for Spatial-temporal Forecasting
    Dao, Minh-Son
    Zetsu, Koji
    Hoang, Duy-Tang
    Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024, 2024, : 1276 - 1281
  • [4] Spatial-temporal comprehensive matching evaluation method for distribution network with distributed generation
    Xiao, Jun
    Li, Hang
    Bai, Linquan
    Zhang, Xinsong
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2020, 30 (12):
  • [5] PRVO: Probabilistic Reciprocal Velocity Obstacle for Multi Robot Navigation under Uncertainty
    Gopalakrishnan, Bharath
    Singh, Arun Kumar
    Kaushik, Meha
    Krishna, K. Madhava
    Manocha, Dinesh
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 1089 - 1096
  • [6] Distributed Spatial-Temporal Precoding with Limited Feedback
    Long, Hang
    Wang, Wenbo
    Wang, Fangxiang
    Zheng, Kan
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2010, E93B (02) : 407 - 410
  • [7] Fast Spatial-Temporal Transformer Network
    Escher, Rafael Molossi
    de Bem, Rodrigo Andrade
    Jorge Drews Jr, Paulo Lilles
    2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021), 2021, : 65 - 72
  • [8] Spatial-Temporal Wireless Network Channels
    Chen, Yifan
    Mucchi, Lorenzo
    Wang, Rui
    2013 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2013, : 2597 - 2602
  • [9] Deep Reinforcement Learning Based on Social Spatial-Temporal Graph Convolution Network for Crowd Navigation
    Lu, Yazhou
    Ruan, Xiaogang
    Huang, Jing
    MACHINES, 2022, 10 (08)
  • [10] Nonfeedback Distributed Beamforming Using Spatial-Temporal Extraction
    Sriploy, Pongnarin
    Uthansakul, Monthippa
    INTERNATIONAL JOURNAL OF ANTENNAS AND PROPAGATION, 2016, 2016