Reciprocal Velocity Obstacle Spatial-Temporal Network for Distributed Multirobot Navigation

被引:1
|
作者
Chen, Lin [1 ,2 ]
Wang, Yaonan [1 ,2 ]
Miao, Zhiqiang [1 ,2 ]
Feng, Mingtao [3 ]
Zhou, Zhen [1 ,2 ]
Wang, Hesheng [4 ]
Wang, Danwei [5 ]
机构
[1] Hunan Univ, Sch Elect & Informat Engn, Changsha 410082, Peoples R China
[2] Natl Engn Res Ctr Robot Visual Percept & Control T, Changsha 410082, Peoples R China
[3] Xidian Univ, Sch Comp Sci & Technol, Xian 710126, Peoples R China
[4] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200030, Peoples R China
[5] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金;
关键词
Collision avoidance; multiagent systems; reinforcement learning; COLLISION-AVOIDANCE; ENVIRONMENT;
D O I
10.1109/TIE.2024.3379630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The core of multirobot collision avoidance lies in developing a decentralized policy that can guide robots from their initial positions to target locations based on the environment states perceived by the robots and ensure collision avoidance. However, the current multirobot collision avoidance policy network is challenging to simultaneously extract the global spatial state, temporal state, and reciprocity among robots, which limits its performance. In this work, we have developed a novel reciprocal velocity obstacle (RVO) spatial-temporal network and employed the proximal policy optimization algorithm to train the network parameters during interactions with amultirobot simulation environment. Specifically, a temporal state encoder module, utilized to represent the temporal characteristics of observation sequence data, is designed and achieved through the combination of the graph attention mechanism and the transformer encoding module. Furthermore, we design a reciprocal spatial state encoder module achieved through the use of a transformer encoding module to merge feature data from long short-term memory (LSTM), GRU, and bidirectional gated recurrent units (BiGRUs) branches, serving the purpose of representing spatial characteristics in RVO sequence data. Extensive simulation experiments demonstrate that our proposed method outperforms the state-of-the-art distributed policy reinforcement learning (RL)-RVO. We further conducted physical experiments using three Crazyflie quadcopter drones, illustrating its ability to effectively guide agents' movements and avoid collisions.
引用
收藏
页码:14470 / 14480
页数:11
相关论文
共 50 条
  • [41] Spatial-temporal dynamic semantic graph neural network
    Zhang, Rui
    Xie, Fei
    Sun, Rui
    Huang, Lei
    Liu, Xixiang
    Shi, Jianjun
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (19): : 16655 - 16668
  • [42] Graph Spatial-Temporal Transformer Network for Traffic Prediction
    Zhao, Zhenzhen
    Shen, Guojiang
    Wang, Lei
    Kong, Xiangjie
    BIG DATA RESEARCH, 2024, 36
  • [43] STPNet: A Spatial-Temporal Propagation Network for Background Subtraction
    Yang, Yizhong
    Ruan, Jiahao
    Zhang, Yongqiang
    Cheng, Xin
    Zhang, Zhang
    Xie, Guangjun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2145 - 2157
  • [44] Spatial-Temporal Autoencoder with Attention Network for Video Compression
    Sigger, Neetu
    Al-Jawed, Naseer
    Nguyen, Tuan
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 290 - 300
  • [45] Spatial-Temporal Interleaved Network for Efficient Action Recognition
    Jiang, Shengqin
    Zhang, Haokui
    Qi, Yuankai
    Liu, Qingshan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2025, 21 (01) : 178 - 187
  • [46] Object Tracking via Spatial-Temporal Memory Network
    Zhou, Zikun
    Li, Xin
    Zhang, Tianzhu
    Wang, Hongpeng
    He, Zhenyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 2976 - 2989
  • [47] Spatial-temporal hypergraph convolutional network for traffic forecasting
    Zhao Z.
    Shen G.
    Zhou J.
    Jin J.
    Kong X.
    PeerJ Computer Science, 2023, 9
  • [48] Spatial-Temporal Variation Characteristics and Obstacle Factors of Resilience in Border Cities of Northeast China
    Jiang, Kaiping
    Li, Kaichao
    Cong, Nan
    Wu, Siyu
    Peng, Fei
    LAND, 2023, 12 (05)
  • [49] Hierarchical Spatial-Temporal Network for Skeleton-Based Temporal Action Segmentation
    Tan, Chenwei
    Sun, Tao
    Fu, Talas
    Wang, Yuhan
    Xu, Minjie
    Liu, Shenglan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 28 - 39
  • [50] Enhancing Swift and Socially-Aware Navigation with Continuous Spatial-Temporal Routing
    Ge, Zijian
    Jiang, Jingjing
    Coombes, Matthew
    Liang, Sun
    INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2025, 17 (01) : 87 - 98