Distributed and Scalable Cooperative Formation of Unmanned Ground Vehicles Using Deep Reinforcement Learning

被引:2
|
作者
Huang, Shichun [1 ]
Wang, Tao [1 ,2 ,3 ]
Tang, Yong [4 ,5 ]
Hu, Yiwen [6 ]
Xin, Gu [7 ]
Zhou, Dianle [8 ]
机构
[1] Sun Yat sen Univ, Sch Intelligent Syst Engn, Guangzhou 510275, Peoples R China
[2] Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519000, Peoples R China
[3] Guangdong Prov Key Lab Fire Sci & Intelligent Emer, Guangzhou 510006, Peoples R China
[4] Northwestern Polytech Univ, Sch Civil Aviat, Xian 710072, Peoples R China
[5] UAS Co Ltd, Aviat Ind Corp China Chengdu, Chengdu 610091, Peoples R China
[6] AVIC Chengdu Aircraft Design & Res Inst, Chengdu 610041, Peoples R China
[7] China Acad Launch Vehicle Technol, Dept Res & Dev Ctr, Beijing 100076, Peoples R China
[8] Natl Univ Def Technol, Coll Adv Interdisciplinary Studies, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
unmanned ground vehicles (UGVs); deep reinforcement learning; deep deterministic policy gradient (DDPG); multiagent systems; distributed formation control; MOBILE ROBOT;
D O I
10.3390/aerospace10020096
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Cooperative formation control of unmanned ground vehicles (UGVs) has become one of the important research hotspots in the application of UGV and attracted more and more attention in the military and civil fields. Compared with traditional formation control algorithms, reinforcement-learning-based algorithms can provide a new solution with a lower complexity for real-time formation control by equipping UGVs with artificial intelligence. Therefore, in this paper, a distributed deep-reinforcement-learning-based cooperative formation control algorithm is proposed to solve the navigation, maintenance, and obstacle avoidance tasks of UGV formations. More importantly, the hierarchical triangular formation structure and the newly designed Markov decision process for UGV formations of leader and follower attributes make the control strategy learned by the algorithm reusable, so that the formation can arbitrarily increase the number of UGVs and realize a more flexible expansion. The effectiveness and scalability of the algorithm is verified by formation simulation experiments of different scales.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Steering control in autonomous vehicles using deep reinforcement learning
    Xue Chong
    Zhang Xinyu
    Jia Peng
    The Journal of China Universities of Posts and Telecommunications, 2018, 25 (06) : 58 - 64
  • [32] Prescriptive Maintenance of Freight Vehicles using Deep Reinforcement Learning
    Tham, Chen-Khong
    Liu, Weihao
    Chattopadhyay, Rajarshi
    2023 IEEE 97TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-SPRING, 2023,
  • [33] Collision avoidance for an unmanned surface vehicle using deep reinforcement learning
    Woo, Joohyun
    Kim, Nakwan
    OCEAN ENGINEERING, 2020, 199
  • [34] Autonomous Control of Combat Unmanned Aerial Vehicles to Evade Surface-to-Air Missiles Using Deep Reinforcement Learning
    Lee, Gyeong Taek
    Kim, Chang Ouk
    IEEE ACCESS, 2020, 8 : 226724 - 226736
  • [35] Collision-avoidance under COLREGS for unmanned surface vehicles via deep reinforcement learning
    Ma, Yong
    Zhao, Yujiao
    Wang, Yulong
    Gan, Langxiong
    Zheng, Yuanzhou
    MARITIME POLICY & MANAGEMENT, 2020, 47 (05) : 665 - 686
  • [36] Global path planning for amphibious unmanned vehicles with multiple constraints via deep reinforcement learning
    Wu, Ting
    Wang, Ronghao
    Zhang, Yan
    Meng, Yuhang
    Xiang, Yuzhu
    Xiang, Zhengrong
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1296 - 1301
  • [37] Scalable and Cooperative Deep Reinforcement Learning Approaches for Multi-UAV Systems: A Systematic Review
    Frattolillo, Francesco
    Brunori, Damiano
    Iocchi, Luca
    DRONES, 2023, 7 (04)
  • [38] Cooperative Traffic Signal Control Using a Distributed Agent-Based Deep Reinforcement Learning With Incentive Communication
    Zhou, Bin
    Zhou, Qishen
    Hu, Simon
    Ma, Dongfang
    Jin, Sheng
    Lee, Der-Horng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 10147 - 10160
  • [39] Negotiating team formation using deep reinforcement learning
    Bachrach, Yoram
    Everett, Richard
    Hughes, Edward
    Lazaridou, Angeliki
    Leibo, Joel Z.
    Lanctot, Marc
    Johanson, Michael
    Czarnecki, Wojciech M.
    Graepel, Thore
    ARTIFICIAL INTELLIGENCE, 2020, 288
  • [40] Double Critic Deep Reinforcement Learning for Mapless 3D Navigation of Unmanned Aerial Vehicles
    Ricardo Bedin Grando
    Junior Costa de Jesus
    Victor Augusto Kich
    Alisson Henrique Kolling
    Paulo Lilles Jorge Drews-Jr
    Journal of Intelligent & Robotic Systems, 2022, 104