Distributed and Scalable Cooperative Formation of Unmanned Ground Vehicles Using Deep Reinforcement Learning

被引：2

作者：

Huang, Shichun ^{[1
]}

Wang, Tao ^{[1
,2
,3
]}

Tang, Yong ^{[4
,5
]}

Hu, Yiwen ^{[6
]}

Xin, Gu ^{[7
]}

Zhou, Dianle ^{[8
]}

机构：

[1] Sun Yat sen Univ, Sch Intelligent Syst Engn, Guangzhou 510275, Peoples R China

[2] Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519000, Peoples R China

[3] Guangdong Prov Key Lab Fire Sci & Intelligent Emer, Guangzhou 510006, Peoples R China

[4] Northwestern Polytech Univ, Sch Civil Aviat, Xian 710072, Peoples R China

[5] UAS Co Ltd, Aviat Ind Corp China Chengdu, Chengdu 610091, Peoples R China

[6] AVIC Chengdu Aircraft Design & Res Inst, Chengdu 610041, Peoples R China

[7] China Acad Launch Vehicle Technol, Dept Res & Dev Ctr, Beijing 100076, Peoples R China

[8] Natl Univ Def Technol, Coll Adv Interdisciplinary Studies, Changsha 410073, Peoples R China

来源：

AEROSPACE | 2023年 / 10卷 / 02期

基金：

中国国家自然科学基金;

关键词：

unmanned ground vehicles (UGVs); deep reinforcement learning; deep deterministic policy gradient (DDPG); multiagent systems; distributed formation control; MOBILE ROBOT;

D O I：

10.3390/aerospace10020096

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Cooperative formation control of unmanned ground vehicles (UGVs) has become one of the important research hotspots in the application of UGV and attracted more and more attention in the military and civil fields. Compared with traditional formation control algorithms, reinforcement-learning-based algorithms can provide a new solution with a lower complexity for real-time formation control by equipping UGVs with artificial intelligence. Therefore, in this paper, a distributed deep-reinforcement-learning-based cooperative formation control algorithm is proposed to solve the navigation, maintenance, and obstacle avoidance tasks of UGV formations. More importantly, the hierarchical triangular formation structure and the newly designed Markov decision process for UGV formations of leader and follower attributes make the control strategy learned by the algorithm reusable, so that the formation can arbitrarily increase the number of UGVs and realize a more flexible expansion. The effectiveness and scalability of the algorithm is verified by formation simulation experiments of different scales.

引用

页数：20

共 50 条

[31] Steering control in autonomous vehicles using deep reinforcement learning
Xue Chong
Zhang Xinyu
Jia Peng
The Journal of China Universities of Posts and Telecommunications, 2018, 25 (06) : 58 - 64
[32] Prescriptive Maintenance of Freight Vehicles using Deep Reinforcement Learning
Tham, Chen-Khong
Liu, Weihao
Chattopadhyay, Rajarshi
2023 IEEE 97TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-SPRING, 2023,
[33] Collision avoidance for an unmanned surface vehicle using deep reinforcement learning
Woo, Joohyun
Kim, Nakwan
OCEAN ENGINEERING, 2020, 199
[34] Autonomous Control of Combat Unmanned Aerial Vehicles to Evade Surface-to-Air Missiles Using Deep Reinforcement Learning
Lee, Gyeong Taek
Kim, Chang Ouk
IEEE ACCESS, 2020, 8 : 226724 - 226736
[35] Collision-avoidance under COLREGS for unmanned surface vehicles via deep reinforcement learning
Ma, Yong
Zhao, Yujiao
Wang, Yulong
Gan, Langxiong
Zheng, Yuanzhou
MARITIME POLICY & MANAGEMENT, 2020, 47 (05) : 665 - 686
[36] Global path planning for amphibious unmanned vehicles with multiple constraints via deep reinforcement learning
Wu, Ting
Wang, Ronghao
Zhang, Yan
Meng, Yuhang
Xiang, Yuzhu
Xiang, Zhengrong
2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1296 - 1301
[37] Scalable and Cooperative Deep Reinforcement Learning Approaches for Multi-UAV Systems: A Systematic Review
Frattolillo, Francesco
Brunori, Damiano
Iocchi, Luca
DRONES, 2023, 7 (04)
[38] Cooperative Traffic Signal Control Using a Distributed Agent-Based Deep Reinforcement Learning With Incentive Communication
Zhou, Bin
Zhou, Qishen
Hu, Simon
Ma, Dongfang
Jin, Sheng
Lee, Der-Horng
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 10147 - 10160
[39] Negotiating team formation using deep reinforcement learning
Bachrach, Yoram
Everett, Richard
Hughes, Edward
Lazaridou, Angeliki
Leibo, Joel Z.
Lanctot, Marc
Johanson, Michael
Czarnecki, Wojciech M.
Graepel, Thore
ARTIFICIAL INTELLIGENCE, 2020, 288
[40] Double Critic Deep Reinforcement Learning for Mapless 3D Navigation of Unmanned Aerial Vehicles
Ricardo Bedin Grando
Junior Costa de Jesus
Victor Augusto Kich
Alisson Henrique Kolling
Paulo Lilles Jorge Drews-Jr
Journal of Intelligent & Robotic Systems, 2022, 104

← 1 2 3 4 5 →