Deep reinforcement learning and ant colony optimization supporting multi-UGV path planning and task assignment in 3D environments

Cited by: 0
Authors
Jin, Binghui [1]
Sun, Yang [1]
Wu, Wenjun [1]
Gao, Qiang [1]
Si, Pengbo [1]
Affiliations
[1] Beijing Univ Sci & Technol, Sch Informat Engn, Beijing 100124, Peoples R China
Keywords
ant colony optimization; deep reinforcement learning; multiple unmanned ground vehicles; path planning; task assignment; ALGORITHM; FIELD;
DOI
10.1049/itr2.12535
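Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract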
With the development of artificial intelligence, the application of unmanned ground vehicles (UGVs) in hazardous outdoor scenarios has received increasing attention. However, the terrain in these environments is often complex and undulating, which poses greater challenges for multi-UGV path planning and task assignment (MUPPTA) optimization. To improve multi-UGV collaboration in 3D environments, a MUPPTA method based on a double deep Q-network (DDQN) and ant colony optimization (ACO) is proposed to jointly optimize the path planning and task assignment decisions of multiple UGVs. The authors first consider the characteristics of 3D environments and model the MUPPTA problem as a combinatorial optimization problem. To tackle it, the original problem is decomposed into a multi-UGV path planning sub-problem and a task assignment sub-problem, which are solved separately. First, the path planning sub-problem in 3D environments is transformed into a Markov decision process (MDP) model, and a multi-UGV path planning algorithm based on DDQN (MUPP-DDQN) is proposed to obtain the optimal paths and the actual path costs between tasks through extensive offline learning and training. Based on these costs, a multi-UGV task assignment algorithm based on ACO (MUTA-ACO) is then proposed to solve the task assignment sub-problem and obtain the optimal task assignment. Simulation results show that the proposed method is more cost-effective and time-saving than the comparison algorithms. This paper focuses on the multi-UGV path planning and task assignment (MUPPTA) problem in 3D environments and proposes a multi-UGV path planning and task assignment method based on double DQN and ACO.
Specifically, the algorithm takes the complex terrain and actual cost in 3D environments into consideration, and an optimization mechanism for multi-UGV path planning and task assignment is established to guide multi-UGV coordination and reduce system costs.
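The abstract names DDQN as the learner behind MUPP-DDQN but gives no network, state, or reward details. As a rough illustration only, the sketch below shows the standard Double DQN target rule, in which the online network selects the next action and the target network evaluates it. The function name `ddqn_targets`, its parameters, and the toy Q-values are hypothetical and are not taken from the paper.

```python
def ddqn_targets(rewards, dones, q_online_next, q_target_next, gamma=0.99):
    """Compute Double DQN learning targets for a batch of transitions.

    rewards        -- list of immediate rewards r
    dones          -- list of booleans, True if the next state is terminal
    q_online_next  -- per-transition list of online-network Q-values at s'
    q_target_next  -- per-transition list of target-network Q-values at s'
    gamma          -- discount factor (assumed value, not from the paper)
    """
    targets = []
    for r, done, q_on, q_tg in zip(rewards, dones, q_online_next, q_target_next):
        if done:
            # No bootstrapping from a terminal state.
            targets.append(r)
            continue
        # Action selection uses the online network...
        best_action = max(range(len(q_on)), key=lambda a: q_on[a])
        # ...while action evaluation uses the target network.
        targets.append(r + gamma * q_tg[best_action])
    return targets


# Toy batch with two transitions and three actions each (illustrative numbers).
example = ddqn_targets(
    rewards=[1.0, -1.0],
    dones=[False, True],
    q_online_next=[[0.2, 0.9, 0.1], [0.5, 0.4, 0.3]],
    q_target_next=[[1.0, 2.0, 3.0], [9.0, 9.0, 9.0]],
    gamma=0.5,
)
```

Separating selection from evaluation in this way is what distinguishes Double DQN from plain DQN, which takes the maximum over the target network's own values and tends to overestimate them.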
Pages: 1652-1664
Page count: 13
Related papers
50 in total
[31]   Reinforcement learning-based multi-strategy cuckoo search algorithm for 3D UAV path planning [J].
Yu, Xiaobing ;
Luo, Wenguan .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223
[32]   Connectivity-Aware 3D UAV Path Design With Deep Reinforcement Learning [J].
Xie, Hao ;
Yang, Dingcheng ;
Xiao, Lin ;
Lyu, Jiangbin .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) :13022-13034
[33]   Sequence-to-Sequence Multi-Agent Reinforcement Learning for Multi-UAV Task Planning in 3D Dynamic Environment [J].
Liu, Ziwei ;
Qiu, Changzhen ;
Zhang, Zhiyong .
APPLIED SCIENCES-BASEL, 2022, 12 (23)
[34]   Improved Ant Colony optimization Algorithm and Its Application for Path Planning of Mobile Robot in 3-D Space [J].
Zhao Juan-ping ;
Gao Xian-wen ;
Liu Jin-gang ;
Fu Xiu-hui .
2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 3, 2010, :194-198
[35]   Hierarchical Planning with Deep Reinforcement Learning for 3D Navigation of Microrobots in Blood Vessels [J].
Yang, Yuguang ;
Bevan, Michael A. ;
Li, Bo .
ADVANCED INTELLIGENT SYSTEMS, 2022, 4 (11)
[36]   LTL TASK DECOMPOSITION FOR 3D HIGH-LEVEL PATH PLANNING IN KNOWN AND STATIC ENVIRONMENTS [J].
Hustiu, Sofia ;
Hustiu, Ioana ;
Kloetzer, Marius ;
Mahulea, Cristian .
CONTROL ENGINEERING AND APPLIED INFORMATICS, 2021, 23 (03) :76-87
[37]   3D UAV Path Planning via Potential Filed-Imitation Reinforcement Learning [J].
Han, Jiale ;
Yang, Fan ;
Yang, Jian ;
Kang, Xueping .
2024 43RD CHINESE CONTROL CONFERENCE, CCC 2024, 2024, :4742-4748
[38]   Integrating Heuristic Methods with Deep Reinforcement Learning for Online 3D Bin-Packing Optimization [J].
Wong, Ching-Chang ;
Tsai, Tai-Ting ;
Ou, Can-Kun .
SENSORS, 2024, 24 (16)
[39]   Deep reinforcement learning and 3D physical environments applied to crowd evacuation in congested scenarios [J].
Zhang, Dong ;
Li, Wenhang ;
Gong, Jianhua ;
Zhang, Guoyong ;
Liu, Jiantao ;
Huang, Lin ;
Liu, Heng ;
Ma, Haonan .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (01) :691-714
[40]   UAV Navigation in 3D Urban Environments with Curriculum-based Deep Reinforcement Learning [J].
de Carvalho, Kevin Braathen ;
de Oliveira, Iure Rosa L. ;
Brandao, Alexandre S. .
2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, :1249-1255