A Multi-AGV Routing Planning Method Based on Deep Reinforcement Learning and Recurrent Neural Network

被引:16
作者
Lin, Yishuai [1 ]
Hue, Gang [1 ]
Wang, Liang [2 ]
Li, Qingshan [1 ]
Zhu, Jiawei [3 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710000, Peoples R China
[2] Suzhou Mingyi Intelligence Warehousing Informat Te, Kunshan 215300, Peoples R China
[3] Changan Univ, Sch Informat Engn, Xian 710000, Peoples R China
基金
中国国家自然科学基金;
关键词
Planning; Routing; Task analysis; Heuristic algorithms; Path planning; Reinforcement learning; Feature extraction; PATH; SYSTEM;
D O I
10.1109/JAS.2023.123300
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dear Editor, This letter presents a multi-automated guided vehicles (AGV) routing planning method based on deep reinforcement learning (DRL) and recurrent neural network (RNN), specifically utilizing proximal policy optimization (PPO) and long short-term memory (LSTM). Compared to traditional AGV pathing planning methods using genetic algorithm, ant colony optimization algorithm, etc., our proposed method has a higher degree of adaptability to deal with temporary changes of tasks or sudden failures of AGVs. Furthermore, our novel routing method, which uses LSTM to take into account temporal step information, provides a more optimized performance in terms of rewards and convergence speed as compared to existing PPO-based routing methods for AGVs.
引用
收藏
页码:1720 / 1722
页数:3
相关论文
共 17 条
[1]  
Arques Corrales Pilar, 2020, MLMI '20: 2020 The 3rd International Conference on Machine Learning and Machine Intelligence, P65, DOI 10.1145/3426826.3426839
[2]  
Cardona M., 2020, P IEEE ANDESCON DEC, P1
[3]   Research on multi-load AGV path planning of weaving workshop based on time priority [J].
Du, Li-zhen ;
Ke, Shanfu ;
Wang, Zhen ;
Tao, Jing ;
Yu, Lianqing ;
Li, Hongjun .
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (04) :2277-2292
[4]   Human factors in order picking: a content analysis of the literature [J].
Grosse, Eric H. ;
Glock, Christoph H. ;
Neumann, W. Patrick .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2017, 55 (05) :1260-1276
[5]  
Guicheng Shen, 2021, 2021 International Conference on Networking, Communications and Information Technology (NetCIT), P421, DOI 10.1109/NetCIT54147.2021.00090
[6]   An Improved Acceleration Method Based on Multi-Agent System for AGVs Conflict-Free Path Planning in Automated Terminals [J].
Guo, Kunlun ;
Zhu, Jin ;
Shen, Lei .
IEEE ACCESS, 2021, 9 :3326-3338
[7]   Path planning for intelligent robots based on deep Q-learning with experience replay and heuristic knowledge [J].
Jiang, Lan ;
Huang, Hongyun ;
Ding, Zuohua .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (04) :1179-1189
[8]   A Probabilistic Time-Constrained Based Heuristic Path Planning Algorithm in Warehouse Multi-AGV Systems [J].
Lian, Yindong ;
Xie, Wei ;
Zhang, Langwen .
IFAC PAPERSONLINE, 2020, 53 (02) :2538-2543
[9]   MLATSO: A method for task scheduling optimization in multi-load AGVs-based systems [J].
Lin, Yishuai ;
Xu, Yunlong ;
Zhu, Jiawei ;
Wang, Xuhua ;
Wang, Liang ;
Hu, Gang .
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 79
[10]   Time Window Based Genetic Algorithm for Multi-AGVs Conflict-free Path Planning in Automated Container Terminals [J].
Lu, Tong ;
Sun, Zhao-Hui ;
Qiu, Siqi ;
Ming, Xinguo .
2021 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM21), 2021, :603-607