A Multi-AGV Routing Planning Method Based on Deep Reinforcement Learning and Recurrent Neural Network

被引:16
作者
Lin, Yishuai [1 ]
Hue, Gang [1 ]
Wang, Liang [2 ]
Li, Qingshan [1 ]
Zhu, Jiawei [3 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710000, Peoples R China
[2] Suzhou Mingyi Intelligence Warehousing Informat Te, Kunshan 215300, Peoples R China
[3] Changan Univ, Sch Informat Engn, Xian 710000, Peoples R China
基金
中国国家自然科学基金;
关键词
Planning; Routing; Task analysis; Heuristic algorithms; Path planning; Reinforcement learning; Feature extraction; PATH; SYSTEM;
D O I
10.1109/JAS.2023.123300
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dear Editor, This letter presents a multi-automated guided vehicles (AGV) routing planning method based on deep reinforcement learning (DRL) and recurrent neural network (RNN), specifically utilizing proximal policy optimization (PPO) and long short-term memory (LSTM). Compared to traditional AGV pathing planning methods using genetic algorithm, ant colony optimization algorithm, etc., our proposed method has a higher degree of adaptability to deal with temporary changes of tasks or sudden failures of AGVs. Furthermore, our novel routing method, which uses LSTM to take into account temporal step information, provides a more optimized performance in terms of rewards and convergence speed as compared to existing PPO-based routing methods for AGVs.
引用
收藏
页码:1720 / 1722
页数:3
相关论文
共 17 条
[11]   Energy-Efficient Mobile Crowdsensing by Unmanned Vehicles: A Sequential Deep Reinforcement Learning Approach [J].
Piao, Chengzhe ;
Liu, Chi Harold .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) :6312-6324
[12]  
Shuo Li, 2022, 2022 2nd Asia-Pacific Conference on Communications Technology and Computer Science (ACCTCS), P11, DOI 10.1109/ACCTCS53867.2022.00011
[13]   A Multi-Agent Deep Reinforcement Learning Approach for Practical Decentralized UAV Collision Avoidance [J].
Thumiger, Nicholas ;
Deghat, Mohammad .
IEEE CONTROL SYSTEMS LETTERS, 2022, 6 :2174-2179
[14]  
Yang Yang, 2020, Bio-inspired Computing: Theories and Applications: 14th International Conference, BIC-TA 2019. Communications in Computer and Information Science (1159), P527, DOI 10.1007/978-981-15-3425-6_41
[15]   Multi-robot path planning based on a deep reinforcement learning DQN algorithm [J].
Yang Yang ;
Li Juntao ;
Peng Lingling .
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2020, 5 (03) :177-183
[16]   New Automated Guided Vehicle System Using Real-Time Holonic Scheduling for Warehouse Picking [J].
Yoshitake, Hiroshi ;
Kamoshida, Ryota ;
Nagashima, Yoshikazu .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) :1045-1052
[17]  
Zheng Yanbin, 2019, Journal of Computer Applications, V39, P681, DOI 10.11772/j.issn.1001-9081.2018071601