Logistics Distribution Route Optimization With Time Windows Based on Multi-Agent Deep Reinforcement Learning

Cited by: 1

Authors:

Yu, Fahong [1]

Chen, Meijia [1]

Xia, Xiaoyun [2]

Zhu, Dongping [1]

Peng, Qiang [1]

Deng, Kuibiao [1]

Affiliations:

[1] Shanwei Inst Technol, Ctr Intelligent Comp & Secur Res, Shanwei 516600, Guangdong, Peoples R China

[2] Jiaxing Univ, Jiaxing 430010, Zhejiang, Peoples R China

Source:

INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH | 2023 / Vol. 17 / No. 01

Keywords:

Deep Reinforcement Learning; Logistics Distribution; Multi-Depot; Route Optimization; ALGORITHM;

DOI:

10.4018/IJITSA.342084

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

The multi-depot vehicle routing problem with time windows (MDVRPTW) is a practically important problem in urban logistics. However, heuristic methods may fail to generate high-quality solutions quickly for large-scale instances. This article therefore presents a novel reinforcement learning algorithm that integrates a multi-head attention mechanism and a local search strategy to solve the problem efficiently. Route optimization is treated as a vehicle tour generation process, and an encoder-decoder iteratively generates routes for vehicles departing from different depots. In the encoder, a multi-head attention strategy mines the complex spatiotemporal correlations within time windows. A multi-agent decoder then generates solutions by optimizing the reward and observing state transitions. A local search strategy is additionally applied to improve solution quality. The experimental results demonstrate that the proposed method significantly outperforms traditional methods in effectiveness and robustness.
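
Two ingredients named in the abstract, time-window feasibility and local search, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which local search is used, so a 2-opt segment reversal is assumed here, along with Euclidean distances, unit travel speed, and the hypothetical function names `route_cost` and `two_opt`.

```python
import math


def route_cost(route, coords, windows, service=0.0):
    """Return the travel distance of a route, or None if a time window is violated.

    route   : list of node ids that starts and ends at the same depot
    coords  : dict mapping node id -> (x, y)
    windows : dict mapping node id -> (earliest, latest) service start time
    service : service duration at each visited node (assumed constant)
    """
    time = 0.0
    dist = 0.0
    for a, b in zip(route, route[1:]):
        leg = math.dist(coords[a], coords[b])
        dist += leg
        time += leg  # assumption: unit speed, so travel time equals distance
        earliest, latest = windows[b]
        if time > latest:
            return None  # arrived after the window closed
        time = max(time, earliest) + service  # wait if early, then serve
    return dist


def two_opt(route, coords, windows):
    """Improve a feasible route by reversing segments (2-opt, assumed here).

    The depot at both ends is never moved. A candidate is accepted only if
    it stays time-window feasible and is strictly shorter.
    """
    best = route
    best_cost = route_cost(best, coords, windows)
    if best_cost is None:
        return route  # infeasible start: leave the route unchanged
    improved = True
    while improved:
        improved = False
        for i in range(1, len(best) - 2):
            for j in range(i + 1, len(best) - 1):
                cand = best[:i] + best[i:j + 1][::-1] + best[j + 1:]
                cost = route_cost(cand, coords, windows)
                if cost is not None and cost < best_cost - 1e-9:
                    best, best_cost = cand, cost
                    improved = True
    return best
```

In a pipeline like the one the abstract describes, a step of this kind would run on each tour produced by the decoder. How the paper combines the two stages is not stated in this record.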

Pages: 23
