Logistics Distribution Route Optimization With Time Windows Based on MultiAgent Deep Reinforcement Learning

被引：1

作者：

Yu, Fahong ^{[1
]}

Chen, Meijia ^{[1
]}

Xia, Xiaoyun ^{[2
]}

Zhu, Dongping ^{[1
]}

Peng, Qiang ^{[1
]}

Deng, Kuibiao ^{[1
]}

机构：

[1] Shanwei Inst Technol, Ctr Intelligent Comp & Secur Res, Shanwei 516600, Guandong, Peoples R China

[2] Jiaxing Univ, Jiaxing 430010, Zhejiang, Peoples R China

来源：

INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH | 2023年 / 17卷 / 01期

关键词：

Deep Reinforcement Learning; Logistics Distribution; Multi-Depot; Route Optimization; ALGORITHM;

D O I：

10.4018/IJITSA.342084

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-depot vehicle routing problem with time windows (MDVRPTW) is a valuable practical issue in urban logistics. However, heuristic methods may fail to generate high-quality solutions for massive problems instantly. Thus, this article presents a novel reinforcement learning algorithm integrated with a multi-head attention mechanism and a local search strategy to solve the problem efficiently. The routing optimization was regarded as a vehicle tour generation process and an encoder-decoder was used to generate routes for vehicles departing from different depots iteratively. A multi-head attention strategy was employed for mining complex spatiotemporal correlations within time windows in the encoder. Then, a decoder with multi -agent was designed to generate solutions by optimizing reward and observing transition state. Meanwhile, a local search strategy was employed to improve the quality of solutions. The experiments results demonstrate that the proposed method can significantly outperform traditional methods in effectiveness and robustness.

引用

页数：23

共 50 条

[31] Reentry trajectory optimization based on Deep Reinforcement Learning
Gao, Jiashi
Shi, Xinming
Cheng, Zhongtao
Xiong, Jizhang
Liu, Lei
Wang, Yongji
Yang, Ye
PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 2588 - 2592
[32] Deep Reinforcement Learning Based Train Driving Optimization
Huang, Jin
Zhang, Ende
Zhang, Jiarui
Huang, Siguang
Zhong, Zhihua
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2375 - 2381
[33] Multiobjective Vehicle Routing Optimization With Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II
Wu, Rixin
Wang, Ran
Hao, Jie
Wu, Qiang
Wang, Ping
Niyato, Dusit
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 4032 - 4047
[34] Deep Reinforcement Learning Based Optimal Route and Charging Station Selection
Lee, Ki-Beom
Ahmed, Mohamed A.
Kang, Dong-Ki
Kim, Young-Chon
ENERGIES, 2020, 13 (23)
[35] Real-time deep reinforcement learning based vehicle navigation
Koh, Songsang
Zhou, Bo
Fang, Hui
Yang, Po
Yang, Zaili
Yang, Qiang
Guan, Lin
Ji, Zhigang
APPLIED SOFT COMPUTING, 2020, 96
[36] Design and optimization of logistics distribution route based on improved ant colony algorithm
Liu, Dan
Hu, Xiulian
Jiang, Qi
OPTIK, 2023, 273
[37] Research on logistics distribution route optimization with time window considering flexible charging strategy
Ge X.-L.
Li Z.-W.
Ge X.-B.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2020, 37 (06): : 1293 - 1301
[38] Optimization of Material Distribution Route Based on Green Logistics and Ant Colony Algorithm
Xu, TianTian
Chen, TianMei
Zhang, Ying
Yu, ChenXi
2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 250 - 254
[39] Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning
Liu, Jia-yi
Wang, Gang
Fu, Qiang
Yue, Shao-hua
Wang, Si-yuan
DEFENCE TECHNOLOGY, 2023, 19 : 210 - 219
[40] Multiagent-based deep reinforcement learning for risk-shifting portfolio management
Lin, Yu-Cen
Chen, Chiao-Ting
Sang, Chuan-Yun
Huang, Szu-Hao
APPLIED SOFT COMPUTING, 2022, 123

← 1 2 3 4 5 →