Logistics Distribution Route Optimization With Time Windows Based on MultiAgent Deep Reinforcement Learning

被引:1
|
作者
Yu, Fahong [1 ]
Chen, Meijia [1 ]
Xia, Xiaoyun [2 ]
Zhu, Dongping [1 ]
Peng, Qiang [1 ]
Deng, Kuibiao [1 ]
机构
[1] Shanwei Inst Technol, Ctr Intelligent Comp & Secur Res, Shanwei 516600, Guandong, Peoples R China
[2] Jiaxing Univ, Jiaxing 430010, Zhejiang, Peoples R China
关键词
Deep Reinforcement Learning; Logistics Distribution; Multi-Depot; Route Optimization; ALGORITHM;
D O I
10.4018/IJITSA.342084
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-depot vehicle routing problem with time windows (MDVRPTW) is a valuable practical issue in urban logistics. However, heuristic methods may fail to generate high-quality solutions for massive problems instantly. Thus, this article presents a novel reinforcement learning algorithm integrated with a multi-head attention mechanism and a local search strategy to solve the problem efficiently. The routing optimization was regarded as a vehicle tour generation process and an encoder-decoder was used to generate routes for vehicles departing from different depots iteratively. A multi-head attention strategy was employed for mining complex spatiotemporal correlations within time windows in the encoder. Then, a decoder with multi -agent was designed to generate solutions by optimizing reward and observing transition state. Meanwhile, a local search strategy was employed to improve the quality of solutions. The experiments results demonstrate that the proposed method can significantly outperform traditional methods in effectiveness and robustness.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Information Retrieval and Optimization in Distribution and Logistics Management Using Deep Reinforcement Learning
    Yang, Li
    Sathishkumar, V. E.
    Manickam, Adhiyaman
    INTERNATIONAL JOURNAL OF INFORMATION SYSTEMS AND SUPPLY CHAIN MANAGEMENT, 2023, 16 (01)
  • [2] Logistics Distribution Route Optimization Based on Improved Particle Swarm Optimization
    Zhao H.
    Sharma A.
    Informatica (Slovenia), 2023, 47 (02): : 243 - 252
  • [3] An Optimization Method of Logistics Distribution Route based on Autoencoder Network
    Miao, Jiaqi
    Xu, Panfeng
    Sun, Han
    Wang, Ruoxi
    Wang, Youming
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2908 - 2913
  • [4] Route optimization of construction machine by deep reinforcement learning
    Tanabe S.
    Sun Z.
    Nakatani M.
    Uchimura Y.
    IEEJ Transactions on Industry Applications, 2019, 139 (04): : 401 - 408
  • [5] Deep Reinforcement Learning Based Volt-VAR Optimization in Smart Distribution Systems
    Zhang, Ying
    Wang, Xinan
    Wang, Jianhui
    Zhang, Yingchen
    IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (01) : 361 - 371
  • [6] Deep Reinforcement Learning Based Dynamic Route Planning for Minimizing Travel Time
    Geng, Yuanzhe
    Liu, Erwu
    Wang, Rui
    Liu, Yiming
    Rao, Weixiong
    Feng, Shaojun
    Dong, Zhao
    Fu, Zhiren
    Chen, Yanfen
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [7] Route optimization for autonomous bulldozer by distributed deep reinforcement learning
    Osaka, Yasuhiro
    Odajima, Naoya
    Uchimura, Yutaka
    2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS (ICM), 2021,
  • [8] Interterminal Truck Routing Optimization Using Cooperative Multiagent Deep Reinforcement Learning
    Adi, Taufik Nur
    Bae, Hyerim
    Iskandar, Yelita Anggiane
    PROCESSES, 2021, 9 (10)
  • [9] A Multiagent Deep Reinforcement Learning Based Approach for the Optimization of Transformer Life Using Coordinated Electric Vehicles
    Li, Sichen
    Hu, Weihao
    Cao, Di
    Zhang, Zhenyuan
    Huang, Qi
    Chen, Zhe
    Blaabjerg, Frede
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (11) : 7639 - 7652
  • [10] DHL: Deep reinforcement learning-based approach for emergency supply distribution in humanitarian logistics
    Fan, Junchao
    Chang, Xiaolin
    Misic, Jelena
    Misic, Vojislav B.
    Kang, Hongyue
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2022, 15 (05) : 2376 - 2389