Logistics Distribution Route Optimization With Time Windows Based on MultiAgent Deep Reinforcement Learning

被引:1
|
作者
Yu, Fahong [1 ]
Chen, Meijia [1 ]
Xia, Xiaoyun [2 ]
Zhu, Dongping [1 ]
Peng, Qiang [1 ]
Deng, Kuibiao [1 ]
机构
[1] Shanwei Inst Technol, Ctr Intelligent Comp & Secur Res, Shanwei 516600, Guandong, Peoples R China
[2] Jiaxing Univ, Jiaxing 430010, Zhejiang, Peoples R China
关键词
Deep Reinforcement Learning; Logistics Distribution; Multi-Depot; Route Optimization; ALGORITHM;
D O I
10.4018/IJITSA.342084
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-depot vehicle routing problem with time windows (MDVRPTW) is a valuable practical issue in urban logistics. However, heuristic methods may fail to generate high-quality solutions for massive problems instantly. Thus, this article presents a novel reinforcement learning algorithm integrated with a multi-head attention mechanism and a local search strategy to solve the problem efficiently. The routing optimization was regarded as a vehicle tour generation process and an encoder-decoder was used to generate routes for vehicles departing from different depots iteratively. A multi-head attention strategy was employed for mining complex spatiotemporal correlations within time windows in the encoder. Then, a decoder with multi -agent was designed to generate solutions by optimizing reward and observing transition state. Meanwhile, a local search strategy was employed to improve the quality of solutions. The experiments results demonstrate that the proposed method can significantly outperform traditional methods in effectiveness and robustness.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Reentry trajectory optimization based on Deep Reinforcement Learning
    Gao, Jiashi
    Shi, Xinming
    Cheng, Zhongtao
    Xiong, Jizhang
    Liu, Lei
    Wang, Yongji
    Yang, Ye
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 2588 - 2592
  • [32] Deep Reinforcement Learning Based Train Driving Optimization
    Huang, Jin
    Zhang, Ende
    Zhang, Jiarui
    Huang, Siguang
    Zhong, Zhihua
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2375 - 2381
  • [33] Multiobjective Vehicle Routing Optimization With Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II
    Wu, Rixin
    Wang, Ran
    Hao, Jie
    Wu, Qiang
    Wang, Ping
    Niyato, Dusit
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 4032 - 4047
  • [34] Deep Reinforcement Learning Based Optimal Route and Charging Station Selection
    Lee, Ki-Beom
    Ahmed, Mohamed A.
    Kang, Dong-Ki
    Kim, Young-Chon
    ENERGIES, 2020, 13 (23)
  • [35] Real-time deep reinforcement learning based vehicle navigation
    Koh, Songsang
    Zhou, Bo
    Fang, Hui
    Yang, Po
    Yang, Zaili
    Yang, Qiang
    Guan, Lin
    Ji, Zhigang
    APPLIED SOFT COMPUTING, 2020, 96
  • [36] Design and optimization of logistics distribution route based on improved ant colony algorithm
    Liu, Dan
    Hu, Xiulian
    Jiang, Qi
    OPTIK, 2023, 273
  • [37] Research on logistics distribution route optimization with time window considering flexible charging strategy
    Ge X.-L.
    Li Z.-W.
    Ge X.-B.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2020, 37 (06): : 1293 - 1301
  • [38] Optimization of Material Distribution Route Based on Green Logistics and Ant Colony Algorithm
    Xu, TianTian
    Chen, TianMei
    Zhang, Ying
    Yu, ChenXi
    2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 250 - 254
  • [39] Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning
    Liu, Jia-yi
    Wang, Gang
    Fu, Qiang
    Yue, Shao-hua
    Wang, Si-yuan
    DEFENCE TECHNOLOGY, 2023, 19 : 210 - 219
  • [40] Multiagent-based deep reinforcement learning for risk-shifting portfolio management
    Lin, Yu-Cen
    Chen, Chiao-Ting
    Sang, Chuan-Yun
    Huang, Szu-Hao
    APPLIED SOFT COMPUTING, 2022, 123