A Hybrid of Deep Reinforcement Learning and Local Search for the Vehicle Routing Problems

被引:87
|
作者
Zhao, Jiuxia [1 ]
Mao, Minjia [2 ]
Zhao, Xi [3 ]
Zou, Jianhua [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Management, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Routing; Adaptation models; Heuristic algorithms; Search problems; Training; Optimization; VRP; VRPTW; routing simulator; deep reinforcement learning; adaptive critic; local search; LARGE NEIGHBORHOOD SEARCH; OPTIMIZATION; ALGORITHMS; DELIVERY;
D O I
10.1109/TITS.2020.3003163
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Different variants of the Vehicle Routing Problem (VRP) have been studied for decades. State-of-the-art methods based on local search have been developed for VRPs, while still facing problems of slow running time and poor solution quality in the case of large problem size. To overcome these problems, we first propose a novel deep reinforcement learning (DRL) model, which is composed of an actor, an adaptive critic and a routing simulator. The actor, based on the attention mechanism, is designed to generate routing strategies. The adaptive critic is devised to change the network structure adaptively, in order to accelerate the convergence rate and improve the solution quality during training. The routing simulator is developed to provide graph information and reward with the actor and adaptive cirtic. Then, we combine this DRL model with a local search method to further improve the solution quality. The output of the DRL model can serve as the initial solution for the following local search method, from where the final solution of the VRP is obtained. Tested on three datasets with customer points of 20, 50 and 100 respectively, experimental results demonstrate that the DRL model alone finds better solutions compared to construction algorithms and previous DRL approaches, while enabling a 5- to 40-fold speedup. We also observe that combining the DRL model with various local search methods yields excellent solutions at a superior generation speed, comparing to that of other initial solutions.
引用
收藏
页码:7208 / 7218
页数:11
相关论文
共 50 条
  • [21] A hybrid multi-objective genetic local search algorithm for the prize-collecting vehicle routing problem
    Long, Jianyu
    Sun, Zhenzhong
    Pardalos, Panos M.
    Hong, Ying
    Zhang, Shaohui
    Li, Chuan
    INFORMATION SCIENCES, 2019, 478 : 40 - 61
  • [22] Energy Management Strategy for a Hybrid Electric Vehicle Based on Deep Reinforcement Learning
    Hu, Yue
    Li, Weimin
    Xu, Kun
    Zahid, Taimoor
    Qin, Feiyan
    Li, Chenming
    APPLIED SCIENCES-BASEL, 2018, 8 (02):
  • [23] Deep Reinforcement Learning Based Routing in IP Media Broadcast Networks: Feasibility and Performance
    Amaral, Pedro
    Simoes, Diogo
    IEEE ACCESS, 2022, 10 : 62459 - 62470
  • [24] Deep Reinforcement Learning Using Optimized Monte Carlo Tree Search in EWN
    Zhang, Yixian
    Li, Zhuoxuan
    Cao, Yiding
    Zhao, Xuan
    Cao, Jinde
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (03) : 544 - 555
  • [25] Ensemble-based Deep Reinforcement Learning for Vehicle Routing Problems under Distribution Shift
    Jiang, Yuan
    Cao, Zhiguang
    Wu, Yaoxin
    Song, Wen
    Zhang, Jie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [26] Synergetic attention-driven transformer: A deep reinforcement learning approach for vehicle routing problems
    Guan, Qingshu
    Cao, Hui
    Jia, Lixin
    Yan, Dapeng
    Chen, Badong
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
  • [27] Collaborative Control of Vehicle Platoon Based on Deep Reinforcement Learning
    Chen, Jianzhong
    Wu, Xiaobao
    Lv, Zekai
    Xu, Zhihe
    Wang, Wenjie
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) : 14399 - 14414
  • [28] A Deep Reinforcement Learning-Based Geographic Packet Routing Optimization
    Bai, Yijie
    Zhang, Xia
    Yu, Daojie
    Li, Shengxiang
    Wang, Yu
    Lei, Shuntian
    Tian, Zhoutai
    IEEE ACCESS, 2022, 10 : 108785 - 108796
  • [29] An iterated local search algorithm for latency vehicle routing problems with multiple depots
    Osorio-Mora, Alan
    Escobar, John Willmer
    Toth, Paolo
    COMPUTERS & OPERATIONS RESEARCH, 2023, 158
  • [30] Deep Reinforcement Learning-Based Joint Caching and Routing in AI-Driven Networks
    Yang, Meiyi
    Gao, Deyun
    Zhang, Weiting
    Yang, Dong
    Niyato, Dusit
    Zhang, Hongke
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 1322 - 1337