A Hybrid of Deep Reinforcement Learning and Local Search for the Vehicle Routing Problems

被引:87
|
作者
Zhao, Jiuxia [1 ]
Mao, Minjia [2 ]
Zhao, Xi [3 ]
Zou, Jianhua [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Management, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Routing; Adaptation models; Heuristic algorithms; Search problems; Training; Optimization; VRP; VRPTW; routing simulator; deep reinforcement learning; adaptive critic; local search; LARGE NEIGHBORHOOD SEARCH; OPTIMIZATION; ALGORITHMS; DELIVERY;
D O I
10.1109/TITS.2020.3003163
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Different variants of the Vehicle Routing Problem (VRP) have been studied for decades. State-of-the-art methods based on local search have been developed for VRPs, while still facing problems of slow running time and poor solution quality in the case of large problem size. To overcome these problems, we first propose a novel deep reinforcement learning (DRL) model, which is composed of an actor, an adaptive critic and a routing simulator. The actor, based on the attention mechanism, is designed to generate routing strategies. The adaptive critic is devised to change the network structure adaptively, in order to accelerate the convergence rate and improve the solution quality during training. The routing simulator is developed to provide graph information and reward with the actor and adaptive cirtic. Then, we combine this DRL model with a local search method to further improve the solution quality. The output of the DRL model can serve as the initial solution for the following local search method, from where the final solution of the VRP is obtained. Tested on three datasets with customer points of 20, 50 and 100 respectively, experimental results demonstrate that the DRL model alone finds better solutions compared to construction algorithms and previous DRL approaches, while enabling a 5- to 40-fold speedup. We also observe that combining the DRL model with various local search methods yields excellent solutions at a superior generation speed, comparing to that of other initial solutions.
引用
收藏
页码:7208 / 7218
页数:11
相关论文
共 50 条
  • [41] Deep Reinforcement Learning-Based Routing on Software-Defined Networks
    Kim, Gyungmin
    Kim, Yohan
    Lim, Hyuk
    IEEE ACCESS, 2022, 10 : 18121 - 18133
  • [42] RL-Routing: An SDN Routing Algorithm Based on Deep Reinforcement Learning
    Chen, Yi-Ren
    Rezapour, Amir
    Tzeng, Wen-Guey
    Tsai, Shi-Chun
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 3185 - 3199
  • [43] Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning
    Yu, James J. Q.
    Yu, Wen
    Gu, Jiatao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (10) : 3806 - 3817
  • [44] Local search for vehicle routing and scheduling problems:: Review and conceptual integration
    Funke, B
    Grünert, T
    Irnich, S
    JOURNAL OF HEURISTICS, 2005, 11 (04) : 267 - 306
  • [45] A New Hybrid Iterated Local Search for the Open Vehicle Routing Problem
    Chen, Ping
    Qu, Youli
    Huang, Houkuan
    Dong, Xingye
    PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 857 - 861
  • [46] Local Search for Vehicle Routing and Scheduling Problems: Review and Conceptual Integration
    Birger Funke
    Tore Grünert
    Stefan Irnich
    Journal of Heuristics, 2005, 11 : 267 - 306
  • [47] A hybrid algorithm for a class of vehicle routing problems
    Subramanian, Anand
    Uchoa, Eduardo
    Ochi, Luiz Satoru
    COMPUTERS & OPERATIONS RESEARCH, 2013, 40 (10) : 2519 - 2531
  • [48] Vehicle Routing Problems for Drone Delivery
    Dorling, Kevin
    Heinrichs, Jordan
    Messier, Geoffrey G.
    Magierowski, Sebastian
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (01): : 70 - 85
  • [49] Iterative Local-Search Heuristic for Weighted Vehicle Routing Problem
    Wang, Xinyu
    Shao, Shuai
    Tang, Jiafu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) : 3444 - 3454
  • [50] ENERO: Efficient real-time WAN routing optimization with Deep Reinforcement Learning
    Almasan, Paul
    Xiao, Shihan
    Cheng, Xiangle
    Shi, Xiang
    Barlet-Ros, Pere
    Cabellos-Aparicio, Albert
    COMPUTER NETWORKS, 2022, 214