Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning

被引:192
作者
Yu, James J. Q. [1 ]
Yu, Wen [2 ]
Gu, Jiatao [3 ]
机构
[1] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen 518055, Peoples R China
[2] Natl Polytech Inst CINVESTAV IPN, Dept Automat Control, Mexico City 07360, DF, Mexico
[3] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Peoples R China
关键词
Online vehicle routing; logistic system; neural combinatorial optimization; deep reinforcement learning; intelligent transportation; ELECTRIC VEHICLES;
D O I
10.1109/TITS.2019.2909109
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Online vehicle routing is an important task of the modern transportation service provider. Contributed by the ever-increasing real-time demand on the transportation system, especially small-parcel last-mile delivery requests, vehicle route generation is becoming more computationally complex than before. The existing routing algorithms are mostly based on mathematical programming, which requires huge computation time in city-size transportation networks. To develop routes with minimal time, in this paper, we propose a novel deep reinforcement learning-based neural combinatorial optimization strategy. Specifically, we transform the online routing problem to a vehicle tour generation problem, and propose a structural graph embedded pointer network to develop these tours iteratively. Furthermore, since constructing supervised training data for the neural network is impractical due to the high computation complexity, we propose a deep reinforcement learning mechanism with an unsupervised auxiliary network to train the model parameters. A multisampling scheme is also devised to further improve the system performance. Since the parameter training process is offline, the proposed strategy can achieve a superior online route generation speed. To assess the proposed strategy, we conduct comprehensive case studies with a real-world transportation network. The simulation results show that the proposed strategy can significantly outperform conventional strategies with limited computation time in both static and dynamic logistic systems. In addition, the influence of control parameters on the system performance is investigated.
引用
收藏
页码:3806 / 3817
页数:12
相关论文
共 38 条
  • [1] Electric Vehicles in Logistics and Transportation: A Survey on Emerging Environmental, Strategic, and Operational Challenges
    Alejandro Juan, Angel
    Alberto Mendez, Carlos
    Faulin, Javier
    de Armas, Jesica
    Grasman, Scott Erwin
    [J]. ENERGIES, 2016, 9 (02)
  • [2] [Anonymous], IEEE T INTELL TRANSP
  • [3] [Anonymous], TECH REP
  • [4] [Anonymous], 2017, Gurobi Optimization Reference Manual, V7th
  • [5] [Anonymous], IEEE T SMART GRID
  • [6] [Anonymous], 2017, MODEL S TESLA
  • [7] Long short-term memory
    Hochreiter, S
    Schmidhuber, J
    [J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
  • [8] [Anonymous], 2017, ADV NEURAL INFORM PR
  • [9] [Anonymous], 2016, DEEP LEARNING
  • [10] [Anonymous], 2017, NISSAN LEAF ELECT CA