Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning

被引：192

作者：

Yu, James J. Q. ^{[1
]}

Yu, Wen ^{[2
]}

Gu, Jiatao ^{[3
]}

机构：

[1] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen 518055, Peoples R China

[2] Natl Polytech Inst CINVESTAV IPN, Dept Automat Control, Mexico City 07360, DF, Mexico

[3] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2019年 / 20卷 / 10期

关键词：

Online vehicle routing; logistic system; neural combinatorial optimization; deep reinforcement learning; intelligent transportation; ELECTRIC VEHICLES;

D O I：

10.1109/TITS.2019.2909109

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Online vehicle routing is an important task of the modern transportation service provider. Contributed by the ever-increasing real-time demand on the transportation system, especially small-parcel last-mile delivery requests, vehicle route generation is becoming more computationally complex than before. The existing routing algorithms are mostly based on mathematical programming, which requires huge computation time in city-size transportation networks. To develop routes with minimal time, in this paper, we propose a novel deep reinforcement learning-based neural combinatorial optimization strategy. Specifically, we transform the online routing problem to a vehicle tour generation problem, and propose a structural graph embedded pointer network to develop these tours iteratively. Furthermore, since constructing supervised training data for the neural network is impractical due to the high computation complexity, we propose a deep reinforcement learning mechanism with an unsupervised auxiliary network to train the model parameters. A multisampling scheme is also devised to further improve the system performance. Since the parameter training process is offline, the proposed strategy can achieve a superior online route generation speed. To assess the proposed strategy, we conduct comprehensive case studies with a real-world transportation network. The simulation results show that the proposed strategy can significantly outperform conventional strategies with limited computation time in both static and dynamic logistic systems. In addition, the influence of control parameters on the system performance is investigated.

引用

页码：3806 / 3817

页数：12

共 38 条

[1] Electric Vehicles in Logistics and Transportation: A Survey on Emerging Environmental, Strategic, and Operational Challenges
Alejandro Juan, Angel
Alberto Mendez, Carlos
Faulin, Javier
de Armas, Jesica
Grasman, Scott Erwin
[J]. ENERGIES, 2016, 9 (02)
[2] [Anonymous], IEEE T INTELL TRANSP
[3] [Anonymous], TECH REP
[4] [Anonymous], 2017, Gurobi Optimization Reference Manual, V7th
[5] [Anonymous], IEEE T SMART GRID
[6] [Anonymous], 2017, MODEL S TESLA
[7] Long short-term memory
Hochreiter, S
Schmidhuber, J
[J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
[8] [Anonymous], 2017, ADV NEURAL INFORM PR
[9] [Anonymous], 2016, DEEP LEARNING
[10] [Anonymous], 2017, NISSAN LEAF ELECT CA

← 1 2 3 4 →