Deep attention models with dimension-reduction and gate mechanisms for solving practical time-dependent vehicle routing problems

被引:9
作者
Guo, Feng [1 ,2 ]
Wei, Qu [3 ,4 ]
Wang, Miao [1 ]
Guo, Zhaoxia [1 ]
Wallace, Stein W. [1 ,5 ]
机构
[1] Sichuan Univ, Business Sch, Chengdu 610065, Peoples R China
[2] Hong Kong Polytech Univ, Dept Logist & Maritime Studies, Hong Kong, Peoples R China
[3] Sichuan Univ, West China Hosp, West China Biomed Big Data Ctr, Chengdu 610041, Peoples R China
[4] Sichuan Univ, Med X Ctr Informat, Chengdu 610041, Peoples R China
[5] NHH Norwegian Sch Econ, Bergen, Norway
基金
中国国家自然科学基金;
关键词
Time-dependent vehicle routing problems; Deep reinforcement learning; Attention model; Dimension-reduction mechanism; Gate mechanism; TABU SEARCH; ALGORITHMS; BRANCH; PRICE;
D O I
10.1016/j.tre.2023.103095
中图分类号
F [经济];
学科分类号
02 ;
摘要
Time dependencies of travel speeds in time-dependent vehicle routing problems (TDVRPs) are usually accounted for by discretizing the planning horizon into several time periods. However, travel speeds usually change frequently in real road networks, so many time periods are needed to evaluate candidate solutions accurately in model and solution construction for practical TDVRPs, which increases substantially the computational complexity of TDVRPs. We develop two deep attention models with dimension-reduction and gate mechanisms to solve practical TDVRPs in real urban road networks. In the two models, a multi-head attention-based dimension-reduction mechanism is proposed to reduce the dimension of model inputs and obtain enhanced node representation, whereas a gate mechanism is introduced to obtain better information representation. On the basis of a travel speed dataset from an urban road network, we conduct extensive experiments to validate the effectiveness of the proposed models on practical TDVRPs with or without consideration of time windows. Experimental results show that our models can solve TDVRPs with 240 time periods and up to 250 customers effectively and efficiently and provide significantly superior overall performances over two representative heuristics and two state-of-the-art deep reinforcement learning models. Especially, compared with a recent tabu search method, our models can reduce the computation time by up to 3,540 times and improve the solution performance by up to 23%. Moreover, our models have an outstanding generalization performance. The model trained for the 30-customer TDVRP with time windows can be used directly to solve problems with up to 250 customers effectively by generating superior solutions over those generated by benchmarking methods.
引用
收藏
页数:17
相关论文
共 49 条
[1]   Valid Inequalities for the Fleet Size and Mix Vehicle Routing Problem with Fixed Costs [J].
Baldacci, Roberto ;
Battarra, Maria ;
Vigo, Daniele .
NETWORKS, 2009, 54 (04) :178-189
[2]   Dynamic stochastic electric vehicle routing with safe reinforcement learning [J].
Basso, Rafael ;
Kulcsar, Balazs ;
Sanchez-Diaz, Ivan ;
Qu, Xiaobo .
TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2022, 157
[3]  
Bello I., 2016, PREPRINT
[4]  
Ben Ticha H., 2021, SN OPER RES FORUM, V2, P1
[5]   Machine learning for combinatorial optimization: A methodological tour d'horizon [J].
Bengio, Yoshua ;
Lodi, Andrea ;
Prouvost, Antoine .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 290 (02) :405-421
[6]   Solving Multi-Agent Routing Problems Using Deep Attention Mechanisms [J].
Bono, Guillaume ;
Dibangoye, Jilles S. ;
Simonin, Olivier ;
Matignon, Laetitia ;
Pereyron, Florian .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (12) :7804-7813
[7]  
Chen XY, 2019, ADV NEUR IN, V32
[8]   Branch and Price for the Time-Dependent Vehicle Routing Problem with Time Windows [J].
Dabia, Said ;
Ropke, Stefan ;
van Woensel, Tom ;
De Kok, Ton .
TRANSPORTATION SCIENCE, 2013, 47 (03) :380-396
[9]   Time dependent vehicle routing problem with a multi ant colony system [J].
Donati, Alberto V. ;
Montemanni, Roberto ;
Casagrande, Norman ;
Rizzoll, Andrea E. ;
Gambardella, Luca M. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2008, 185 (03) :1174-1191
[10]   Cooperative pursuit of unauthorized UAVs in urban airspace via Multi-agent reinforcement learning [J].
Du, Wenbo ;
Guo, Tong ;
Chen, Jun ;
Li, Biyue ;
Zhu, Guangxiang ;
Cao, Xianbin .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 128