Deep Policy Dynamic Programming for Vehicle Routing Problems

被引：31

作者：

Kool, Wouter ^{[1
,2
]}

van Hoof, Herke ^{[1
]}

Gromicho, Joaquim ^{[1
,2
]}

Welling, Max ^{[1
]}

机构：

[1] Univ Amsterdam, Amsterdam, Netherlands

[2] ORTEC, Zoetermeer, Netherlands

来源：

INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, CPAIOR 2022 | 2022年 / 13292卷

关键词：

Dynamic Programming; Deep Learning; Vehicle Routing; TRAVELING SALESMAN PROBLEM; LARGE NEIGHBORHOOD SEARCH; ALGORITHM; OPTIMIZATION;

D O I：

10.1007/978-3-031-08011-1_14

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Routing problems are a class of combinatorial problems with many practical applications. Recently, end-to-end deep learning methods have been proposed to learn approximate solution heuristics for such problems. In contrast, classical dynamic programming (DP) algorithms guarantee optimal solutions, but scale badly with the problem size. We propose Deep Policy Dynamic Programming (DPDP), which aims to combine the strengths of learned neural heuristics with those of DP algorithms. DPDP prioritizes and restricts the DP state space using a policy derived from a deep neural network, which is trained to predict edges from example solutions. We evaluate our framework on the travelling salesman problem (TSP), the vehicle routing problem (VRP) and TSP with time windows (TSPTW) and show that the neural policy improves the performance of (restricted) DP algorithms, making them competitive to strong alternatives such as LKH, while also outperforming most other 'neural approaches' for solving TSPs, VRPs and TSPTWs with 100 nodes.

引用

页码：190 / 213

页数：24

共 50 条

[31] Policy adaptation for vehicle routing
Cazenave, Tristan
Lucas, Jean-Yves
Triboulet, Thomas
Kim, Hyoseok
AI COMMUNICATIONS, 2021, 34 (01) : 21 - 35
[32] An adaptive large neighborhood search heuristic for dynamic vehicle routing problems
Chen, Shifeng
Chen, Rong
Wang, Gai-Ge
Gao, Jian
Sangaiah, Arun Kumar
COMPUTERS & ELECTRICAL ENGINEERING, 2018, 67 : 596 - 607
[33] Metaheuristics for vehicle routing problems with three-dimensional loading constraints
Fuellerer, Guenther
Doerner, Karl F.
Hartl, Richard F.
Iori, Manuel
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 201 (03) : 751 - 759
[34] Vehicle routing problems over time: a survey
Mor, A.
Speranza, M. G.
4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2020, 18 (02): : 129 - 149
[35] Reinforcement Learning With Multiple Relational Attention for Solving Vehicle Routing Problems
Xu, Yunqiu
Fang, Meng
Chen, Ling
Xu, Gangyan
Du, Yali
Zhang, Chengqi
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) : 11107 - 11120
[36] Electric Vehicle Routing Problems with Stochastic Demands and Dynamic Remedial Measures
Ge, Xianlong
Zhu, Ziqiang
Jin, Yuanzhi
MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
[37] Vehicle routing problems over time: a survey
Mor, A.
Speranza, M. G.
ANNALS OF OPERATIONS RESEARCH, 2022, 314 (01) : 255 - 275
[38] Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem
Li, Jingwen
Ma, Yining
Gao, Ruize
Cao, Zhiguang
Lim, Andrew
Song, Wen
Zhang, Jie
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13572 - 13585
[39] A unified exact method for solving different classes of vehicle routing problems
Baldacci, Roberto
Mingozzi, Aristide
MATHEMATICAL PROGRAMMING, 2009, 120 (02) : 347 - 380
[40] The Vehicle Routing Problem with Dynamic Occasional Drivers
Dahle, Lars
Andersson, Henrik
Christiansen, Marielle
COMPUTATIONAL LOGISTICS, ICCL 2017, 2017, 10572 : 49 - 63

← 1 2 3 4 5 →