Deep Reinforcement Learning for Solving Two-Echelon Capacity Vehicle Routing Problem: An End-to-End Method

被引:1
作者
Sun, Weice [1 ]
Pei, Zhi [1 ]
机构
[1] Zhejiang Univ Technol, Coll Mech Engn, Hangzhou 310023, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2025年 / 10卷 / 06期
关键词
Satellites; Vehicle routing; Mathematical models; Heuristic algorithms; Deep reinforcement learning; Vehicle dynamics; Markov decision processes; Logistics; Costs; Urban areas; Automation logistics; deep reinforcement learning (DRL); Markov decision process (MDP); multi head attention (MHA); ALGORITHM;
D O I
10.1109/LRA.2025.3568309
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Two-echelon distribution networks significantly enhance the delivery speed and reduce the distribution cost. Recent years have seen a growing trend in applying the reinforcement learning method to deal with combinatorial optimization problems such as the Vehicle Routing Problem (VRP). The advantage of Deep Reinforcement Learning (DRL) in this context lies in the fast solving of instances under the same distribution via the trained models. To the best of our knowledge, no prior research has applied the DRL to tackle the Two-Echelon Capacitated Vehicle Routing Problem (2E-CVRP). This letter, for the first time, models 2E-CVRP as a Markov Decision Process (MDP) and proposes an end-to-end DRL approach to handle it. Experimental results show that our proposed DRL-2E-CVRP method can rapidly solve unseen instances from the same distribution and improve the solution quality through transfer learning. It is observed that the solution speed surpasses that of commercial solvers, and the solution accuracy matches or even exceeds them within a limited time span. In addition, our method also demonstrates strong performance on benchmarks with unknown distributions.
引用
收藏
页码:6432 / 6439
页数:8
相关论文
共 24 条
[1]  
Amarouche Y., 2018, P 18 WORKSH ALG APPR, P11
[2]   An Exact Algorithm for the Two-Echelon Capacitated Vehicle Routing Problem [J].
Baldacci, Roberto ;
Mingozzi, Aristide ;
Roberti, Roberto ;
Clavo, Roberto Wolfler .
OPERATIONS RESEARCH, 2013, 61 (02) :298-314
[3]   The vehicle routing problem: State of the art classification and review [J].
Braekers, Kris ;
Ramaekers, Katrien ;
Van Nieuwenhuyse, Inneke .
COMPUTERS & INDUSTRIAL ENGINEERING, 2016, 99 :300-313
[4]   The electric two-echelon vehicle routing problem [J].
Breunig, U. ;
Baldacci, R. ;
Hartl, R. F. ;
Vidal, T. .
COMPUTERS & OPERATIONS RESEARCH, 2019, 103 :198-210
[5]   A large neighbourhood based heuristic for two-echelon routing problems [J].
Breunig, U. ;
Schmid, V. ;
Hartl, R. F. ;
Vidal, T. .
COMPUTERS & OPERATIONS RESEARCH, 2016, 76 :208-225
[6]  
Crainic TG, 2011, LECT NOTES COMPUT SC, V6622, P179, DOI 10.1007/978-3-642-20364-0_16
[7]  
Feliu J. G., 2008, The two-echelon capacitated vehicle routing problem
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]   An adaptive large neighborhood search heuristic for Two-Echelon Vehicle Routing Problems arising in city logistics [J].
Hemmelmayr, Vera C. ;
Cordeau, Jean-Francois ;
Crainic, Teodor Gabriel .
COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (12) :3215-3228
[10]   A Branch-and-Cut Algorithm for the Symmetric Two-Echelon Capacitated Vehicle Routing Problem [J].
Jepsen, Mads ;
Spoorendonk, Simon ;
Ropke, Stefan .
TRANSPORTATION SCIENCE, 2013, 47 (01) :23-37