Reinforcement Learning Approach to Stochastic Vehicle Routing Problem With Correlated Demands

Cited by: 4
Authors
Iklassov, Zangir [1]
Sobirov, Ikboljon [1]
Solozabal, Ruben [1]
Takac, Martin [1]
Affiliations
[1] Mohamed bin Zayed Univ Artificial Intelligence MBZ, Dept Machine Learning, Abu Dhabi, U Arab Emirates
Keywords
Reinforcement learning; stochastic optimization; vehicle routing problem; PRICE ALGORITHM; GAME;
DOI
10.1109/ACCESS.2023.3306076
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
We present a novel end-to-end framework for solving the Vehicle Routing Problem with Stochastic Demands (VRPSD) using Reinforcement Learning (RL). Our formulation incorporates the correlation between stochastic demands through other observable stochastic variables, thereby offering an experimental demonstration of the theoretical premise that non-i.i.d. stochastic demands provide opportunities for improved routing solutions. Our approach bridges the gap in the application of RL to VRPSD and consists of a parameterized stochastic policy, optimized using a policy gradient algorithm, that generates the sequence of actions forming the solution. Our model outperforms previous state-of-the-art metaheuristics and demonstrates robustness to changes in the environment, such as the supply type, vehicle capacity, correlation, and noise levels of demand. Moreover, the model can be easily retrained for different VRPSD scenarios by observing the reward signals and following feasibility constraints, making it highly flexible and scalable. These findings highlight the potential of RL to enhance transportation efficiency and mitigate its environmental impact in stochastic routing problems. Our implementation is available at https://github.com/Zangir/SVRP.
Pages: 87958-87969
Number of pages: 12