A new Hyper-heuristic based on Adaptive Simulated Annealing and Reinforcement Learning for the Capacitated Electric Vehicle Routing Problem

被引:19
作者
Rodriguez-Esparza, Erick [1 ]
Masegosa, Antonio D. [1 ,2 ]
Oliva, Diego [3 ]
Onieva, Enrique [1 ]
机构
[1] Univ Deusto, Fac Engn, DeustoTech, Ave Univ 24, Bilbao 48007, Spain
[2] Ikerbasque, Basque Fdn Sci, Plaza Euskadi 5, Bilbao 48009, Spain
[3] Univ Guadalajara, Dept Ingn Electrofoton, CUCEI, Ave Revoluc 1500, Guadalajara 44430, Jal, Mexico
关键词
Last-mile logistics; Hyper-heuristic; Electric vehicles; Capacitated electric vehicle routing problem; Combinatorial optimization; Reinforcement learning; TIME WINDOWS; LOCAL SEARCH; OPTIMIZATION; IMPACT; FLEET;
D O I
10.1016/j.eswa.2024.124197
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Electric vehicles (EVs) have been adopted in urban areas to reduce environmental pollution and global warming due to the increasing number of freight vehicles. However, there are still deficiencies in routing the trajectories of last-mile logistics that continue to impact social and economic sustainability. For that reason, in this paper, a hyper-heuristic (HH) approach called Hyper-heuristic Adaptive Simulated Annealing with Reinforcement Learning (HHASARL) is proposed. It is composed of a multi-armed bandit method and the self-adaptive Simulated Annealing (SA) metaheuristic algorithm for solving the problem called Capacitated Electric Vehicle Routing Problem (CEVRP). Due to the limited number of charging stations and the travel range of EVs, the EVs must require battery recharging moments in advance and reduce travel times and costs. The implementation of the HH improves multiple minimum best-known solutions and obtains the best mean values for some high-dimensional instances for the proposed benchmark for the IEEE WCCI2020 competition.
引用
收藏
页数:15
相关论文
共 66 条
[1]  
[Anonymous], 2011, Multi-armed Bandit Allocation Indices
[2]   Recent challenges in Routing and Inventory Routing: E-commerce and last-mile delivery [J].
Archetti, Claudia ;
Bertazzi, Luca .
NETWORKS, 2021, 77 (02) :255-268
[3]   Green vehicle routing problem: A state-of-the-art review [J].
Asghari, Mohammad ;
Al-e-hashem, S. Mohammad J. Mirzapour .
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2021, 231
[4]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[5]  
Aziz N. A., 2016, NAT C POSTGR RES, P469
[6]  
Bhatti A., 2020, INT J FUTUR GENER CO, V13, P1449, DOI DOI 10.1016/J.CHB.2016.09.026
[7]  
Blocho M, 2020, INTELL DAT CENT SYST, P101, DOI 10.1016/B978-0-12-815715-2.00009-9
[8]   A Reinforcement Learning Approach for Rebalancing Electric Vehicle Sharing Systems [J].
Bogyrbayeva, Aigerim ;
Jang, Sungwook ;
Shah, Ankit ;
Jang, Young Jae ;
Kwon, Changhyun .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) :8704-8714
[9]   Urban Freight Last Mile Logistics-Challenges and Opportunities to Improve Sustainability: A Literature Review [J].
Bosona, Techane .
SUSTAINABILITY, 2020, 12 (21) :1-20
[10]   Hyper-heuristics: a survey of the state of the art [J].
Burke, Edmund K. ;
Gendreau, Michel ;
Hyde, Matthew ;
Kendall, Graham ;
Ochoa, Gabriela ;
Oezcan, Ender ;
Qu, Rong .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2013, 64 (12) :1695-1724