Automated design of search algorithms based on reinforcement learning

被引：5

作者：

Yi, Wenjie ^{[1
]}

Qu, Rong ^{[1
]}

机构：

[1] Univ Nottingham, Sch Comp Sci, Nottingham, England

来源：

INFORMATION SCIENCES | 2023年 / 649卷

关键词：

Automated algorithm design; Reinforcement learning; Vehicle routing problem with time windows; VEHICLE-ROUTING PROBLEM; LOCAL SEARCH; SOLVE;

D O I：

10.1016/j.ins.2023.119639

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Automated algorithm design has attracted increasing research attention recently in the evolution-ary computation community. The main design decisions include selection heuristics and evolution operators in the search algorithms. Most existing studies, however, have focused on the automated design of evolution operators, neglecting selection heuristics for evolution and for replacement, not to mention considering all of the design decisions. This limited the scope of the algorithms under consideration. This study aims to systematically investigate automated design of search algorithms by exploring the impact of individual algorithmic components within a general search framework and the synergy among these multiple algorithmic components utilising a reinforce-ment learning technique. Comprehensive computational experiments are conducted on different benchmark instances of the capacitated vehicle routing problem with time windows to evaluate the effectiveness and generality of the proposed method. This study contributes to knowledge discovery in automated algorithm design using machine learning towards significantly enhanced generality of search algorithms.

引用

页数：20

共 47 条

[1] A two-stage hybrid local search for the vehicle routing problem with time windows
Bent, R
Van Hentenryck, P
[J]. TRANSPORTATION SCIENCE, 2004, 38 (04) : 515 - 530
[2] Burke EK, 2011, GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, P1987
[3] An evolutionary algorithm based hyper-heuristic framework for the set packing problem
Chaurasia, Sachchida Nand
Kim, Joong Hoon
[J]. INFORMATION SCIENCES, 2019, 505 : 1 - 31
[4] Chen Y., 2016, A Multi-Arm Bandit Neighbourhood Search for Routing and Scheduling Problems
[5] Automatic design of hyper-heuristic based on reinforcement learning
Choong, Shin Siang
Wong, Li-Pei
Lim, Chee Peng
[J]. INFORMATION SCIENCES, 2018, 436 : 89 - 107
[6] Consoli Pietro, 2014, Evolutionary Computation in Combinatorial Optimisation. 14th European Conference (EvoCOP 2014). Revised Selected Papers: LNCS 8600, P97, DOI 10.1007/978-3-662-44320-0_9
[7] A unified tabu search heuristic for vehicle routing problems with time windows
Cordeau, JF
Laporte, G
Mercier, A
[J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2001, 52 (08) : 928 - 936
[8] The multi-depot vehicle routing problem with inter-depot routes
Crevier, Benoit
Cordeau, Jean-Francois
Laporte, Gilbert
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 176 (02) : 756 - 773
[9] Parallel simulated annealing for the vehicle routing problem with time windows
Czech, ZJ
Czarnas, P
[J]. 10TH EUROMICRO WORKSHOP ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PROCEEDINGS, 2002, : 376 - 383
[10] Dantas Augusto, 2021, GECCO '21: Proceedings of the Genetic and Evolutionary Computation Conference Companion, P1488, DOI 10.1145/3449726.3463187

← 1 2 3 4 5 →