Automated design of search algorithms based on reinforcement learning

被引:5
作者
Yi, Wenjie [1 ]
Qu, Rong [1 ]
机构
[1] Univ Nottingham, Sch Comp Sci, Nottingham, England
关键词
Automated algorithm design; Reinforcement learning; Vehicle routing problem with time windows; VEHICLE-ROUTING PROBLEM; LOCAL SEARCH; SOLVE;
D O I
10.1016/j.ins.2023.119639
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automated algorithm design has attracted increasing research attention recently in the evolution-ary computation community. The main design decisions include selection heuristics and evolution operators in the search algorithms. Most existing studies, however, have focused on the automated design of evolution operators, neglecting selection heuristics for evolution and for replacement, not to mention considering all of the design decisions. This limited the scope of the algorithms under consideration. This study aims to systematically investigate automated design of search algorithms by exploring the impact of individual algorithmic components within a general search framework and the synergy among these multiple algorithmic components utilising a reinforce-ment learning technique. Comprehensive computational experiments are conducted on different benchmark instances of the capacitated vehicle routing problem with time windows to evaluate the effectiveness and generality of the proposed method. This study contributes to knowledge discovery in automated algorithm design using machine learning towards significantly enhanced generality of search algorithms.
引用
收藏
页数:20
相关论文
共 47 条
  • [1] A two-stage hybrid local search for the vehicle routing problem with time windows
    Bent, R
    Van Hentenryck, P
    [J]. TRANSPORTATION SCIENCE, 2004, 38 (04) : 515 - 530
  • [2] Burke EK, 2011, GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, P1987
  • [3] An evolutionary algorithm based hyper-heuristic framework for the set packing problem
    Chaurasia, Sachchida Nand
    Kim, Joong Hoon
    [J]. INFORMATION SCIENCES, 2019, 505 : 1 - 31
  • [4] Chen Y., 2016, A Multi-Arm Bandit Neighbourhood Search for Routing and Scheduling Problems
  • [5] Automatic design of hyper-heuristic based on reinforcement learning
    Choong, Shin Siang
    Wong, Li-Pei
    Lim, Chee Peng
    [J]. INFORMATION SCIENCES, 2018, 436 : 89 - 107
  • [6] Consoli Pietro, 2014, Evolutionary Computation in Combinatorial Optimisation. 14th European Conference (EvoCOP 2014). Revised Selected Papers: LNCS 8600, P97, DOI 10.1007/978-3-662-44320-0_9
  • [7] A unified tabu search heuristic for vehicle routing problems with time windows
    Cordeau, JF
    Laporte, G
    Mercier, A
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2001, 52 (08) : 928 - 936
  • [8] The multi-depot vehicle routing problem with inter-depot routes
    Crevier, Benoit
    Cordeau, Jean-Francois
    Laporte, Gilbert
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 176 (02) : 756 - 773
  • [9] Parallel simulated annealing for the vehicle routing problem with time windows
    Czech, ZJ
    Czarnas, P
    [J]. 10TH EUROMICRO WORKSHOP ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PROCEEDINGS, 2002, : 376 - 383
  • [10] Dantas Augusto, 2021, GECCO '21: Proceedings of the Genetic and Evolutionary Computation Conference Companion, P1488, DOI 10.1145/3449726.3463187