Adaptive Large Neighbourhood Search (ALNS) is a popular metaheuristic with renowned efficiency in solving combinatorial optimisation problems. However, despite 18 years of intensive research into ALNS, the design of an effective adaptive layer for selecting operators to improve the solution remains an open question. In this work, we isolate this problem by formulating it as a Markov Decision Process, in which an agent is rewarded proportionally to the improvement of the incumbent. We propose Graph Reinforcement Learning for Operator Selection (GRLOS), a method based on Deep Reinforcement Learning and Graph Neural Networks, as well as Learned Roulette Wheel (LRW), a lightweight approach inspired by the classic Roulette Wheel adaptive layer. The methods, which are broadly applicable to optimisation problems that can be represented as graphs, are comprehensively evaluated on 5 routing problems using a large portfolio of 28 destroy and 7 repair operators. Results show that both GRLOS and LRW outperform the classic selection mechanism in ALNS, owing to the operator choices being learned in a prior training phase. GRLOS is also shown to consistently achieve better performance than a recent Deep Reinforcement Learning method due to its substantially more flexible state representation. The evaluation further examines the impact of the operator budget and type of initial solution, and is applied to problem instances with up to 1000 customers. The findings arising from our extensive benchmarking bear relevance to the wider literature of hybrid methods combining metaheuristics and machine learning.
机构:
Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R ChinaCent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
Chen, Jiamin
Gao, Jianliang
论文数: 0引用数: 0
h-index: 0
机构:
Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R ChinaCent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
Gao, Jianliang
Chen, Yibo
论文数: 0引用数: 0
h-index: 0
机构:
State Grid Hunan Elect Power Co Ltd, Informat & Commun Branch, Changsha 410007, Peoples R ChinaCent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
Chen, Yibo
Oloulade, Babatounde Moctard
论文数: 0引用数: 0
h-index: 0
机构:
Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R ChinaCent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
Oloulade, Babatounde Moctard
Lyu, Tengfei
论文数: 0引用数: 0
h-index: 0
机构:
Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R ChinaCent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
Lyu, Tengfei
Li, Zhao
论文数: 0引用数: 0
h-index: 0
机构:
Alibaba Zhejiang Univ Joint Res Inst Frontier Tec, Ecommerce Ranking & Recommendat Syst, Hangzhou 311121, Zhejiang, Peoples R ChinaCent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
机构:
Univ Illinois, Dept Educ Psychol, 236A Educ Bldg,1310 S Sixth St, Champaign, IL 61820 USAUniv Illinois, Dept Educ Psychol, 236A Educ Bldg,1310 S Sixth St, Champaign, IL 61820 USA
Li, Xiao
Xu, Hanchen
论文数: 0引用数: 0
h-index: 0
机构:
Univ Illinois, Dept Elect & Comp Engn, 306 N Wright St MC 702, Urbana, IL 61801 USAUniv Illinois, Dept Educ Psychol, 236A Educ Bldg,1310 S Sixth St, Champaign, IL 61820 USA
Xu, Hanchen
Zhang, Jinming
论文数: 0引用数: 0
h-index: 0
机构:
Univ Illinois, Dept Educ Psychol, 236A Educ Bldg,1310 S Sixth St, Champaign, IL 61820 USAUniv Illinois, Dept Educ Psychol, 236A Educ Bldg,1310 S Sixth St, Champaign, IL 61820 USA
Zhang, Jinming
Chang, Hua-hua
论文数: 0引用数: 0
h-index: 0
机构:
Purdue Univ, Dept Educ Studies, Steven C Beering Hall Liberal Arts & Educ, W Lafayette, IN 47907 USAUniv Illinois, Dept Educ Psychol, 236A Educ Bldg,1310 S Sixth St, Champaign, IL 61820 USA