Reinforcement learning based local search for grouping problems: A case study on graph coloring

被引:52
作者
Zhou, Yangming [1 ]
Hao, Jin-Kao [1 ,2 ]
Duval, Beatrice [1 ]
机构
[1] Univ Angers, LERIA, 2 Bd Lavoisier, F-49045 Angers, France
[2] Inst Univ France, Paris, France
关键词
Grouping problems; Reinforcement learning; Heuristics; Learning-based optimization; GENETIC ALGORITHM; OPTIMIZATION;
D O I
10.1016/j.eswa.2016.07.047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Grouping problems aim to partition a set of items into multiple mutually disjoint subsets according to some specific criterion and constraints. Grouping problems cover a large class of computational problems including clustering and classification that frequently appear in expert and intelligent systems as well as many real applications. This paper focuses on developing a general-purpose solution approach for grouping problems, i.e., reinforcement learning based local search (RLS), which combines reinforcement learning techniques with local search. This paper makes the following contributions: we show that (1) reinforcement learning can help obtain useful information from discovered local optimum solutions; (2) the learned information can be advantageously used to guide the search algorithm towards promising regions. To the best of our knowledge, this is the first attempt to. propose a formal model that combines reinforcement learning and local search for solving grouping problems. The proposed approach is verified on a well-known representative grouping problem (graph coloring). The generality of the approach makes it applicable to other grouping problems. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:412 / 422
页数:11
相关论文
共 54 条
[1]   A new grouping genetic algorithm for clustering problems [J].
Agustin-Blas, L. E. ;
Salcedo-Sanz, S. ;
Jimenez-Fernandez, S. ;
Carro-Calvo, L. ;
Del Ser, J. ;
Portilla-Figueras, J. A. .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (10) :9695-9703
[2]  
Ahmed L. N., 2013, EXPERT SYSTEMS APPL, V42, P5463
[3]  
[Anonymous], 2013, Handbook of Optimization: From Classical to Modern Approach
[4]  
[Anonymous], LION WAY MACHINE LEA
[5]  
[Anonymous], 2013, J MATH MODELLING ALG
[6]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theory of NP-Completeness
[7]  
BALUJA S, 2000, NEURAL COMPUTING SUR, V3, P1
[8]   Learning evaluation functions to improve optimization by local search [J].
Boyan, JA ;
Moore, AW .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :77-112
[9]   An ant-based algorithm for coloring graphs [J].
Bui, Thang N. ;
Nguyen, ThanhVu H. ;
Patel, Chirag M. ;
Phan, Kim-Anh T. .
DISCRETE APPLIED MATHEMATICS, 2008, 156 (02) :190-200
[10]   A tabu-search hyperheuristic for timetabling and rostering [J].
Burke, EK ;
Kendall, G ;
Soubeiga, E .
JOURNAL OF HEURISTICS, 2003, 9 (06) :451-470