Reinforcement learning based local search for grouping problems: A case study on graph coloring

被引：52

作者：

Zhou, Yangming ^{[1
]}

Hao, Jin-Kao ^{[1
,2
]}

Duval, Beatrice ^{[1
]}

机构：

[1] Univ Angers, LERIA, 2 Bd Lavoisier, F-49045 Angers, France

[2] Inst Univ France, Paris, France

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2016年 / 64卷

关键词：

Grouping problems; Reinforcement learning; Heuristics; Learning-based optimization; GENETIC ALGORITHM; OPTIMIZATION;

D O I：

10.1016/j.eswa.2016.07.047

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Grouping problems aim to partition a set of items into multiple mutually disjoint subsets according to some specific criterion and constraints. Grouping problems cover a large class of computational problems including clustering and classification that frequently appear in expert and intelligent systems as well as many real applications. This paper focuses on developing a general-purpose solution approach for grouping problems, i.e., reinforcement learning based local search (RLS), which combines reinforcement learning techniques with local search. This paper makes the following contributions: we show that (1) reinforcement learning can help obtain useful information from discovered local optimum solutions; (2) the learned information can be advantageously used to guide the search algorithm towards promising regions. To the best of our knowledge, this is the first attempt to. propose a formal model that combines reinforcement learning and local search for solving grouping problems. The proposed approach is verified on a well-known representative grouping problem (graph coloring). The generality of the approach makes it applicable to other grouping problems. (C) 2016 Elsevier Ltd. All rights reserved.

引用

页码：412 / 422

页数：11

共 54 条

[1] A new grouping genetic algorithm for clustering problems [J].

Agustin-Blas, L. E. ;

Salcedo-Sanz, S. ;

Jimenez-Fernandez, S. ;

Carro-Calvo, L. ;

Del Ser, J. ;

Portilla-Figueras, J. A. .

EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (10) :9695-9703

[2]

Ahmed L. N., 2013, EXPERT SYSTEMS APPL, V42, P5463

[3]

[Anonymous], 2013, Handbook of Optimization: From Classical to Modern Approach

[4]

[Anonymous], LION WAY MACHINE LEA

[5]

[Anonymous], 2013, J MATH MODELLING ALG

[6]

[Anonymous], 1979, Computers and Intractablity: A Guide to the Theory of NP-Completeness

[7]

BALUJA S, 2000, NEURAL COMPUTING SUR, V3, P1

[8] Learning evaluation functions to improve optimization by local search [J].

Boyan, JA ;

Moore, AW .

JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :77-112

[9] An ant-based algorithm for coloring graphs [J].

Bui, Thang N. ;

Nguyen, ThanhVu H. ;

Patel, Chirag M. ;

Phan, Kim-Anh T. .

DISCRETE APPLIED MATHEMATICS, 2008, 156 (02) :190-200

[10] A tabu-search hyperheuristic for timetabling and rostering [J].

Burke, EK ;

Kendall, G ;

Soubeiga, E .

JOURNAL OF HEURISTICS, 2003, 9 (06) :451-470

← 1 2 3 4 5 6 →