Using Q-learning algorithm for initialization of the GRASP metaheuristic and genetic algorithm

被引：0

作者：

de Lima Junior, Francisco Chagas

de Melo, Jorge Dantas

Doria Neto, Adriao Duarte

机构：

来源：

2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6 | 2007年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Techniques of optimization, known as metaheuristics, have achieved success in the resolution of many problems classified as NP-Hard. These methods use non-deterministic approaches that find good solutions which, however, do not guarantee the determination of the global optimum. Beyond the inherent difficulties related to the complexity that characterizes the optimization problems, the metaheuristics still face the dilemma of the exploitation - exploration, which consists of choosing between a greedy search and a wider exploration of the solution space. A way to guide such algorithms during the search of better solutions is supplying them with more knowledge through the learning of the environment. This way, this work proposes the use of a technique of Reinforcement Learning - Q-Learning Algorithm - for the constructive phase of GRASP metaheuristic and to generate the initial population of a Genetic Algorithm. The proposed methods will be applied to the symmetrical traveling salesman problem.

引用

页码：1243 / 1248

页数：6

共 14 条

[1] Learning evaluation functions to improve optimization by local search [J].

Boyan, JA ;

Moore, AW .

JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :77-112

[2]

DARRELL W, 2003, GENETIC ALGORITHM TU

[3] GREEDY RANDOMIZED ADAPTIVE SEARCH PROCEDURES [J].

FEO, TA ;

RESENDE, MGC .

JOURNAL OF GLOBAL OPTIMIZATION, 1995, 6 (02) :109-133

[4]

Festa P., 2001, GRASP ANNOTATED BIBL

[5]

Gamberdella LM., 1995, P 12 INT C MACH LEAR, P252

[6]

Garey MR, 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness

[7]

HAUPT RL, 2004, PRATICAL GENETIC ALG

[8]

Karp R. M., 1975, Networks, V5, P45

[9]

LIMA, 2001, HIERARCHICAL REINFOR, P251

[10]

MARTIN L, 1994, MARKOV DECISION PROB

← 1 2 →