Using Q-learning algorithm for initialization of the GRASP metaheuristic and genetic algorithm

被引:0
作者
de Lima Junior, Francisco Chagas
de Melo, Jorge Dantas
Doria Neto, Adriao Duarte
机构
来源
2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6 | 2007年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Techniques of optimization, known as metaheuristics, have achieved success in the resolution of many problems classified as NP-Hard. These methods use non-deterministic approaches that find good solutions which, however, do not guarantee the determination of the global optimum. Beyond the inherent difficulties related to the complexity that characterizes the optimization problems, the metaheuristics still face the dilemma of the exploitation - exploration, which consists of choosing between a greedy search and a wider exploration of the solution space. A way to guide such algorithms during the search of better solutions is supplying them with more knowledge through the learning of the environment. This way, this work proposes the use of a technique of Reinforcement Learning - Q-Learning Algorithm - for the constructive phase of GRASP metaheuristic and to generate the initial population of a Genetic Algorithm. The proposed methods will be applied to the symmetrical traveling salesman problem.
引用
收藏
页码:1243 / 1248
页数:6
相关论文
共 14 条
[1]   Learning evaluation functions to improve optimization by local search [J].
Boyan, JA ;
Moore, AW .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :77-112
[2]  
DARRELL W, 2003, GENETIC ALGORITHM TU
[3]   GREEDY RANDOMIZED ADAPTIVE SEARCH PROCEDURES [J].
FEO, TA ;
RESENDE, MGC .
JOURNAL OF GLOBAL OPTIMIZATION, 1995, 6 (02) :109-133
[4]  
Festa P., 2001, GRASP ANNOTATED BIBL
[5]  
Gamberdella LM., 1995, P 12 INT C MACH LEAR, P252
[6]  
Garey MR, 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[7]  
HAUPT RL, 2004, PRATICAL GENETIC ALG
[8]  
Karp R. M., 1975, Networks, V5, P45
[9]  
LIMA, 2001, HIERARCHICAL REINFOR, P251
[10]  
MARTIN L, 1994, MARKOV DECISION PROB