Learning while solving problems in best first search

被引:6
作者
Sarkar, S [1 ]
Chakrabarti, PP
Ghose, S
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Guwahati 781001, India
[2] Indian Inst Technol, Dept Comp Sci & Engn, Kharagpur 721302, W Bengal, India
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS | 1998年 / 28卷 / 04期
关键词
anytime algorithm; best first search; continuous learning; heuristic features; learning; problem solving;
D O I
10.1109/3468.686716
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We investigate the role of learning in search-based systems for solving optimization problems. Many AI problem solving systems solve problems repeatedly from the same domain. If the problems come from the same distribution in the learning phase and the problem solving phase, the problem solver can acquire information while solving problems, which can be used to solve subsequent problems faster. We use a learning model, where the values of a set of features can be used to induce a clustering of the problem state space. The feasible set of h* values corresponding to each cluster is called h*set. If we relax the optimality guarantee, and tolerate a risk factor, the distribution of h*set can be used to expedite search and produce results within a given risk of suboptimality. The offline learning method consists of solving a batch of problems by using A* to learn the distribution of the h*set in the learning phase. This distribution can be used to solve the-rest of the problems effectively. We show how the knowledge acquisition phase can be integrated with the problem solving phase. We present a continuous on-line learning scheme that uses an "anytime" algorithm to learn continuously while solving problems. The system starts with initial assumed distributions of h*set which are used to solve the initial problems. The results are used to update the distributions continuously and with time the distributions converge.
引用
收藏
页码:535 / 542
页数:8
相关论文
共 18 条
[1]  
BRAMANTIGREGOR A, 1992, P 10 EUR C ART INT, P6
[2]  
BRAMANTIGREGOR A, 1993, P 13 INT JOINT C ART, P1079
[3]  
BRAMANTIGREGOR A, 1991, P 12 INT JOINT C ART
[4]  
CHENOWETH SV, 1991, P 12 INT JOINT C ART
[5]  
CHRISTENSEN J, 1986, P 5 NAT C ART INT
[6]  
GASCHNIG JG, 1979, THESIS CARNEGIE MELO
[7]   METHOD FOR COMPUTING HEURISTICS IN PROBLEM-SOLVING [J].
GUIDA, G ;
SOMALVICO, M .
INFORMATION SCIENCES, 1979, 19 (03) :251-259
[8]  
LEE KF, 1988, ARTIF INTELL, V36
[9]  
MOSTOW J, 1989, P 11 INT JOINT C ART
[10]  
Nilsson N.J., 1980, PRINCIPLES ARTIFICIA