Improving heuristic mini-max search by supervised learning

被引:30
作者
Buro, M [1 ]
机构
[1] NEC Res Inst, Princeton, NJ 08540 USA
关键词
selective game-tree search; evaluation function; feature construction; opening book learning; GLEM; ProbCut; LOGISTELLO;
D O I
10.1016/S0004-3702(01)00093-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article surveys three techniques for enhancing heuristic game-tree search pioneered in the author's Othello program LOGISTELLO, which dominated the computer Othello scene for several years and won against the human World-champion 6-0 in 1997. First, a generalized linear evaluation model (GLEM) is described that combines conjunctions of Boolean features linearly. This approach allows an automatic, data driven exploration of the feature space. Combined with efficient least squares weight fitting, GLEM greatly eases the programmer's task of finding significant features and assigning weights to them. Second, the selective search heuristic PROBCUT and its enhancements are discussed. Based on evaluation correlations PROBCUT can prune probably irrelevant sub-trees with a prescribed confidence. Tournament results indicate a considerable playing strength improvement compared to full-width alpha-beta search. Third, an opening book framework is presented that enables programs to improve upon previous play and to explore new opening lines by constructing and searching a game-tree based on evaluations of played variations. These general methods represent the state-of-the-art in computer Othello programming and begin to attract researchers in related fields. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:85 / 99
页数:15
相关论文
共 26 条
[1]   A Bayesian approach to relevance in game playing [J].
Baum, EB ;
Smith, WD .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :195-242
[2]   A GENERALIZED QUIESCENCE SEARCH ALGORITHM [J].
BEAL, DF .
ARTIFICIAL INTELLIGENCE, 1990, 43 (01) :85-98
[3]  
BERLINER H, 2000, IEEE INTELLIGENT JAN, V15, P9
[4]   PROBCUT - AN EFFECTIVE SELECTIVE EXTENSION OF THE ALPHA-BETA ALGORITHM [J].
BURO, M .
ICCA JOURNAL, 1995, 18 (02) :71-76
[5]  
Buro M, 1999, LECT NOTES COMPUT SC, V1558, P126
[7]   The Othello match of the year: Takeshi Murakami vs. Logistello [J].
Buro, M .
ICCA JOURNAL, 1997, 20 (03) :189-193
[8]   Toward opening book learning [J].
Buro, M .
ICCA JOURNAL, 1999, 22 (02) :98-102
[9]  
Buro M., 2000, GAMES AI RES, P77
[10]  
HANSON SJ, 1990, ADV NEURAL INFORMATI, P541