Learning extension parameters in game-tree search

被引：10

作者：

Björnsson, Y ^{[1
]}

Marsland, TA ^{[1
]}

机构：

[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada

来源：

INFORMATION SCIENCES | 2003年 / 154卷 / 3-4期

关键词：

D O I：

10.1016/S0020-0255(03)00045-8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The strength of a program for playing an adversary game like chess or checkers is greatly influenced by how selectively it explores the various branches of the game tree. Typically, some branch paths are discontinued early while others are explored more deeply. Finding the best set of parameters to control these extensions is a difficult, time-consuming, and tedious task. In this paper we describe a method for automatically tuning search-extension parameters in adversary search. Based on the new method, two learning variants are introduced: one for offline learning and the other for online learning. The two approaches are compared and experimental results provided in the domain of chess. (C) 2003 Elsevier Science Inc. All rights reserved.

引用

页码：95 / 118

页数：24

共 23 条

[1] SINGULAR EXTENSIONS - ADDING SELECTIVITY TO BRUTE-FORCE SEARCHING [J].

ANANTHARAMAN, T ;

CAMPBELL, MS ;

HSU, FH .

ARTIFICIAL INTELLIGENCE, 1990, 43 (01) :99-109

[2]

[Anonymous], MACHINES LEARN PLAY

[3]

[Anonymous], 2001, P 17 INT JOINT C ART

[4] Learning to play chess using temporal differences [J].

Baxter, J ;

Tridgell, A ;

Weaver, L .

MACHINE LEARNING, 2000, 40 (03) :243-263

[5] Quantification of search-extension benefits [J].

Beal, DF ;

Smith, MC .

ICCA JOURNAL, 1995, 18 (04) :205-218

[6] Learning piece values using temporal differences [J].

Beal, DF ;

Smith, MC .

ICCA JOURNAL, 1997, 20 (03) :147-151

[7]

BJORNSSON Y, 2001, ADV COMPUTER GAMES, V9, P157

[8]

BJORNSSON Y, 2002, THESIS U ALBERTA EDM

[9]

Buro M., 2000, GAMES AI RES, P77

[10]

Cazenave T., 2001, ADV COMPUTER GAMES, V9, P275

← 1 2 3 →