Observing the evolution of neural networks learning to play the game of Othello

Cited by: 24

Authors:

Chong, SY [1]

Tan, MK

White, JD

Affiliations:

[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England

[2] Multimedia Univ, Fac Engn & Technol, Melaka 75450, Malaysia

[3] Yuan Ze Univ, Dept Elect Engn, Taoyuan 320, Taiwan

Source:

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION | 2005 / Vol. 9 / No. 3

Keywords:

artificial intelligence; coevolution; evolutionary computation; neural networks; Othello;

DOI:

10.1109/TEVC.2005.843750

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

A study was conducted to find out how game-playing strategies for Othello (also known as reversi) can be learned without expert knowledge. The approach used the coevolution of a fixed-architecture neural-network-based evaluation function combined with a standard minimax search algorithm. Comparisons between the evolving neural networks and computer players that used deterministic strategies allowed evolution to be observed in real time. The neural networks evolved to outperform computer players searching at higher ply-depths, despite being handicapped by playing black and using minimax at a ply-depth of only two. In addition, the playing ability of the population progressed from novice, to intermediate, and then to master level. Individual neural networks discovered various game-playing strategies, starting with positional strategies and later moving to mobility strategies. These results show that neural networks can be evolved as evaluation functions, despite the general difficulties associated with this approach. Success in this case was due to three factors: a simple spatial preprocessing layer in the neural network that captured spatial information, self-adaptation of every weight and bias of the neural network, and a selection method that allowed a diverse population of neural networks to be carried forward from one generation to the next.
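The abstract credits part of the result to self-adaptation of every weight and bias. The record does not give the update rule, so the following is only a minimal sketch of one common self-adaptive Gaussian mutation scheme, in which each parameter carries its own step size that is perturbed log-normally before the parameter itself is perturbed. The function name `self_adaptive_mutate`, the learning rate `tau`, and the initial step size of 0.05 are assumptions made for illustration, not values taken from the paper.

```python
import math
import random


def self_adaptive_mutate(weights, sigmas, rng):
    """Return a mutated copy of (weights, sigmas).

    Each weight has its own step size (sigma). The step size is
    mutated first, then used to perturb the weight.
    """
    n = len(weights)
    # Assumed learning rate; a common choice in the self-adaptation literature.
    tau = 1.0 / math.sqrt(2.0 * math.sqrt(n))
    # Log-normal update keeps every step size strictly positive.
    new_sigmas = [s * math.exp(tau * rng.gauss(0.0, 1.0)) for s in sigmas]
    # Gaussian perturbation of each weight with its own updated step size.
    new_weights = [w + s * rng.gauss(0.0, 1.0)
                   for w, s in zip(weights, new_sigmas)]
    return new_weights, new_sigmas


# Hypothetical parent with five parameters (weights and biases flattened).
rng = random.Random(0)
parent_w = [0.0] * 5
parent_s = [0.05] * 5
child_w, child_s = self_adaptive_mutate(parent_w, parent_s, rng)
```

In a coevolutionary loop of the kind the abstract describes, each parent would produce an offspring this way, and selection would then act on game results rather than on a hand-written fitness function.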

Pages: 240-251

Page count: 12
