Colearning in Differential Games

被引:0
作者
John W. Sheppard
机构
[1] ARINC,
来源
Machine Learning | 1998年 / 33卷
关键词
Markov games; differential games; pursuit games; multiagent learning; reinforcement learning; Q-learning;
D O I
暂无
中图分类号
学科分类号
摘要
Game playing has been a popular problem area for research in artificial intelligence and machine learning for many years. In almost every study of game playing and machine learning, the focus has been on games with a finite set of states and a finite set of actions. Further, most of this research has focused on a single player or team learning how to play against another player or team that is applying a fixed strategy for playing the game. In this paper, we explore multiagent learning in the context of game playing and develop algorithms for “co-learning” in which all players attempt to learn their optimal strategies simultaneously. Specifically, we address two approaches to colearning, demonstrating strong performance by a memory-based reinforcement learner and comparable but faster performance with a tree-based reinforcement learner.
引用
收藏
页码:201 / 233
页数:32
相关论文
共 28 条
[1]  
Aha D.(1992)Tolerating noise, irrelevant, and novel attributes in instance-based learning algorithms International Journal of Man-Machine Studies 16 267-287
[2]  
Bentley J.(1980)Multidimensional divide and conquer Communications of the ACM 23 214-229
[3]  
Dayan P.(1992)The convergence of TD(λ) for general λ Machine Learning 8 341-362
[4]  
Dorigo M.(1996)The ant system: Optimization by a colony of cooperating agents IEEE Transactions on Systems, Man, and Cybernetics 26 1-13
[5]  
Maniezzo V.(1988)Credit assignment in rule discovery systems based on genetic algorithms Machine Learning 3 225-245
[6]  
Colorni A.(1990)Learning sequential decision rules using simulation models and competition Machine Learning 5 355-381
[7]  
Grefenstette J.(1980)Pursuit-evasion of two aircraft in a horizontal plane Journal of Guidance and Control 3 261-267
[8]  
Grefenstette J.(1995)Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term Games and Economic Behavior 8 164-212
[9]  
Ramsey C.(1959)Some studies in machine learning using the game of checkers IBM Journal of Research and Development 3 211-229
[10]  
Schultz A.(1995)Multiagent reinforcement learning in the iterated prisoner's dilemma Biosystems 37 147-166