Training Agents with Neural Networks in Systems with Imperfect Information

Cited by: 0
Authors
Korukhova, Yulia [1]
Kuryshev, Sergey [1]
Affiliations
[1] Moscow MV Lomonosov State Univ, Computat Math & Cybernet Fac, GSP 1, Moscow 119991, Russia
Source
ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1 | 2017
Keywords
Multi-agent Systems; Neural Networks; Dominated Strategies
DOI
10.5220/0006242102960301
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The paper deals with a multi-agent system that represents trading agents acting in an environment with imperfect information. The fictitious play algorithm, first proposed by Brown in 1951, is a popular theoretical model of agent training. However, it is not applicable to larger systems with imperfect information because of its computational complexity. In this paper we propose a modification of the algorithm that uses neural networks for fast approximate calculation of best responses. An important feature of the algorithm is that agents need no a priori knowledge about the system. Agents learn through trial and error: winning actions are reinforced and entered into the training set, and losing actions are cut from the strategy. The proposed algorithm has been applied to a small game with imperfect information, and its ability to remove iteratively dominated strategies of agents' behavior has been demonstrated.
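For orientation, the classical fictitious play procedure that the abstract builds on can be sketched as follows. This is a minimal illustration of Brown's original scheme with exact best responses on a two-player matrix game; it is not the paper's neural-network modification, and the function names, the prisoner's-dilemma payoffs, and the choice of first action are assumptions made only for this example.

```python
from typing import List, Tuple

Matrix = List[List[float]]

# Hypothetical example game (prisoner's dilemma); entry [i][j] is the payoff
# when the row player picks i and the column player picks j.
# Action 0 = cooperate, action 1 = defect. Cooperating is strictly dominated.
PD_ROW: Matrix = [[3, 0], [5, 1]]
PD_COL: Matrix = [[3, 5], [0, 1]]


def best_response(payoff: Matrix, opponent_counts: List[int]) -> int:
    """Action maximizing payoff against the opponent's empirical action counts.

    payoff[a][b] is this player's payoff for own action a and opponent action b.
    Ties are broken in favour of the lowest action index.
    """
    expected = [
        sum(p * n for p, n in zip(row, opponent_counts)) for row in payoff
    ]
    return max(range(len(expected)), key=expected.__getitem__)


def fictitious_play(
    row_payoff: Matrix, col_payoff: Matrix, rounds: int
) -> Tuple[List[float], List[float]]:
    """Run classical fictitious play; return both empirical action frequencies."""
    n_rows, n_cols = len(row_payoff), len(row_payoff[0])
    # Re-index the column player's payoffs as [own action][opponent action].
    col_view = [[col_payoff[i][j] for i in range(n_rows)] for j in range(n_cols)]

    row_counts = [0] * n_rows
    col_counts = [0] * n_cols
    # Assumption: both players open with action 0 (any first move would do).
    row_counts[0] += 1
    col_counts[0] += 1

    for _ in range(rounds - 1):
        # Both players respond simultaneously to the history so far.
        r = best_response(row_payoff, col_counts)
        c = best_response(col_view, row_counts)
        row_counts[r] += 1
        col_counts[c] += 1

    return (
        [n / rounds for n in row_counts],
        [n / rounds for n in col_counts],
    )


if __name__ == "__main__":
    row_freq, col_freq = fictitious_play(PD_ROW, PD_COL, rounds=100)
    print("row player frequencies:", row_freq)
    print("column player frequencies:", col_freq)
```

In this toy game the dominated action is played only as the arbitrary opening move, so its empirical frequency shrinks as the number of rounds grows. The exact best-response step is the part that becomes expensive in larger games, which is the step the abstract proposes to approximate with neural networks.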
Pages: 296-301
Page count: 6
Related papers
8 entries in total
  • [1] Brown GW., 1951, Activity analysis of production and allocation, V13
  • [2] Gibson R., 2014, THESIS
  • [3] Johanson M., 2013, TR1301 U ALB DEP COM
  • [4] Koller D., 1994, Proceedings of the Twenty-Sixth Annual ACM Symposium on the Theory of Computing, P750, DOI 10.1145/195058.195451
  • [5] Kuhn H. W., 1950, Contributions to the Theory of Games, V24, P97
  • [6] NASH J, 1951, ANN MATH, V54, P286, DOI 10.2307/1969529
  • [7] Osborne M. J., 1994, A Course in Game Theory
  • [8] Zinkevich Martin, 2007, Advances in Neural Information Processing Systems