Adaptive game playing using multiplicative weights

被引:304
作者
Freund, Y [1 ]
Schapire, RE [1 ]
机构
[1] AT&T Labs Res, Shannon Lab, Florham Pk, NJ 07932 USA
关键词
D O I
10.1006/game.1999.0738
中图分类号
F [经济];
学科分类号
02 ;
摘要
We present a simple algorithm for playing a repeated game. We show that a player using this algorithm suffers average loss that is guaranteed to come close to the minimum loss achievable by any fixed strategy. Our bounds are nonasymptotic and hold for any opponent. The algorithm, which uses the multiplicative-weight methods of Littlestone and Warmuth, is analyzed using the Kullback-Liebler divergence. This analysis yields a new, simple proof of the min-max theorem, as well as a provable method of approximately solving a game. A variant of our game-playing algorithm is proved to be optimal in a very strong sense. Journal of Economic Literature Classification Numbers: C44, C70, D83. (C) 1999 Academic Press.
引用
收藏
页码:79 / 103
页数:25
相关论文
共 31 条
[1]   FAST PROBABILISTIC ALGORITHMS FOR HAMILTONIAN CIRCUITS AND MATCHINGS [J].
ANGLUIN, D ;
VALIANT, LG .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1979, 18 (02) :155-193
[2]  
[Anonymous], 1982, GAME THEORY
[3]  
Auer P, 1995, AN S FDN CO, P322, DOI 10.1109/SFCS.1995.492488
[4]  
Blackwell D., 1956, PAC J MATH, V6, P1, DOI [DOI 10.2140/PJM.1956.6.1, 10.2140/pjm.1956.6.1]
[5]  
Blackwell D, 1954, Theory of Games and Statistical Decisions
[6]   How to use expert advice [J].
CesaBianchi, N ;
Freund, Y ;
Haussler, D ;
Helmbold, DP ;
Schapire, RE ;
Warmuth, MK .
JOURNAL OF THE ACM, 1997, 44 (03) :427-485
[7]  
Cover T. M., 1991, Mathematical Finance, V1, P1, DOI [https://doi.org/10.1111/j.1467-9965.1991.tb00002.x, DOI 10.1111/J.1467-9965.1991.TB00002.X, 10.1111/j.1467-9965.1991.tb00002.x]
[8]  
COVER TM, 1996, IEEE T INFORM THEORY
[9]   STATISTICAL-THEORY - THE PREQUENTIAL APPROACH [J].
DAWID, AP .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1984, 147 :278-292
[10]   UNIVERSAL PREDICTION OF INDIVIDUAL SEQUENCES [J].
FEDER, M ;
MERHAV, N ;
GUTMAN, M .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (04) :1258-1270