A behavioral learning process in games

被引:46
作者
Laslier, JF
Topol, R
Walliser, B [1 ]
机构
[1] Ecole Natl Ponts & Chaussees, CERAS, Paris, France
[2] Ecole Polytech, CREA, F-75230 Paris, France
[3] Ecole Polytech, CNRS, F-75230 Paris, France
[4] Ecole Polytech, Lab Econometrie, F-75230 Paris, France
关键词
evolution; learning; Nash equilibrium; Polya urn; reinforcement;
D O I
10.1006/game.2000.0841
中图分类号
F [经济];
学科分类号
02 ;
摘要
This paper studies the cumulative proportional reinforcement (CPR) rule, according to which an agent plays, at each period, an action with a probability proportional to the cumulative utility that the agent has obtained with that action. The asymptotic properties of this learning process are examined for a decision-maker under risk, where it converges almost surely toward the expected utility maximizing action(s). The process is further considered in a two-player game; it converges with positive probability toward any strict pure Nash equilibrium and converges with zero probability toward some mixed equilibria (which are characterized). The CPR rule is compared in its principles with other reinforcement rules and with replicator dynamics. (C) 2001 Academic Press.
引用
收藏
页码:340 / 366
页数:27
相关论文
共 26 条
[1]  
[Anonymous], SEMINAIRE PROBABILIT
[2]  
[Anonymous], [No title captured], DOI DOI 10.1007/BF01199986
[3]  
ARTHUR B, 1984, KI8BERNETIKA, V1, P49
[4]  
Benaim M., 1996, Journal of Dynamics and Differential Equations, V8, P141, DOI 10.1007/BF02218617
[5]   Mixed equilibria and dynamical systems arising from fictitious play in perturbed games [J].
Benaïm, M ;
Hirsch, MW .
GAMES AND ECONOMIC BEHAVIOR, 1999, 29 (1-2) :36-72
[6]   Learning through reinforcement and replicator dynamics [J].
Borgers, T ;
Sarin, R .
JOURNAL OF ECONOMIC THEORY, 1997, 77 (01) :1-14
[7]   Naive reinforcement learning with endogenous aspirations [J].
Börgers, T ;
Sarin, R .
INTERNATIONAL ECONOMIC REVIEW, 2000, 41 (04) :921-950
[8]  
Bush RR, 1955, Stochastic models for learning, DOI DOI 10.1037/14496-000
[9]   STOCHASTIC LEARNING MODEL OF ECONOMIC BEHAVIOR [J].
CROSS, JG .
QUARTERLY JOURNAL OF ECONOMICS, 1973, 87 (02) :239-266
[10]  
Erev I, 1998, AM ECON REV, V88, P848