A behavioral learning process in games

Cited by: 46

Authors:

Laslier, JF

Topol, R

Walliser, B [1]

Affiliations:

[1] Ecole Natl Ponts & Chaussees, CERAS, Paris, France

[2] Ecole Polytech, CREA, F-75230 Paris, France

[3] Ecole Polytech, CNRS, F-75230 Paris, France

[4] Ecole Polytech, Lab Econometrie, F-75230 Paris, France

Source:

GAMES AND ECONOMIC BEHAVIOR | 2001 / Vol. 37 / Issue 2

Keywords:

evolution; learning; Nash equilibrium; Polya urn; reinforcement

DOI:

10.1006/game.2000.0841

Chinese Library Classification (CLC) number:

F [Economics]

Discipline classification code:

02

Abstract:

This paper studies the cumulative proportional reinforcement (CPR) rule, according to which an agent plays, at each period, an action with a probability proportional to the cumulative utility that the agent has obtained with that action. The asymptotic properties of this learning process are examined for a decision-maker under risk, where it converges almost surely toward the expected utility maximizing action(s). The process is further considered in a two-player game; it converges with positive probability toward any strict pure Nash equilibrium and converges with zero probability toward some mixed equilibria (which are characterized). The CPR rule is compared in its principles with other reinforcement rules and with replicator dynamics. (C) 2001 Academic Press.
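The single-agent case described in the abstract can be illustrated with a minimal simulation sketch. It is not taken from the paper: the function names, the initial weight of 1.0 per action, and the two-action payoff lotteries are hypothetical, and utilities are assumed to be strictly positive so that the choice probabilities are well defined.

```python
import random


def cpr_choose(cumulative, rng):
    """Pick an action index with probability proportional to its cumulative utility."""
    total = sum(cumulative)
    threshold = rng.random() * total
    running = 0.0
    for action, weight in enumerate(cumulative):
        running += weight
        if threshold < running:
            return action
    return len(cumulative) - 1  # guard against floating-point round-off


def simulate_cpr(payoff_draws, periods, initial=1.0, seed=0):
    """Run the CPR rule for one decision-maker facing risky payoffs.

    payoff_draws: one callable per action, returning a strictly positive utility.
    initial: assumed positive starting weight per action (an illustrative choice).
    """
    rng = random.Random(seed)
    cumulative = [initial] * len(payoff_draws)
    counts = [0] * len(payoff_draws)
    for _ in range(periods):
        action = cpr_choose(cumulative, rng)
        utility = payoff_draws[action](rng)
        cumulative[action] += utility  # reinforce only the action just played
        counts[action] += 1
    return cumulative, counts


# Hypothetical decision problem under risk with two actions.
payoffs = [
    lambda rng: rng.choice([1.0, 3.0]),    # expected utility 2.0
    lambda rng: rng.choice([0.25, 0.75]),  # expected utility 0.5
]
cumulative, counts = simulate_cpr(payoffs, periods=20000)
shares = [c / sum(counts) for c in counts]
```

In this sketch, `shares` gives the empirical frequency of each action; under the convergence result stated in the abstract, the share of the action with the higher expected utility should approach one as the number of periods grows.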


Pages: 340-366

Number of pages: 27
