A behavioral learning process in games

被引:46
作者
Laslier, JF
Topol, R
Walliser, B [1 ]
机构
[1] Ecole Natl Ponts & Chaussees, CERAS, Paris, France
[2] Ecole Polytech, CREA, F-75230 Paris, France
[3] Ecole Polytech, CNRS, F-75230 Paris, France
[4] Ecole Polytech, Lab Econometrie, F-75230 Paris, France
关键词
evolution; learning; Nash equilibrium; Polya urn; reinforcement;
D O I
10.1006/game.2000.0841
中图分类号
F [经济];
学科分类号
02 ;
摘要
This paper studies the cumulative proportional reinforcement (CPR) rule, according to which an agent plays, at each period, an action with a probability proportional to the cumulative utility that the agent has obtained with that action. The asymptotic properties of this learning process are examined for a decision-maker under risk, where it converges almost surely toward the expected utility maximizing action(s). The process is further considered in a two-player game; it converges with positive probability toward any strict pure Nash equilibrium and converges with zero probability toward some mixed equilibria (which are characterized). The CPR rule is compared in its principles with other reinforcement rules and with replicator dynamics. (C) 2001 Academic Press.
引用
收藏
页码:340 / 366
页数:27
相关论文
共 50 条
  • [1] A reinforcement learning process in extensive form games
    Jean-François Laslier
    Bernard Walliser
    International Journal of Game Theory, 2005, 33 : 219 - 227
  • [2] A reinforcement learning process in extensive form games
    Laslier, JF
    Walliser, B
    INTERNATIONAL JOURNAL OF GAME THEORY, 2005, 33 (02) : 219 - 227
  • [3] A behavioral study of "noise" in coordination games
    Mas, Michael
    Nax, Heinrich H.
    JOURNAL OF ECONOMIC THEORY, 2016, 162 : 195 - 208
  • [4] Learning process in public goods games
    Amado, Andre
    Huang, Weini
    Campos, Paulo R. A.
    Ferreira, Fernando Fagundes
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2015, 430 : 21 - 31
  • [5] Dynamic learning in behavioral games: A hidden Markov mixture of experts approach
    Asim Ansari
    Ricardo Montoya
    Oded Netzer
    Quantitative Marketing and Economics, 2012, 10 : 475 - 503
  • [6] Dynamic learning in behavioral games: A hidden Markov mixture of experts approach
    Ansari, Asim
    Montoya, Ricardo
    Netzer, Oded
    QME-QUANTITATIVE MARKETING AND ECONOMICS, 2012, 10 (04): : 475 - 503
  • [7] Learning across games
    Mengel, Friederike
    GAMES AND ECONOMIC BEHAVIOR, 2012, 74 (02) : 601 - 619
  • [8] Learning in network games
    Kovarik, Jaromir
    Mengel, Friederike
    Gabriel Romero, Jose
    QUANTITATIVE ECONOMICS, 2018, 9 (01) : 85 - 139
  • [9] On the robustness of learning in games with stochastically perturbed payoff observations
    Bravo, Mario
    Mertikopoulos, Panayotis
    GAMES AND ECONOMIC BEHAVIOR, 2017, 103 : 41 - 66
  • [10] Interactive games in the teaching-learning process of a foreing language
    Martinez Sanchez, Maritza Maribel
    Perez Morfin, Armando
    Portillo Campos, Vilma E.
    TEORIA Y PRAXIS, 2007, 3 (04): : 47 - 66