Introspection dynamics: a simple model of counterfactual learning in asymmetric games

被引:15
|
作者
Couto, M. C. [1 ]
Giaimo, S. [2 ]
Hilbe, C. [1 ]
机构
[1] Max Planck Inst Evolutionary Biol, Max Planck Res Grp Dynam Social Behav, D-24306 Plon, Germany
[2] Max Planck Inst Evolutionary Biol, Dept Evolutionary Theory, D-24306 Plon, Germany
来源
NEW JOURNAL OF PHYSICS | 2022年 / 24卷 / 06期
基金
欧洲研究理事会;
关键词
evolutionary game theory; counterfactual learning; myopic updating; asymmetric games; social dilemmas; volunteer's dilemma; PRISONERS-DILEMMA GAME; EVOLUTIONARY DYNAMICS; STATISTICAL-MECHANICS; IMITATION PROCESSES; BIMATRIX GAMES; STRATEGIES; COOPERATION; MUTATIONS; SELECTION; FIXATION;
D O I
10.1088/1367-2630/ac6f76
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Social behavior in human and animal populations can be studied as an evolutionary process. Individuals often make decisions between different strategies, and those strategies that yield a fitness advantage tend to spread. Traditionally, much work in evolutionary game theory considers symmetric games: individuals are assumed to have access to the same set of strategies, and they experience the same payoff consequences. As a result, they can learn more profitable strategies by imitation. However, interactions are oftentimes asymmetric. In that case, imitation may be infeasible (because individuals differ in the strategies they are able to use), or it may be undesirable (because individuals differ in their incentives to use a strategy). Here, we consider an alternative learning process which applies to arbitrary asymmetric games, introspection dynamics. According to this dynamics, individuals regularly compare their present strategy to a randomly chosen alternative strategy. If the alternative strategy yields a payoff advantage, it is more likely adopted. In this work, we formalize introspection dynamics for pairwise games. We derive simple and explicit formulas for the abundance of each strategy over time and apply these results to several well-known social dilemmas. In particular, for the volunteer's timing dilemma, we show that the player with the lowest cooperation cost learns to cooperate without delay.
引用
收藏
页数:23
相关论文
共 40 条
  • [21] Evolutionary dynamics for the firm's organizational mode based on the theory of learning in games
    Shi, Kui-Ran
    Sheng, Zhao-Han
    Xiao, Tiao-Jun
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2007, 27 (06): : 64 - 70
  • [22] Neural networks as a unifying learning model for random normal form games
    Spiliopoulos, Leonidas
    ADAPTIVE BEHAVIOR, 2011, 19 (06) : 383 - 408
  • [23] Online optimal learning algorithm for Stackelberg games with partially unknown dynamics and constrained inputs
    Cui, Xiaohong
    Wang, Binrui
    Wang, Lina
    Chen, Jiayu
    NEUROCOMPUTING, 2021, 445 : 1 - 11
  • [24] Learning and propagation: Evolutionary dynamics in spatial public goods games through combined Q-learning and Fermi rule
    Shen, Yong
    Ma, Yujie
    Kang, Hongwei
    Sun, Xingping
    Chen, Qingyi
    CHAOS SOLITONS & FRACTALS, 2024, 187
  • [25] A Dynamic Graph Model of Strategy Learning for Predicting Human Behavior in Repeated Games
    Vazifedan, Afrooz
    Izadi, Mohammad
    B E JOURNAL OF THEORETICAL ECONOMICS, 2023, 23 (01) : 371 - 403
  • [26] Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium
    Fujimoto, Yuma
    Ariu, Kaito
    Abe, Kenshi
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 118 - 125
  • [27] Relaxation Dynamics and Finite-Size Effects in a Simple Model of Condensation
    Gotti, Gabriele
    Iubini, Stefano
    Politi, Paolo
    FLUCTUATION AND NOISE LETTERS, 2024, 23 (01):
  • [28] MACHINE LEARNING MODEL FOR GLARE PREDICTION IN OFFICES WITH SIMPLE ARCHITECTURAL FEATURES
    Sanjeev, Kumar T.
    Kurian, Ciji Pearl
    Colaco, Sheryl Grace
    Mathew, Veena
    JOURNAL OF GREEN BUILDING, 2022, 17 (04): : 79 - 97
  • [29] Exponential Reduction in Sample Complexity with Learning of Ising Model Dynamics
    Dutt, Arkopal
    Lokhov, Andrey Y.
    Vuffray, Marc
    Misra, Sidhant
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [30] Tipping points and user-resource system collapse in a simple model of evolutionary dynamics
    Brandt, Gunnar
    Merico, Agostino
    ECOLOGICAL COMPLEXITY, 2013, 13 : 46 - 52