Introspection dynamics: a simple model of counterfactual learning in asymmetric games

被引：15

作者：

Couto, M. C. ^{[1
]}

Giaimo, S. ^{[2
]}

Hilbe, C. ^{[1
]}

机构：

[1] Max Planck Inst Evolutionary Biol, Max Planck Res Grp Dynam Social Behav, D-24306 Plon, Germany

[2] Max Planck Inst Evolutionary Biol, Dept Evolutionary Theory, D-24306 Plon, Germany

来源：

NEW JOURNAL OF PHYSICS | 2022年 / 24卷 / 06期

基金：

欧洲研究理事会;

关键词：

evolutionary game theory; counterfactual learning; myopic updating; asymmetric games; social dilemmas; volunteer's dilemma; PRISONERS-DILEMMA GAME; EVOLUTIONARY DYNAMICS; STATISTICAL-MECHANICS; IMITATION PROCESSES; BIMATRIX GAMES; STRATEGIES; COOPERATION; MUTATIONS; SELECTION; FIXATION;

D O I：

10.1088/1367-2630/ac6f76

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

Social behavior in human and animal populations can be studied as an evolutionary process. Individuals often make decisions between different strategies, and those strategies that yield a fitness advantage tend to spread. Traditionally, much work in evolutionary game theory considers symmetric games: individuals are assumed to have access to the same set of strategies, and they experience the same payoff consequences. As a result, they can learn more profitable strategies by imitation. However, interactions are oftentimes asymmetric. In that case, imitation may be infeasible (because individuals differ in the strategies they are able to use), or it may be undesirable (because individuals differ in their incentives to use a strategy). Here, we consider an alternative learning process which applies to arbitrary asymmetric games, introspection dynamics. According to this dynamics, individuals regularly compare their present strategy to a randomly chosen alternative strategy. If the alternative strategy yields a payoff advantage, it is more likely adopted. In this work, we formalize introspection dynamics for pairwise games. We derive simple and explicit formulas for the abundance of each strategy over time and apply these results to several well-known social dilemmas. In particular, for the volunteer's timing dilemma, we show that the player with the lowest cooperation cost learns to cooperate without delay.

引用

页数：23

共 40 条

[21] Evolutionary dynamics for the firm's organizational mode based on the theory of learning in games
Shi, Kui-Ran
Sheng, Zhao-Han
Xiao, Tiao-Jun
Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2007, 27 (06): : 64 - 70
[22] Neural networks as a unifying learning model for random normal form games
Spiliopoulos, Leonidas
ADAPTIVE BEHAVIOR, 2011, 19 (06) : 383 - 408
[23] Online optimal learning algorithm for Stackelberg games with partially unknown dynamics and constrained inputs
Cui, Xiaohong
Wang, Binrui
Wang, Lina
Chen, Jiayu
NEUROCOMPUTING, 2021, 445 : 1 - 11
[24] Learning and propagation: Evolutionary dynamics in spatial public goods games through combined Q-learning and Fermi rule
Shen, Yong
Ma, Yujie
Kang, Hongwei
Sun, Xingping
Chen, Qingyi
CHAOS SOLITONS & FRACTALS, 2024, 187
[25] A Dynamic Graph Model of Strategy Learning for Predicting Human Behavior in Repeated Games
Vazifedan, Afrooz
Izadi, Mohammad
B E JOURNAL OF THEORETICAL ECONOMICS, 2023, 23 (01) : 371 - 403
[26] Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium
Fujimoto, Yuma
Ariu, Kaito
Abe, Kenshi
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 118 - 125
[27] Relaxation Dynamics and Finite-Size Effects in a Simple Model of Condensation
Gotti, Gabriele
Iubini, Stefano
Politi, Paolo
FLUCTUATION AND NOISE LETTERS, 2024, 23 (01):
[28] MACHINE LEARNING MODEL FOR GLARE PREDICTION IN OFFICES WITH SIMPLE ARCHITECTURAL FEATURES
Sanjeev, Kumar T.
Kurian, Ciji Pearl
Colaco, Sheryl Grace
Mathew, Veena
JOURNAL OF GREEN BUILDING, 2022, 17 (04): : 79 - 97
[29] Exponential Reduction in Sample Complexity with Learning of Ising Model Dynamics
Dutt, Arkopal
Lokhov, Andrey Y.
Vuffray, Marc
Misra, Sidhant
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[30] Tipping points and user-resource system collapse in a simple model of evolutionary dynamics
Brandt, Gunnar
Merico, Agostino
ECOLOGICAL COMPLEXITY, 2013, 13 : 46 - 52

← 1 2 3 4 →