Emergence of exploitation as symmetry breaking in iterated prisoner's dilemma

被引:12
|
作者
Fujimoto, Yuma [1 ]
Kaneko, Kunihiko [1 ,2 ]
机构
[1] Univ Tokyo, Grad Sch Arts & Sci, Dept Basic Sci, Meguro Ku, 3-8-1 Komaba, Tokyo 1538902, Japan
[2] Univ Tokyo, Universal Biol Inst, Res Ctr Complex Syst Biol, 3-8-1 Komaba, Tokyo 1538902, Japan
来源
PHYSICAL REVIEW RESEARCH | 2019年 / 1卷 / 03期
关键词
EVOLUTION; REINFORCEMENT; STRATEGIES; EXTORTION;
D O I
10.1103/PhysRevResearch.1.033077
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In society, mutual cooperation, defection, and exploitative relationships are common. Whereas cooperation and defection are studied extensively in the literature on game theory, exploitative relationships between players, in which one receives a larger benefit than the other while the game itself is symmetric, are little explored. In a recent seminal study, Press and Dyson demonstrated that if only one player can learn about the other, asymmetric exploitation is achieved in the prisoner's dilemma game. In their study, however, asymmetry is assumed in decision making between persons; the exploiting player one-sidedly determines and fixes the strategy and the exploited player follows it. It is unknown whether such exploitation emerges and is stably established even when both players learn about each other symmetrically and try to optimize their payoffs. Here, we first formulate a dynamical system that describes the change in a player's probabilistic strategy with reinforcement learning to obtain greater payoffs, based on the recognition of the other player. By applying this formulation to the standard prisoner's dilemma game, we numerically and analytically demonstrate that an exploitative relationship can be achieved despite symmetric strategy dynamics and symmetric rule of games. This exploitative relationship is stabilized by both the players: The exploiting player demands the other's unfair cooperation. Even though the exploited player, who receives a lower payoff than the exploiting player, has optimized the own strategy, the player accepts the other's defection to some degree. Whether the final equilibrium state is mutual cooperation, defection, or exploitation crucially depends on the initial conditions. Response to decrease the cooperation probability against a defector leads to oscillations in the probabilities of cooperation between the players and thus a complicated basin structure to the final equilibrium. In particular, any slight difference between both players' initial strategies can be amplified and fixed as a large difference in the probabilities of cooperation, leading to fixation of exploitation. In other words, symmetry breaking between the exploiting and exploited players results. Considering the generality of the result, this study provides another perspective on the origin of exploitation in society.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Recognising and evaluating the effectiveness of extortion in the Iterated Prisoner's Dilemma
    Knight, Vincent
    Harper, Marc
    Glynatsi, Nikoleta E.
    Gillard, Jonathan
    PLOS ONE, 2024, 19 (07):
  • [22] GETTING RECOMMENDATION IS NOT ALWAYS BETTER IN ITERATED PRISONER'S DILEMMA
    Cinar, Zeynep B.
    Bingol, Haluk O.
    ADVANCES IN COMPLEX SYSTEMS, 2020, 23 (05):
  • [23] The interplay of emotion expressions and strategy in promoting cooperation in the iterated prisoner's dilemma
    de Melo, Celso M.
    Terada, Kazunori
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [24] Asymmetrical expectations of future interaction and cooperation in the iterated prisoner's dilemma game
    Zeng, Weijun
    Ai, Hongfeng
    Zhao, Man
    APPLIED MATHEMATICS AND COMPUTATION, 2019, 359 : 148 - 164
  • [25] A bibliometric study of research topics, collaboration, and centrality in the iterated prisoner's dilemma
    Glynatsi, Nikoleta E.
    Knight, Vincent A.
    HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS, 2021, 8 (01):
  • [26] The Effects of Temperature Priming on Cooperation in the Iterated Prisoner's Dilemma
    Storey, Simon
    Workman, Lance
    EVOLUTIONARY PSYCHOLOGY, 2013, 11 (01): : 52 - 67
  • [27] Adapation of Iterated Prisoner's Dilemma strategies by evolution and learning
    Quek, Han Yang
    Goh, Chi Keong
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND GAMES, 2007, : 40 - 47
  • [28] Discrimination emerging through spontaneous symmetry breaking in a spatial prisoner's dilemma model with multiple labels
    Jensen, Gorm Gruner
    Tischel, Frederik
    Bornholdt, Stefan
    PHYSICAL REVIEW E, 2019, 100 (06)
  • [29] On Learning Stable Cooperation in the Iterated Prisoner's Dilemma with Paid Incentives
    Sun, Xiyue
    Pieroth, Fabian R.
    Schmid, Kyrill
    Wirsing, Martin
    Belzner, Lenz
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2022, : 113 - 118
  • [30] Exploiting Evolutionary Modeling to Prevail in Iterated Prisoner's Dilemma Tournaments
    Gaudesi, Marco
    Piccolo, Elio
    Squillero, Giovanni
    Tonda, Alberto
    IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2016, 8 (03) : 288 - 300