Lévy noise promotes cooperation in the prisoner’s dilemma game with reinforcement learning

被引:0
作者
Lu Wang
Danyang Jia
Long Zhang
Peican Zhu
Matjaž Perc
Lei Shi
Zhen Wang
机构
[1] Northwestern Polytechnical University,School of Mechanical Engineering and School of Artificial Intelligence, Optics and Electronics (iOPEN)
[2] Northwestern Polytechnical University,School of Computer Science and School of Artificial Intelligence, Optics and Electronics (iOPEN)
[3] University of Maribor,Faculty of Natural Sciences and Mathematics
[4] China Medical University Hospital,Department of Medical Research
[5] China Medical University,School of Statistics and Mathematics
[6] Complexity Science Hub Vienna,School of Mechanical Engineering, School of Artificial Intelligence, Optics and Electronics (iOPEN), and School of Cybersecurity
[7] Alma Mater Europaea,undefined
[8] Yunnan University of Finance and Economics,undefined
[9] Northwestern Polytechnical University,undefined
来源
Nonlinear Dynamics | 2022年 / 108卷
关键词
Evolutionary dynamics; Prisoner’s dilemma; Cooperation; Self-regarding ; -learning; Lévy noise;
D O I
暂无
中图分类号
学科分类号
摘要
Uncertainties are ubiquitous in everyday life, and it is thus important to explore their effects on the evolution of cooperation. In this paper, the prisoner’s dilemma game with reinforcement learning subject to Lévy noise is studied. Specifically, diverse fluctuations mimicked by Lévy distributed noise are reflected in the payoff matrix of each player. At the same time, the self-regarding Q-learning algorithm is considered as the strategy update rule to learn the behavior that achieves the highest payoff. The results show that not only does Lévy noise promote the evolution of cooperation with reinforcement learning, it does so comparatively better than Gaussian noise. We explain this with the iterative updating pattern of the self-regarding Q-learning algorithm, which has an accumulative effect on the noise entering the payoff matrix. It turns out that under Lévy noise, the Q-value of cooperative behavior becomes significantly larger than that of defective behavior when the current strategy is defection, which ultimately leads to the prevalence of cooperation, while this is absent with Gaussian noise or without noise. This research thus unveils a particular positive role of Lévy noise in the evolutionary dynamics of social dilemmas.
引用
收藏
页码:1837 / 1845
页数:8
相关论文
共 50 条
  • [1] Levy noise promotes cooperation in the prisoner's dilemma game with reinforcement learning
    Wang, Lu
    Jia, Danyang
    Zhang, Long
    Zhu, Peican
    Perc, Matjaz
    Shi, Lei
    Wang, Zhen
    NONLINEAR DYNAMICS, 2022, 108 (02) : 1837 - 1845
  • [2] Historical payoff promotes cooperation in the prisoner's dilemma game
    Deng, Zhenghong
    Ma, Chunmiao
    Mao, Xudong
    Wang, Shenglan
    Niu, Zhenxi
    Gao, Li
    CHAOS SOLITONS & FRACTALS, 2017, 104 : 1 - 5
  • [3] Cautious strategy update promotes cooperation in spatial prisoner's dilemma game
    Liu, Yongkui
    Zhang, Lin
    Chen, Xiaojie
    Ren, Lei
    Wang, Long
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2013, 392 (17) : 3640 - 3647
  • [4] EFFECTS OF LEARNING ACTIVITY ON COOPERATION IN EVOLUTIONARY PRISONER'S DILEMMA GAME
    Chen, Xiaojie
    Fu, Feng
    Wang, Long
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2008, 19 (09): : 1377 - 1387
  • [5] Gradual learning supports cooperation in spatial prisoner's dilemma game
    Szolnoki, Attila
    Chen, Xiaojie
    CHAOS SOLITONS & FRACTALS, 2020, 130
  • [6] An Adaptive Strategy via Reinforcement Learning for the Prisoner's Dilemma Game
    Xue, Lei
    Sun, Changyin
    Wunsch, Donald
    Zhou, Yingjiang
    Yu, Fang
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2018, 5 (01) : 301 - 310
  • [7] Heterogeneity of Networks Promotes Cooperation in the Prisoner’s Dilemma and the Snowdrift Game
    Ruyu Li
    Zhaojin Xu
    Lianzhong Zhang
    Journal of the Korean Physical Society, 2019, 74 : 831 - 837
  • [8] Surrounding information consideration promotes cooperation in Prisoner's dilemma game
    Shu, Gang
    Du, Xia
    Li, Ya
    CHAOS SOLITONS & FRACTALS, 2016, 91 : 689 - 694
  • [9] Uneven Resources network promotes cooperation in the prisoner's dilemma game
    Wang, Zi-Ren
    Deng, Zheng-Hong
    Wang, Huan-Bo
    Li, HuXiong Li
    Fei-Wang, X.
    APPLIED MATHEMATICS AND COMPUTATION, 2022, 413
  • [10] Acceptability of strategy promotes cooperation in a spatial prisoner's dilemma game
    Su, Ran
    Qian, Jia-Li
    Hao, Qing-Yi
    Wu, Chao-Yun
    Guo, Ning
    Ling, Xiang
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2023, 2023 (01):