Lévy noise promotes cooperation in the prisoner’s dilemma game with reinforcement learning

被引：0

作者：

Lu Wang

Danyang Jia

Long Zhang

Peican Zhu

Matjaž Perc

Lei Shi

Zhen Wang

机构：

[1] Northwestern Polytechnical University,School of Mechanical Engineering and School of Artificial Intelligence, Optics and Electronics (iOPEN)

[2] Northwestern Polytechnical University,School of Computer Science and School of Artificial Intelligence, Optics and Electronics (iOPEN)

[3] University of Maribor,Faculty of Natural Sciences and Mathematics

[4] China Medical University Hospital,Department of Medical Research

[5] China Medical University,School of Statistics and Mathematics

[6] Complexity Science Hub Vienna,School of Mechanical Engineering, School of Artificial Intelligence, Optics and Electronics (iOPEN), and School of Cybersecurity

[7] Alma Mater Europaea,undefined

[8] Yunnan University of Finance and Economics,undefined

[9] Northwestern Polytechnical University,undefined

来源：

Nonlinear Dynamics | 2022年 / 108卷

关键词：

Evolutionary dynamics; Prisoner’s dilemma; Cooperation; Self-regarding ; -learning; Lévy noise;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Uncertainties are ubiquitous in everyday life, and it is thus important to explore their effects on the evolution of cooperation. In this paper, the prisoner’s dilemma game with reinforcement learning subject to Lévy noise is studied. Specifically, diverse fluctuations mimicked by Lévy distributed noise are reflected in the payoff matrix of each player. At the same time, the self-regarding Q-learning algorithm is considered as the strategy update rule to learn the behavior that achieves the highest payoff. The results show that not only does Lévy noise promote the evolution of cooperation with reinforcement learning, it does so comparatively better than Gaussian noise. We explain this with the iterative updating pattern of the self-regarding Q-learning algorithm, which has an accumulative effect on the noise entering the payoff matrix. It turns out that under Lévy noise, the Q-value of cooperative behavior becomes significantly larger than that of defective behavior when the current strategy is defection, which ultimately leads to the prevalence of cooperation, while this is absent with Gaussian noise or without noise. This research thus unveils a particular positive role of Lévy noise in the evolutionary dynamics of social dilemmas.

引用

页码：1837 / 1845

页数：8

共 50 条

[21] Asymmetric population promotes and jeopardizes cooperation in spatial prisoner's dilemma game
Sharma, Gopal
He, Zhixue
Shen, Chen
Tanimoto, Jun
CHAOS SOLITONS & FRACTALS, 2024, 187
[22] Understanding cooperation in the Prisoner's Dilemma game
Pothos, Emmanuel M.
Perry, Gavin
Corr, Philip J.
Matthew, Mervin R.
Busemeyer, Jerome R.
PERSONALITY AND INDIVIDUAL DIFFERENCES, 2011, 51 (03) : 210 - 215
[23] EFFECT OF LEARNING IN THE PRISONER'S DILEMMA GAME
Vesely, Stepan
STUDIA PSYCHOLOGICA, 2012, 54 (02) : 143 - 156
[24] An Adaptive Strategy via Reinforcement Learning for the Prisoner's Dilemma Game
Lei Xue
Changyin Sun
Donald Wunsch
Yingjiang Zhou
Fang Yu
IEEE/CAA Journal of Automatica Sinica, 2018, (01) : 301 - 310
[25] Cooperation in the prisoner's dilemma game on tunable community networks
Liu, Penghui
Liu, Jing
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2017, 472 : 156 - 163
[26] Sustaining Mutual Cooperation in Iterated Prisoner's Dilemma Game
Minsam, Kim
Yip, Szeto Kwok
30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 335 - 337
[27] Locus of control and learning to cooperate in a prisoner's dilemma game
Boone, C
De Brabander, B
Carree, M
de Jong, G
van Olffen, W
van Witteloostuijn, A
PERSONALITY AND INDIVIDUAL DIFFERENCES, 2002, 32 (05) : 929 - 946
[28] Environmental feedback promotes cooperation in a spatial prisoner's dilemma game with preferential selection
Li, Minlan
Wang, Chao
Han, Yanyan
Wang, Si-Yi
Wang, Ruiwu
APPLIED MATHEMATICS AND COMPUTATION, 2025, 495
[29] Integrating neighborhoods in the evaluation of fitness promotes cooperation in the spatial prisoner's dilemma game
Wang, Zhen
Du, Wen-Bo
Cao, Xian-Bin
Zhang, Lian-Zhong
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2011, 390 (07) : 1234 - 1239
[30] Reputation-based popularity promotes cooperation in the spatial prisoner's dilemma game
Chu, Chen
Zhai, Yao
Mu, Chunjiang
Hu, Die
Li, Tong
Shi, Lei
APPLIED MATHEMATICS AND COMPUTATION, 2019, 362

← 1 2 3 4 5 →