Neighbor-aware reinforcement learning fosters cooperation in spatial public goods games

被引:0
作者
Kang, Hongwei [1 ]
Jiang, Chao [1 ]
Shen, Yong [1 ]
Sun, Xingping [1 ]
Chen, Qingyi [1 ]
机构
[1] Yunnan Univ, Sch Software, Kunming 650000, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Spatial public goods game; Neighbor influence; Reputation mechanism; Perceptual range; EVOLUTIONARY GAMES; PRISONERS-DILEMMA; REPUTATION; PUNISHMENT; DYNAMICS; TRAGEDY;
D O I
10.1016/j.chaos.2025.116862
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The emergence and self-organization of cooperation within social dilemmas remain a central challenge in the study of evolutionary public goods games. Traditional models often rely on fixed behavioral update rules, inadequately capturing the learning capabilities of real-world individuals. To address this, we develop and investigate a novel model of cooperative evolution that integrates individual reinforcement learning (RL) with a mechanism termed "Neighbor Influence" (NI). In this framework, agents are not passive rule-followers; they employ Q-learning to dynamically decide between cooperation and defection. Importantly, their decisions are shaped by both individual trial-and-error and the influence of their neighbors. This study systematically analyzes the role of the NI mechanism and explores how reputation systems and agents' perceptual ranges (direct versus indirect neighbors) modulate cooperative evolution. Extensive simulations reveal that NI is crucial for promoting the spontaneous formation of cooperative clusters, thereby mitigating the "Tragedy of the Commons". This mechanism substantially boosts cooperation levels, even under conditions where conventional RL models falter. Our research further elucidates how the strength of neighbor influence, individual preferences for reputation versus direct payoffs, and perceptual range critically shape the dynamics and eventual stability of cooperative behavior. By focusing on adaptive learning and local social interactions, this work offers a computational model that clarifies the mechanisms underpinning macroscopic cooperation in complex systems and provides valuable theoretical insights for designing strategies to foster collective collaboration.
引用
收藏
页数:10
相关论文
共 64 条
[1]   Robustness and resilience of complex networks [J].
Artime, Oriol ;
Grassia, Marco ;
De Domenico, Manlio ;
Gleeson, James P. ;
Makse, Hernan A. ;
Mangioni, Giuseppe ;
Perc, Matjaz ;
Radicchi, Filippo .
NATURE REVIEWS PHYSICS, 2024, 6 (02) :114-131
[2]   THE EVOLUTION OF COOPERATION [J].
AXELROD, R ;
HAMILTON, WD .
SCIENCE, 1981, 211 (4489) :1390-1396
[3]   Preferential selection based on adaptive attractiveness induce by reinforcement learning promotes cooperation [J].
Bai, Pengzhou ;
Qiang, Bingzhuang ;
Zou, Kuan ;
Huang, Changwei .
CHAOS SOLITONS & FRACTALS, 2024, 180
[4]   Evolutionary games for cooperation in open data management [J].
Benko, Tina P. ;
Pi, Bin ;
Li, Qin ;
Feng, Minyu ;
Perc, Matjaz ;
Vosner, Helena Blazun .
APPLIED MATHEMATICS AND COMPUTATION, 2025, 496
[5]   Punishment and reputation in spatial public goods games [J].
Brandt, H ;
Hauert, C ;
Sigmund, K .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2003, 270 (1519) :1099-1104
[6]   The evolutionary public goods game on scale-free networks with heterogeneous investment [J].
Cao, Xian-Bin ;
Du, Wen-Bo ;
Rong, Zhi-Hai .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2010, 389 (06) :1273-1280
[7]   Competition and cooperation among different punishing strategies in the spatial public goods game [J].
Chen, Xiaojie ;
Szolnoki, Attila ;
Perc, Matjaz .
PHYSICAL REVIEW E, 2015, 92 (01)
[8]   First carrot, then stick: how the adaptive hybridization of incentives promotes cooperation [J].
Chen, Xiaojie ;
Sasaki, Tatsuya ;
Braennstroem, Ake ;
Dieckmann, Ulf .
JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2015, 12 (102)
[9]   Probabilistic sharing solves the problem of costly punishment [J].
Chen, Xiaojie ;
Szolnoki, Attila ;
Perc, Matjaz .
NEW JOURNAL OF PHYSICS, 2014, 16
[10]   Models of cooperation based on the Prisoner's Dilemma and the Snowdrift game [J].
Doebeli, M ;
Hauert, C .
ECOLOGY LETTERS, 2005, 8 (07) :748-766