Reputation-Based Interaction Promotes Cooperation With Reinforcement Learning

被引:2
作者
Ren, Tianyu [1 ]
Zeng, Xiao-Jun [1 ]
机构
[1] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, England
关键词
Games; Statistics; Sociology; Heuristic algorithms; Simulation; Task analysis; Intelligent agents; Computer simulation; evolutionary computation; intelligent agents; multiagent systems; nonlinear network analysis; EVOLUTION; GAMES; NORM;
D O I
10.1109/TEVC.2023.3304911
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamical interaction represents a fundamental coevolutionary rule that addresses the intricacies of cooperation in social dilemmas. It provides a normative account for the changes in ties within interaction networks in response to the behavior of social partners. While considerable efforts have explored the role of partner selection in fostering cooperation, there remains a limited understanding of how agents learn to establish effective interaction patterns and adapt their connections accordingly. To bridge this knowledge gap, we leverage recent advancements in reinforcement learning (RL) and propose an adaptive interaction mechanism to investigate self-organization behavior in the iterated prisoner's dilemma game. Within this framework, artificial agents are trained using a self-regarding Roth-Erev algorithm, utilizing reputation as a dynamic signal to update their willingness to engage with neighbors. Additionally, these agents are endowed with the capability to sever inactive connections. Simulation results demonstrate the effectiveness of utilizing RL and local information from reputation to capture the dynamics of interactions. Notably, we discover that the entangled coevolution of strategy and interaction network can facilitate the emergence and maintenance of cooperation, despite the optimal tolerance threshold for ineffective neighbors varying depending on the strength of the social dilemma. Furthermore, the emerging network topology presented in this work accurately captures the assortative mixing pattern observed in previous experiments and realistic evidence. Finally, we validate the simulation results through theoretical analysis and confirm the robustness of the proposed mechanism across populations of varying sizes and initial structures.
引用
收藏
页码:1177 / 1188
页数:12
相关论文
共 67 条
[1]  
Anastassacos N, 2020, AAAI CONF ARTIF INTE, V34, P7047
[2]   Mathematical foundations of moral preferences [J].
Capraro, Valerio ;
Perc, Matjaz .
JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2021, 18 (175)
[3]   Cooperative AI: machines must learn to find common ground [J].
Dafoe, Allan ;
Bachrach, Yoram ;
Hadfield, Gillian ;
Horvitz, Eric ;
Larson, Kate ;
Graepel, Thore .
NATURE, 2021, 593 (7857) :33-36
[4]   Coevolutionary games on networks [J].
Ebel, H ;
Bornholdt, S .
PHYSICAL REVIEW E, 2002, 66 (05) :8-056118
[5]   Coevolutionary dynamics of opinions and networks: From diversity to uniformity [J].
Fu, Feng ;
Wang, Long .
PHYSICAL REVIEW E, 2008, 78 (01)
[6]   Cooperation in evolving social networks [J].
Hanaki, Nobuyuki ;
Peterhansl, Alexander ;
Dodds, Peter S. ;
Watts, Duncan J. .
MANAGEMENT SCIENCE, 2007, 53 (07) :1036-1050
[7]   TRAGEDY OF COMMONS [J].
HARDIN, G .
SCIENCE, 1968, 162 (3859) :1243-+
[8]  
Hou Y., 2023, IEEE Trans. Evol Compat., DOI [10.1109/IT/VC.2023.3268076, DOI 10.1109/IT/VC.2023.3268076]
[9]   Adaptive Reputation Promotes Trust in Social Networks [J].
Hu, Zhengyang ;
Li, Xiaopeng ;
Wang, Juan ;
Xia, Chengyi ;
Wang, Zhen ;
Perc, Matjaz .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (04) :3087-3098
[10]   Scaling the phase-planes of social dilemma strengths shows game-class changes n the five rules governing the evolution of cooperation [J].
Ito, Hiromu ;
Tanimoto, Jun .
ROYAL SOCIETY OPEN SCIENCE, 2018, 5 (10)