Learning enables adaptation in cooperation for multi-player stochastic games

被引：11

作者：

Huang, Feng ^{[1
,2
]}

Cao, Ming ^{[2
]}

Wang, Long ^{[1
]}

机构：

[1] Peking Univ, Coll Engn, Ctr Syst & Control, Beijing 100871, Peoples R China

[2] Univ Groningen, Fac Sci & Engn, Ctr Data Sci & Syst Complex, NL-9747 AG Groningen, Netherlands

来源：

JOURNAL OF THE ROYAL SOCIETY INTERFACE | 2020年 / 17卷 / 172期

基金：

中国国家自然科学基金; 欧洲研究理事会;

关键词：

reinforcement learning; evolutionary game theory; stochastic game; adaptive behaviour; social dilemma; EVOLUTIONARY DYNAMICS; COLLECTIVE ACTION; STABILITY; EMERGENCE; TRAGEDY; RISK;

D O I：

10.1098/rsif.2020.0639

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Interactions among individuals in natural populations often occur in a dynamically changing environment. Understanding the role of environmental variation in population dynamics has long been a central topic in theoretical ecology and population biology. However, the key question of how individuals, in the middle of challenging social dilemmas (e.g. the 'tragedy of the commons'), modulate their behaviours to adapt to the fluctuation of the environment has not yet been addressed satisfactorily. Using evolutionary game theory, we develop a framework of stochastic games that incorporates the adaptive mechanism of reinforcement learning to investigate whether cooperative behaviours can evolve in the ever-changing group interaction environment. When the action choices of players are just slightly influenced by past reinforcements, we construct an analytical condition to determine whether cooperation can be favoured over defection. Intuitively, this condition reveals why and how the environment can mediate cooperative dilemmas. Under our model architecture, we also compare this learning mechanism with two non-learning decision rules, and we find that learning significantly improves the propensity for cooperation in weak social dilemmas, and, in sharp contrast, hinders cooperation in strong social dilemmas. Our results suggest that in complex social-ecological dilemmas, learning enables the adaptation of individuals to varying environments.

引用

页数：12

共 50 条

[41] Cooperative Multi-player Multi-Armed Bandit: Computation Offloading in a Vehicular Cloud Network
Xu, Shilin
Guo, Caili
Hu, Rose Qingyang
Qian, Yi
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[42] Millimeter-Wave Concurrent Beamforming: A Multi-Player Multi-Armed Bandit Approach
Mohamed, Ehab Mahmoud
Hashima, Sherief
Hatano, Kohei
Kasban, Hani
Rihan, Mohamed
CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 65 (03): : 1987 - 2007
[43] Bayesian Method-Based Learning Automata for Two-Player Stochastic Games with Incomplete Information
Hua Ding
Chong Di
Li Shenghong
COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 25 - 33
[44] Stochastic Stability in Three-Player Games with Time Delays
Jacek Miȩkisz
Michał Matuszak
Jan Poleszczuk
Dynamic Games and Applications, 2014, 4 : 489 - 498
[45] Multi-Player Evolutionary Game of Network Attack and Defense Based on System Dynamics
Yang, Pengxi
Gao, Fei
Zhang, Hua
MATHEMATICS, 2021, 9 (23)
[46] The average abundance function with mutation of the multi-player snowdrift evolutionary game model
Xia, Ke
Wang, Xianjia
ACTA MATHEMATICA SCIENTIA, 2021, 41 (01) : 127 - 163
[47] Finite-time safe reinforcement learning control of multi-player nonzero-sum game for quadcopter systems
Tan, Junkai
Xue, Shuangsi
Guan, Qingshu
Qu, Kai
Cao, Hui
INFORMATION SCIENCES, 2025, 712
[48] The mechanism of alliance promotes cooperation in the spatial multi-games
Li, Xiaopeng
Wang, Huaibin
Hao, Gang
Xia, Chengyi
PHYSICS LETTERS A, 2020, 384 (20)
[49] Decentralized optimal large scale multi-player pursuit-evasion strategies: A mean field game approach with reinforcement learning
Zhou, Zejian
Xu, Hao
NEUROCOMPUTING, 2022, 484 : 46 - 58
[50] Evolution of Global Cooperation in Multi-Level Threshold Public Goods Games With Income Redistribution
Du, Jinming
Wang, Baokui
FRONTIERS IN PHYSICS, 2018, 6

← 1 2 3 4 5 →