Learning enables adaptation in cooperation for multi-player stochastic games

Cited by: 11

Authors:

Huang, Feng ^{[1,2]}

Cao, Ming ^{[2]}

Wang, Long ^{[1]}

Affiliations:

[1] Peking Univ, Coll Engn, Ctr Syst & Control, Beijing 100871, Peoples R China

[2] Univ Groningen, Fac Sci & Engn, Ctr Data Sci & Syst Complex, NL-9747 AG Groningen, Netherlands

Source:

JOURNAL OF THE ROYAL SOCIETY INTERFACE | 2020 / Volume 17 / Issue 172

Funding:

National Natural Science Foundation of China; European Research Council;

Keywords:

reinforcement learning; evolutionary game theory; stochastic game; adaptive behaviour; social dilemma; EVOLUTIONARY DYNAMICS; COLLECTIVE ACTION; STABILITY; EMERGENCE; TRAGEDY; RISK;

DOI:

10.1098/rsif.2020.0639

Chinese Library Classification:

O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

Interactions among individuals in natural populations often occur in a dynamically changing environment. Understanding the role of environmental variation in population dynamics has long been a central topic in theoretical ecology and population biology. However, the key question of how individuals, in the middle of challenging social dilemmas (e.g. the 'tragedy of the commons'), modulate their behaviours to adapt to the fluctuation of the environment has not yet been addressed satisfactorily. Using evolutionary game theory, we develop a framework of stochastic games that incorporates the adaptive mechanism of reinforcement learning to investigate whether cooperative behaviours can evolve in the ever-changing group interaction environment. When the action choices of players are just slightly influenced by past reinforcements, we construct an analytical condition to determine whether cooperation can be favoured over defection. Intuitively, this condition reveals why and how the environment can mediate cooperative dilemmas. Under our model architecture, we also compare this learning mechanism with two non-learning decision rules, and we find that learning significantly improves the propensity for cooperation in weak social dilemmas, and, in sharp contrast, hinders cooperation in strong social dilemmas. Our results suggest that in complex social-ecological dilemmas, learning enables the adaptation of individuals to varying environments.


Pages: 12

50 records in total

[31] Two-player stochastic games II: The case of recursive games
Vieille, N
ISRAEL JOURNAL OF MATHEMATICS, 2000, 119 (1) : 93 - 126
[32] Neural networks-based optimal tracking control for nonzero-sum games of multi-player continuous-time nonlinear systems via reinforcement learning
Zhao, Jingang
NEUROCOMPUTING, 2020, 412 : 167 - 176
[33] Learning and cooperation in sequential games
Valluri, Annapurna
ADAPTIVE BEHAVIOR, 2006, 14 (03) : 195 - 209
[34] Stochastic stability in spatial three-player games
Miekisz, J
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2004, 343 : 175 - 184
[35] Two-player stochastic games I: A reduction
Vieille, Nicolas
ISRAEL JOURNAL OF MATHEMATICS, 2000, 119 : 55 - 91
[36] Multi-Player Pursuit-Evasion Differential Game with Equal Speed
Al-Talabi, Ahmad A.
2017 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2017,
[37] Evolution of coordination in pairwise and multi-player interactions via prior commitments
Ogbo, Ndidi Bianca
Elragig, Aiman
Han, The Anh
ADAPTIVE BEHAVIOR, 2022, 30 (03) : 257 - 277
[38] A Multi-Player Framework for Sustainable Traffic Optimization in the Era of Digital Transportation
Kotsi, Areti
Politis, Ioannis
Chaniotakis, Emmanouil
Mitsakis, Evangelos
INFRASTRUCTURES, 2025, 10 (01)
[39] Data-based approximate optimal control for nonzero-sum games of multi-player systems using adaptive dynamic programming
Jiang, He
Zhang, Huaguang
Xiao, Geyang
Cui, Xiaohong
NEUROCOMPUTING, 2018, 275 : 192 - 199
[40] Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games
Li, Weifan
Zhu, Yuanheng
Zhao, Dongbin
2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 57 - 63
