Learning enables adaptation in cooperation for multi-player stochastic games

被引:11
|
作者
Huang, Feng [1 ,2 ]
Cao, Ming [2 ]
Wang, Long [1 ]
机构
[1] Peking Univ, Coll Engn, Ctr Syst & Control, Beijing 100871, Peoples R China
[2] Univ Groningen, Fac Sci & Engn, Ctr Data Sci & Syst Complex, NL-9747 AG Groningen, Netherlands
基金
中国国家自然科学基金; 欧洲研究理事会;
关键词
reinforcement learning; evolutionary game theory; stochastic game; adaptive behaviour; social dilemma; EVOLUTIONARY DYNAMICS; COLLECTIVE ACTION; STABILITY; EMERGENCE; TRAGEDY; RISK;
D O I
10.1098/rsif.2020.0639
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Interactions among individuals in natural populations often occur in a dynamically changing environment. Understanding the role of environmental variation in population dynamics has long been a central topic in theoretical ecology and population biology. However, the key question of how individuals, in the middle of challenging social dilemmas (e.g. the 'tragedy of the commons'), modulate their behaviours to adapt to the fluctuation of the environment has not yet been addressed satisfactorily. Using evolutionary game theory, we develop a framework of stochastic games that incorporates the adaptive mechanism of reinforcement learning to investigate whether cooperative behaviours can evolve in the ever-changing group interaction environment. When the action choices of players are just slightly influenced by past reinforcements, we construct an analytical condition to determine whether cooperation can be favoured over defection. Intuitively, this condition reveals why and how the environment can mediate cooperative dilemmas. Under our model architecture, we also compare this learning mechanism with two non-learning decision rules, and we find that learning significantly improves the propensity for cooperation in weak social dilemmas, and, in sharp contrast, hinders cooperation in strong social dilemmas. Our results suggest that in complex social-ecological dilemmas, learning enables the adaptation of individuals to varying environments.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Two-player stochastic games II: The case of recursive games
    Vieille, N
    ISRAEL JOURNAL OF MATHEMATICS, 2000, 119 (1) : 93 - 126
  • [32] Neural networks-based optimal tracking control for nonzero-sum games of multi-player continuous-time nonlinear systems via reinforcement learning
    Zhao, Jingang
    NEUROCOMPUTING, 2020, 412 : 167 - 176
  • [33] Learning and cooperation in sequential games
    Valluri, Annapurna
    ADAPTIVE BEHAVIOR, 2006, 14 (03) : 195 - 209
  • [35] Two-player stochastic games I: A reduction
    Nicolas Vieille
    Israel Journal of Mathematics, 2000, 119 : 55 - 91
  • [36] Multi-Player Pursuit-Evasion Differential Game with Equal Speed
    Al-Talabi, Ahmad A.
    2017 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2017,
  • [37] Evolution of coordination in pairwise and multi-player interactions via prior commitments
    Ogbo, Ndidi Bianca
    Elgarig, Aiman
    Han, The Anh
    ADAPTIVE BEHAVIOR, 2022, 30 (03) : 257 - 277
  • [38] A Multi-Player Framework for Sustainable Traffic Optimization in the Era of Digital Transportation
    Kotsi, Areti
    Politis, Ioannis
    Chaniotakis, Emmanouil
    Mitsakis, Evangelos
    INFRASTRUCTURES, 2025, 10 (01)
  • [39] Data-based approximate optimal control for nonzero-sum games of multi-player systems using adaptive dynamic programming
    Jiang, He
    Zhang, Huaguang
    Xiao, Geyang
    Cui, Xiaohong
    NEUROCOMPUTING, 2018, 275 : 192 - 199
  • [40] Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games
    Li, Weifan
    Zhuand, Yuanheng
    Zhao, Dongbin
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 57 - 63