Learning enables adaptation in cooperation for multi-player stochastic games

被引:11
|
作者
Huang, Feng [1 ,2 ]
Cao, Ming [2 ]
Wang, Long [1 ]
机构
[1] Peking Univ, Coll Engn, Ctr Syst & Control, Beijing 100871, Peoples R China
[2] Univ Groningen, Fac Sci & Engn, Ctr Data Sci & Syst Complex, NL-9747 AG Groningen, Netherlands
基金
中国国家自然科学基金; 欧洲研究理事会;
关键词
reinforcement learning; evolutionary game theory; stochastic game; adaptive behaviour; social dilemma; EVOLUTIONARY DYNAMICS; COLLECTIVE ACTION; STABILITY; EMERGENCE; TRAGEDY; RISK;
D O I
10.1098/rsif.2020.0639
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Interactions among individuals in natural populations often occur in a dynamically changing environment. Understanding the role of environmental variation in population dynamics has long been a central topic in theoretical ecology and population biology. However, the key question of how individuals, in the middle of challenging social dilemmas (e.g. the 'tragedy of the commons'), modulate their behaviours to adapt to the fluctuation of the environment has not yet been addressed satisfactorily. Using evolutionary game theory, we develop a framework of stochastic games that incorporates the adaptive mechanism of reinforcement learning to investigate whether cooperative behaviours can evolve in the ever-changing group interaction environment. When the action choices of players are just slightly influenced by past reinforcements, we construct an analytical condition to determine whether cooperation can be favoured over defection. Intuitively, this condition reveals why and how the environment can mediate cooperative dilemmas. Under our model architecture, we also compare this learning mechanism with two non-learning decision rules, and we find that learning significantly improves the propensity for cooperation in weak social dilemmas, and, in sharp contrast, hinders cooperation in strong social dilemmas. Our results suggest that in complex social-ecological dilemmas, learning enables the adaptation of individuals to varying environments.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Cooperative Multi-player Multi-Armed Bandit: Computation Offloading in a Vehicular Cloud Network
    Xu, Shilin
    Guo, Caili
    Hu, Rose Qingyang
    Qian, Yi
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [42] Millimeter-Wave Concurrent Beamforming: A Multi-Player Multi-Armed Bandit Approach
    Mohamed, Ehab Mahmoud
    Hashima, Sherief
    Hatano, Kohei
    Kasban, Hani
    Rihan, Mohamed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 65 (03): : 1987 - 2007
  • [43] Bayesian Method-Based Learning Automata for Two-Player Stochastic Games with Incomplete Information
    Hua Ding
    Chong Di
    Li Shenghong
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 25 - 33
  • [44] Stochastic Stability in Three-Player Games with Time Delays
    Jacek Miȩkisz
    Michał Matuszak
    Jan Poleszczuk
    Dynamic Games and Applications, 2014, 4 : 489 - 498
  • [45] Multi-Player Evolutionary Game of Network Attack and Defense Based on System Dynamics
    Yang, Pengxi
    Gao, Fei
    Zhang, Hua
    MATHEMATICS, 2021, 9 (23)
  • [46] The average abundance function with mutation of the multi-player snowdrift evolutionary game model
    Xia, Ke
    Wang, Xianjia
    ACTA MATHEMATICA SCIENTIA, 2021, 41 (01) : 127 - 163
  • [47] Finite-time safe reinforcement learning control of multi-player nonzero-sum game for quadcopter systems
    Tan, Junkai
    Xue, Shuangsi
    Guan, Qingshu
    Qu, Kai
    Cao, Hui
    INFORMATION SCIENCES, 2025, 712
  • [48] The mechanism of alliance promotes cooperation in the spatial multi-games
    Li, Xiaopeng
    Wang, Huaibin
    Hao, Gang
    Xia, Chengyi
    PHYSICS LETTERS A, 2020, 384 (20)
  • [49] Decentralized optimal large scale multi-player pursuit-evasion strategies: A mean field game approach with reinforcement learning
    Zhou, Zejian
    Xu, Hao
    NEUROCOMPUTING, 2022, 484 : 46 - 58
  • [50] Evolution of Global Cooperation in Multi-Level Threshold Public Goods Games With Income Redistribution
    Du, Jinming
    Wang, Baokui
    FRONTIERS IN PHYSICS, 2018, 6