Spatial reciprocity under reinforcement learning mechanism

被引:0
|
作者
Wang, Lu [1 ]
Shi, Xiaoqiu [1 ,2 ]
Zhou, Yang [3 ]
机构
[1] Southwest Univ Sci & Technol, Sch Mfg Sci & Engn, Mianyang 621000, Sichuan, Peoples R China
[2] Mianyang Sci & Technol City Intelligent Mfg Ind Te, Mianyang 621000, Sichuan, Peoples R China
[3] Southwest Univ Sci & Technol, Engn Technol Ctr, Mianyang 621000, Sichuan, Peoples R China
关键词
NETWORK RECIPROCITY; EVOLUTIONARY DYNAMICS; GAME;
D O I
10.1063/5.0246843
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
At present, the research on the dynamics of cooperative behavior of agents under reinforcement learning mechanism either assumes that agents have global interaction, that is, agents interact with all other agents in the population, or directly study the influence of relevant factors on cooperation evolution based on the local interaction in a network structure. It neglects to formally study how the limitation of agents that only interact with local agents affects their strategy choice. Thus, in this paper, we study the cooperative behavior of agents in a typical social decision-making environment with conflicts between individual interests and collective interests. On the one hand, a programmed game model in game theory, namely, prisoner's dilemma game, is used to capture the essence of real-world dilemmas. On the other hand, the effects of local and global strategy learning on the cooperative evolution of agents are investigated separately, and the nature of spatial reciprocity under the reinforcement learning mechanism is found. Specifically, when there is no inherent connection between the interacting agents and the learning agents within the system, the network structure has a limited effect on promoting cooperation. It is only when there is an overlap between the interacting agents and the learning agents that the spatial reciprocity effect observed in the traditional evolutionary game theory can be fully realized.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] LEARNING UNDER CONDITIONS OF DIRECT AND VICARIOUS REINFORCEMENT
    VANWAGENEN, RK
    TRAVERS, RMW
    JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1963, 54 (06) : 356 - 362
  • [32] Online Inverse Reinforcement Learning Under Occlusion
    Arora, Saurabh
    Doshi, Prashant
    Banerjee, Bikramjit
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1170 - 1178
  • [33] Equivariant Reinforcement Learning under Partial Observability
    Nguyen, Hai
    Baisero, Andrea
    Klee, David
    Wang, Dian
    Platt, Robert
    Amato, Christopher
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [34] Attainability of boundary points under reinforcement learning
    Hopkins, E
    Posch, M
    GAMES AND ECONOMIC BEHAVIOR, 2005, 53 (01) : 110 - 125
  • [35] Portfolio Optimization Under The Framework Of Reinforcement Learning
    Li Xucheng
    Peng Zhihao
    2019 11TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2019), 2019, : 799 - 802
  • [36] A spatial dependency based reinforcement learning model for selecting features in spatial classification
    Wei, Cheng
    Yu, Wenhao
    GEOINFORMATICA, 2025, 29 (01) : 1 - 29
  • [37] Inverse Reinforcement Learning Under Noisy Observations
    Shahryari, Shervin
    Doshi, Prashant
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1733 - 1735
  • [38] A Procedural Constructive Learning Mechanism with Deep Reinforcement Learning for Cognitive Agents
    Rossi, Leonardo de Lellis
    Rohmer, Eric
    Costa, Paula Dornhofer Paro
    Colombini, Esther Luna
    Simoes, Alexandre da Silva
    Gudwin, Ricardo Ribeiro
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)
  • [39] A Procedural Constructive Learning Mechanism with Deep Reinforcement Learning for Cognitive Agents
    Leonardo de Lellis Rossi
    Eric Rohmer
    Paula Dornhofer Paro Costa
    Esther Luna Colombini
    Alexandre da Silva Simões
    Ricardo Ribeiro Gudwin
    Journal of Intelligent & Robotic Systems, 2024, 110
  • [40] Reinforcement learning with the mechanism of short-term depression for learning rate
    Kubota, S.
    PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11), 2011, : 800 - 803