Towards Circular and Asymmetric Cooperation in a Multi-player Graph-based Iterated Prisoner's Dilemma

被引:0
作者
Le Gleau, Tangui [1 ,2 ]
Marjou, Xavier [1 ]
Lemlouma, Tayeb [2 ]
Radier, Benoit [1 ]
机构
[1] Orange Labs, Lannion, France
[2] IRISA, Lannion, France
来源
ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2 | 2022年
关键词
Game Theory; Non-cooperative Games; Iterated Prisoner's Dilemma; Tit-for-Tat;
D O I
10.5220/0010837900003116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In collaborations involving multiple actors, it is well known that tensions between individual interest and global welfare can emerge: actors are personally incentivized to have selfish behavior whereas mutual cooperation may provide a better outcome for all. Known as social dilemmas, these cooperation paradigms have aroused renewed interest in solving social issues, particularly in environmental and energy issues. Hybrids methods with Reinforcement Learning (RL) policies and Tit-for-Tat (TFT) strategies have proven successful to identify fruitful collaboration in complex social dilemmas. However, there are also many situations, where cooperation cannot always be given back directly, and has instead to be carried out through one or more intermediary actor(s). This specificity ruins win-win approaches like TFT. To address this specificity, we introduce a Graph-based Iterated Prisoner's Dilemma: a N-player game in which the possible cooperation between players is modeled by a weighted directed graph. In addition to this new paradigm, we propose a graph-based TFT algorithm that we evaluate on multiple scenarios and compare to other algorithms. Our experiments show that leveraging a graph-based structure in the original TFT algorithm allows it to spread favor better collaboration synergies in most situations.
引用
收藏
页码:293 / 303
页数:11
相关论文
共 29 条
  • [1] Agudo J. E., 2011, Proceedings of the 2011 Seventh International Conference on Computational Intelligence and Security (CIS 2011), P472, DOI 10.1109/CIS.2011.111
  • [2] [Anonymous], 1965, PRISONERS DILEMMA ST
  • [3] Cooperation in Prisoner's dilemma on graphs
    Ashlock, Daniel A.
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND GAMES, 2007, : 48 - 55
  • [4] THE EVOLUTION OF COOPERATION
    AXELROD, R
    HAMILTON, WD
    [J]. SCIENCE, 1981, 211 (4489) : 1390 - 1396
  • [5] Beaufils B., 2001, P ART INT SIM BEH S
  • [6] SOME EXPERIMENTAL GAMES
    FLOOD, MM
    [J]. MANAGEMENT SCIENCE, 1958, 5 (01) : 5 - 26
  • [7] Ford Jr L. R., 1956, NETWORK FLOW THEORY
  • [8] Ford L.R., 1956, Canad. J. Math., V8, P399, DOI 10.4153/CJM-1956-045-5
  • [9] Hager G. D., 2019, ARXIV PREPRINT ARXIV
  • [10] N-PERSON PRISONERS DILEMMA
    HAMBURGER, H
    [J]. JOURNAL OF MATHEMATICAL SOCIOLOGY, 1973, 3 (01) : 27 - 48