Admission-Based Reinforcement-Learning Algorithm in Sequential Social Dilemmas

被引:4
|
作者
Guo, Ting [1 ,2 ]
Yuan, Yuyu [1 ,2 ]
Zhao, Pengqian [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Natl Pilot Software Engn Sch, Beijing 100876, Peoples R China
[2] Minist Educ, Key Lab Trustworthy Distributed Comp & Serv, Beijing 100876, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 03期
基金
中国国家自然科学基金;
关键词
multi-agent reinforcement learning; hierarchical network; the give-or-take-some paradigm; sequential social dilemmas; TRAGEDY; COMMONS; GAMES;
D O I
10.3390/app13031807
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Recently, the social dilemma problem is no longer limited to unrealistic stateless matrix games but has been extended to temporally and spatially extended Markov games by multi-agent reinforcement learning. Many multi-agent reinforcement-learning algorithms have been proposed to solve sequential social dilemmas. However, most current algorithms focus on cooperation to improve the overall reward while ignoring the equality among agents, which could be improved in terms of practicality. Here, we propose a novel admission-based hierarchical multi-agent reinforcement-learning algorithm to promote cooperation and equality among agents. We extend the give-or-take-some model to Markov games, decompose the fairness of each agent, and propose an Admission reward. For better learning, we design a hierarchy consisting of a high-level policy and multiple low-level policies, where the high-level policy maximizes the Admission reward by choosing different low-level policies to interact with environments. In addition, the learning and execution of policies are realized through a decentralized method. We conduct experiments in multiple sequential social dilemmas environments and show that the Admission algorithm significantly outperforms the baselines, demonstrating that our algorithm can learn cooperation and equality well.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Multi-agent Reinforcement Learning in Sequential Social Dilemmas
    Leibo, Joel Z.
    Zambaldi, Vinicius
    Lanctot, Marc
    Marecki, Janusz
    Graepel, Thore
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 464 - 473
  • [2] Collaborative Reinforcement Learning Model for Sustainability of Cooperation in Sequential Social Dilemmas
    Chaudhuri, Ritwik
    Mukherjee, Kushal
    Narayanam, Ramasuri
    Vallam, Rohith Dwarakanath
    Kumar, Ayush
    Mathur, Antriksh
    Garg, Shweta
    Singh, Sudhanshu
    Parija, Gyana
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1877 - 1879
  • [3] A reinforcement-learning approach for admission control in distributed network service systems
    Xiaonong Lu
    Baoqun Yin
    Haipeng Zhang
    Journal of Combinatorial Optimization, 2016, 31 : 1241 - 1268
  • [4] Collaborative Reinforcement Learning Framework to Model Evolution of Cooperation in Sequential Social Dilemmas
    Chaudhuri, Ritwik
    Mukherjee, Kushal
    Narayanam, Ramasuri
    Vallam, Rohith D.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I, 2021, 12712 : 15 - 26
  • [5] A reinforcement-learning approach for admission control in distributed network service systems
    Lu, Xiaonong
    Yin, Baoqun
    Zhang, Haipeng
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2016, 31 (03) : 1241 - 1268
  • [6] A Reinforcement-Learning Style Algorithm for Black Box Automata
    Cohen, Itay
    Fogler, Roi
    Peled, Doron
    2022 20TH ACM-IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN (MEMOCODE), 2022,
  • [7] Reinforcement Learning Dynamics in Social Dilemmas
    Izquierdo, Segismundo S.
    Izquierdo, Luis R.
    Gotts, Nicholas M.
    JASSS-THE JOURNAL OF ARTIFICIAL SOCIETIES AND SOCIAL SIMULATION, 2008, 11 (02):
  • [8] Social Learning for Sequential Driving Dilemmas
    Chen, Xu
    Di, Xuan
    Li, Zechu
    GAMES, 2023, 14 (03):
  • [9] A hyper-heuristic based reinforcement-learning algorithm to train feedforward neural networks
    Ozsoydan, Fehmi Burcin
    Golcuk, Lker
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2022, 35
  • [10] A Reinforcement-Learning Algorithm for Sampling Design in Markov Random Fields
    Bonneau, Mathieu
    Peyrard, Nathalie
    Sabbadin, Regis
    20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 181 - 186