Theoretical advantages of lenient learners: An evolutionary game theoretic perspective

被引:0
作者
Panait, Liviu [1 ]
Tuyls, Karl [2 ]
Luke, Sean [3 ]
机构
[1] Google Inc, Santa Monica, CA 90401 USA
[2] Maastricht Univ, MiCC IKAT, Maastricht, Netherlands
[3] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA
关键词
multiagent learning; reinforcement learning; cooperative coevolution; evolutionary game theory; formal models; visualization; basins of attraction;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the dynamics of multiple learning agents from an evolutionary game theoretic perspective. We provide replicator dynamics models for cooperative coevolutionary algorithms and for traditional multiagent Q-learning, and we extend these differential equations to account for lenient learners: agents that forgive possible mismatched teammate actions that resulted in low rewards. We use these extended formal models to study the convergence guarantees for these algorithms, and also to visualize the basins of attraction to optimal and suboptimal solutions in two benchmark coordination problems. The paper demonstrates that lenience provides learners with more accurate information about the benefits of performing their actions, resulting in higher likelihood of convergence to the globally optimal solution. In addition, the analysis indicates that the choice of learning algorithm has an insignificant impact on the overall performance of multiagent learning algorithms; rather, the performance of these algorithms depends primarily on the level of lenience that the agents exhibit to one another. Finally, the research herein supports the strength and generality of evolutionary game theory as a backbone for multiagent learning.
引用
收藏
页码:423 / 457
页数:35
相关论文
共 50 条
  • [21] Graphical Evolutionary Game Theoretic Modeling of Strategy Evolution Over Heterogeneous Networks
    Li, Yuejiang
    Zhao, H. Vicky
    Chen, Yan
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2022, 8 : 739 - 754
  • [22] Promoting Construction Labor Professionalization: An Evolutionary Game Perspective
    Chen, Wei
    Yang, Zhuzhang
    Yan, Hang
    Zhao, Ying
    SUSTAINABILITY, 2023, 15 (12)
  • [23] A Heuristic Evolutionary Game Theoretic Methodology for Conjunctive Use of Surface and Groundwater Resources
    Parna Parsapour-Moghaddam
    Armaghan Abed-Elmdoust
    Reza Kerachian
    Water Resources Management, 2015, 29 : 3905 - 3918
  • [24] The ecology of cancer from an evolutionary game theory perspective
    Pacheco, Jorge M.
    Santos, Francisco C.
    Dingli, David
    INTERFACE FOCUS, 2014, 4 (04)
  • [25] A Heuristic Evolutionary Game Theoretic Methodology for Conjunctive Use of Surface and Groundwater Resources
    Parsapour-Moghaddam, Parna
    Abed-Elmdoust, Armaghan
    Kerachian, Reza
    WATER RESOURCES MANAGEMENT, 2015, 29 (11) : 3905 - 3918
  • [26] Evolutionary Game Theoretic Analysis of Advanced Persistent Threats Against Cloud Storage
    Abass, Ahmed A. Alabdel
    Xiao, Liang
    Mandayam, Narayan B.
    Gajic, Zoran
    IEEE ACCESS, 2017, 5 : 8482 - 8491
  • [27] Stable federated fog formation: An evolutionary game theoretical approach
    Hammoud, Ahmad
    Otrok, Hadi
    Mourad, Azzam
    Dziong, Zbigniew
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 124 : 21 - 32
  • [28] An Evolutionary Game-Theoretic Approach for Base Station Allocation in Wireless Femtocell Networks
    Azadeh Pourkabirian
    Mehdi Dehghan Takht Fooladi
    Esmaeil Zeinali Khosraghi
    Amir Masoud Rahmani
    Wireless Personal Communications, 2019, 107 : 217 - 242
  • [29] An Evolutionary Game Theoretic Approach to Multi-Sector Coordination and Self-Organization
    Santos, Fernando P.
    Encarnacao, Sara
    Santos, Francisco C.
    Portugali, Juval
    Pacheco, Jorge M.
    ENTROPY, 2016, 18 (04)
  • [30] An Evolutionary Game-Theoretic Approach for Base Station Allocation in Wireless Femtocell Networks
    Pourkabirian, Azadeh
    Fooladi, Mehdi Dehghan Takht
    Khosraghi, Esmaeil Zeinali
    Rahmani, Amir Masoud
    WIRELESS PERSONAL COMMUNICATIONS, 2019, 107 (01) : 217 - 242