Multi-Agent Reinforcement Learning: A Review of Challenges and Applications

被引:154
作者
Canese, Lorenzo [1 ]
Cardarilli, Gian Carlo [1 ]
Di Nunzio, Luca [1 ]
Fazzolari, Rocco [1 ]
Giardino, Daniele [1 ]
Re, Marco [1 ]
Spano, Sergio [1 ]
机构
[1] Univ Roma Tor Vergata, Dept Elect Engn, Via Politecn 1, I-00133 Rome, Italy
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 11期
关键词
machine learning; reinforcement learning; multi-agent; swarm; FRAMEWORK; SHOGI; CHESS; GO;
D O I
10.3390/app11114948
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this review, we present an analysis of the most used multi-agent reinforcement learning algorithms. Starting with the single-agent reinforcement learning algorithms, we focus on the most critical issues that must be taken into account in their extension to multi-agent scenarios. The analyzed algorithms were grouped according to their features. We present a detailed taxonomy of the main multi-agent approaches proposed in the literature, focusing on their related mathematical models. For each algorithm, we describe the possible application fields, while pointing out its pros and cons. The described multi-agent algorithms are compared in terms of the most important characteristics for multi-agent reinforcement learning applications-namely, nonstationarity, scalability, and observability. We also describe the most common benchmark environments used to evaluate the performances of the considered methods.
引用
收藏
页数:25
相关论文
共 68 条
[1]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[2]  
Bengio Y., 2009, P 26 ANN INT C MACH, DOI DOI 10.1145/1553374.15533802,5
[3]   The complexity of decentralized control of Markov decision processes [J].
Bernstein, DS ;
Givan, R ;
Immerman, N ;
Zilberstein, S .
MATHEMATICS OF OPERATIONS RESEARCH, 2002, 27 (04) :819-840
[4]  
Bloembergen D., 2010, LENIENT FREQUENCY AD, P19
[5]  
Calvo A., 2018, P 26 IRISH C ARTIFIC
[6]  
Cardarilli G.C., 2020, P 2020 ASILOMAR C SI
[7]  
Claus C, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P746
[8]   Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks [J].
Cui, Jingjing ;
Liu, Yuanwei ;
Nallanathan, Arumugam .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) :729-743
[9]   Multi-Agent Reinforcement Learning Approach for Residential Microgrid Energy Scheduling [J].
Fang, Xiaohan ;
Wang, Jinkuan ;
Song, Guanru ;
Han, Yinghua ;
Zhao, Qiang ;
Cao, Zhiao .
ENERGIES, 2020, 13 (01)
[10]  
Foerster J., 2018, P AAAI 2018 32 AAAI