Multi-Agent Reinforcement Learning: A Review of Challenges and Applications

被引：192

作者：

Canese, Lorenzo ^{[1
]}

Cardarilli, Gian Carlo ^{[1
]}

Di Nunzio, Luca ^{[1
]}

Fazzolari, Rocco ^{[1
]}

Giardino, Daniele ^{[1
]}

Re, Marco ^{[1
]}

Spano, Sergio ^{[1
]}

机构：

[1] Univ Roma Tor Vergata, Dept Elect Engn, Via Politecn 1, I-00133 Rome, Italy

来源：

APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 11期

关键词：

machine learning; reinforcement learning; multi-agent; swarm; FRAMEWORK; SHOGI; CHESS; GO;

D O I：

10.3390/app11114948

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

In this review, we present an analysis of the most used multi-agent reinforcement learning algorithms. Starting with the single-agent reinforcement learning algorithms, we focus on the most critical issues that must be taken into account in their extension to multi-agent scenarios. The analyzed algorithms were grouped according to their features. We present a detailed taxonomy of the main multi-agent approaches proposed in the literature, focusing on their related mathematical models. For each algorithm, we describe the possible application fields, while pointing out its pros and cons. The described multi-agent algorithms are compared in terms of the most important characteristics for multi-agent reinforcement learning applications-namely, nonstationarity, scalability, and observability. We also describe the most common benchmark environments used to evaluate the performances of the considered methods.

引用

页数：25

共 68 条

[1]

[Anonymous], 2018, Fully decentralized multi-agent reinforcement learning with networked agents

[2]

[Anonymous], 1989, LEARNING DELAYED REW

[3]

Arjovsky M, 2017, PR MACH LEARN RES, V70

[4]

Bengio Y., 2009, P 26 ANN INT C MACH, P41

[5] The complexity of decentralized control of Markov decision processes [J].

Bernstein, DS ;

Givan, R ;

Immerman, N ;

Zilberstein, S .

MATHEMATICS OF OPERATIONS RESEARCH, 2002, 27 (04) :819-840

[6]

Bloembergen D., 2010, LENIENT FREQUENCY AD, P19

[7]

Calvo A., 2018, P 26 IRISH C ARTIFIC

[8]

Cardarilli G.C., 2020, P 2020 ASILOMAR C SI

[9]

Claus C, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P746

[10] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks [J].

Cui, Jingjing ;

Liu, Yuanwei ;

Nallanathan, Arumugam .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) :729-743

← 1 2 3 4 5 6 7 →