A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications

被引：140

作者：

Du, Wei ^{[1
]}

Ding, Shifei ^{[1
,2
]}

机构：

[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China

[2] Minist Educ Peoples Republ, Mine Digitizat Engn Res Ctr, Xuzhou 221116, Jiangsu, Peoples R China

来源：

ARTIFICIAL INTELLIGENCE REVIEW | 2021年 / 54卷 / 05期

关键词：

Deep reinforcement learning; Multi-agent; Game theory; Centralized training and decentralized execution; Communication learning; Agent modeling; SMART GENERATION CONTROL; ALGORITHM;

D O I：

10.1007/s10462-020-09938-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep reinforcement learning has proved to be a fruitful method in various tasks in the field of artificial intelligence during the last several years. Recent works have focused on deep reinforcement learning beyond single-agent scenarios, with more consideration of multi-agent settings. The main goal of this paper is to provide a detailed and systematic overview of multi-agent deep reinforcement learning methods in views of challenges and applications. Specifically, the preliminary knowledge is introduced first for a better understanding of this field. Then, a taxonomy of challenges is proposed and the corresponding structures and representative methods are introduced. Finally, some applications and interesting future opportunities for multi-agent deep reinforcement learning are given.

引用

页码：3215 / 3238

页数：24

共 121 条

[1] Abouheaf M, 2017, 2017 IEEE 5TH INTERNATIONAL SYMPOSIUM ON ROBOTICS AND INTELLIGENT SENSORS (IRIS), P111, DOI 10.1109/IRIS.2017.8250107
[2] Autonomous agents modelling other agents: A comprehensive survey and open problems
Albrecht, Stefano V.
Stone, Peter
[J]. ARTIFICIAL INTELLIGENCE, 2018, 258 : 66 - 95
[3] [Anonymous], 2017, Value-decomposition networks for cooperative multi-agent learning
[4] [Anonymous], 2016, Ijcai
[5] [Anonymous], 2015, CoRR
[6] [Anonymous], 2017, 3 MULT C REINF LEARN
[7] [Anonymous], 2018, Advances in Neural Information Processing Systems
[8] [Anonymous], 2016, MULT INT PRIOR COORD
[9] [Anonymous], 2017, ARXIV170603235
[10] [Anonymous], 2017, Rainbow: Combining improvements in deep reinforcement learning

← 1 2 3 4 5 6 7 8 9 10 →