A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications

被引:140
作者
Du, Wei [1 ]
Ding, Shifei [1 ,2 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[2] Minist Educ Peoples Republ, Mine Digitizat Engn Res Ctr, Xuzhou 221116, Jiangsu, Peoples R China
关键词
Deep reinforcement learning; Multi-agent; Game theory; Centralized training and decentralized execution; Communication learning; Agent modeling; SMART GENERATION CONTROL; ALGORITHM;
D O I
10.1007/s10462-020-09938-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning has proved to be a fruitful method in various tasks in the field of artificial intelligence during the last several years. Recent works have focused on deep reinforcement learning beyond single-agent scenarios, with more consideration of multi-agent settings. The main goal of this paper is to provide a detailed and systematic overview of multi-agent deep reinforcement learning methods in views of challenges and applications. Specifically, the preliminary knowledge is introduced first for a better understanding of this field. Then, a taxonomy of challenges is proposed and the corresponding structures and representative methods are introduced. Finally, some applications and interesting future opportunities for multi-agent deep reinforcement learning are given.
引用
收藏
页码:3215 / 3238
页数:24
相关论文
共 121 条
  • [1] Abouheaf M, 2017, 2017 IEEE 5TH INTERNATIONAL SYMPOSIUM ON ROBOTICS AND INTELLIGENT SENSORS (IRIS), P111, DOI 10.1109/IRIS.2017.8250107
  • [2] Autonomous agents modelling other agents: A comprehensive survey and open problems
    Albrecht, Stefano V.
    Stone, Peter
    [J]. ARTIFICIAL INTELLIGENCE, 2018, 258 : 66 - 95
  • [3] [Anonymous], 2017, Value-decomposition networks for cooperative multi-agent learning
  • [4] [Anonymous], 2016, Ijcai
  • [5] [Anonymous], 2015, CoRR
  • [6] [Anonymous], 2017, 3 MULT C REINF LEARN
  • [7] [Anonymous], 2018, Advances in Neural Information Processing Systems
  • [8] [Anonymous], 2016, MULT INT PRIOR COORD
  • [9] [Anonymous], 2017, ARXIV170603235
  • [10] [Anonymous], 2017, Rainbow: Combining improvements in deep reinforcement learning