A survey of multi-agent deep reinforcement learning with communication

被引:36
作者
Zhu, Changxi [1 ]
Dastani, Mehdi [1 ]
Wang, Shihan [1 ]
机构
[1] Univ Utrecht, Dept Informat & Comp Sci, Utrecht, Netherlands
关键词
Multi-agent reinforcement learning; Deep reinforcement learning; Communication; Survey; COORDINATION;
D O I
10.1007/s10458-023-09633-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Communication is an effective mechanism for coordinating the behaviors of multiple agents, broadening their views of the environment, and to support their collaborations. In the field of multi-agent deep reinforcement learning (MADRL), agents can improve the overall learning performance and achieve their objectives by communication. Agents can communicate various types of messages, either to all agents or to specific agent groups, or conditioned on specific constraints. With the growing body of research work in MADRL with communication (Comm-MADRL), there is a lack of a systematic and structural approach to distinguish and classify existing Comm-MADRL approaches. In this paper, we survey recent works in the Comm-MADRL field and consider various aspects of communication that can play a role in designing and developing multi-agent reinforcement learning systems. With these aspects in mind, we propose 9 dimensions along which Comm-MADRL approaches can be analyzed, developed, and compared. By projecting existing works into the multi-dimensional space, we discover interesting trends. We also propose some novel directions for designing future Comm-MADRL systems through exploring possible combinations of the dimensions.
引用
收藏
页数:48
相关论文
共 119 条
[1]  
Agarwal A, 2020, 19 INT C AUT AG MULT, P1741, DOI DOI 10.48550/ARXIV.1906.01202
[2]   Multimodal Machine Learning: A Survey and Taxonomy [J].
Baltrusaitis, Tadas ;
Ahuja, Chaitanya ;
Morency, Louis-Philippe .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) :423-443
[3]  
Bogin B, 2019, Arxiv, DOI arXiv:1809.00549
[4]   Superhuman AI for multiplayer poker [J].
Brown, Noam ;
Sandholm, Tuomas .
SCIENCE, 2019, 365 (6456) :885-+
[5]  
Brys T, 2014, AAAI CONF ARTIF INTE, P1687
[6]  
Bullard K., 2021, arXiv
[7]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172
[8]  
Busoniu L, 2006, I C CONT AUTOMAT ROB, P1133
[9]   Multi-Agent Systems and Blockchain: Results from a Systematic Literature Review [J].
Calvaresi, Davide ;
Dubovitskaya, Alevtina ;
Calbimonte, Jean Paul ;
Taveter, Kuldar ;
Schumacher, Michael .
ADVANCES IN PRACTICAL APPLICATIONS OF AGENTS, MULTI-AGENT SYSTEMS, AND COMPLEXITY: THE PAAMS COLLECTION, 2018, 10978 :110-126
[10]  
Cao K., 2018, 6 INT C LEARN REPR I