Attentive Relational State Representation in Decentralized Multiagent Reinforcement Learning

被引:9
作者
Liu, Xiangyu [1 ,2 ]
Tan, Ying [1 ,2 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Machine Percept, Minist Educ, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Dept Machine Intelligence, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Aggregates; Multi-agent systems; Reinforcement learning; Task analysis; Protocols; Decision making; Topology; Agent modeling; attentive relational encoder (ARE); decentralized learning; multiagent reinforcement learning (MARL); state representation; COMPREHENSIVE SURVEY;
D O I
10.1109/TCYB.2020.2979803
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In multiagent reinforcement learning (MARL), it is crucial for each agent to model the relation with its neighbors. Existing approaches usually resort to concatenate the features of multiple neighbors, fixing the size and the identity of the inputs. But these settings are inflexible and unscalable. In this article, we propose an attentive relational encoder (ARE), which is a novel scalable feedforward neural module, to attentionally aggregate an arbitrary-sized neighboring feature set for state representation in the decentralized MARL. The ARE actively selects the relevant information from the neighboring agents and is permutation invariant, computationally efficient, and flexible to interactive multiagent systems. Our method consistently outperforms the latest competing decentralized MARL methods in several multiagent tasks. In particular, it shows strong cooperative performance in challenging StarCraft micromanagement tasks and achieves over a 96% winning rate against the most difficult noncheating built-in artificial intelligence bots.
引用
收藏
页码:252 / 264
页数:13
相关论文
共 53 条
[1]   Autonomous agents modelling other agents: A comprehensive survey and open problems [J].
Albrecht, Stefano V. ;
Stone, Peter .
ARTIFICIAL INTELLIGENCE, 2018, 258 :66-95
[2]  
[Anonymous], 2018, ARXIV180907193
[3]  
[Anonymous], 2015, PREPRINT
[4]  
[Anonymous], 2002, P 19 INT C MACHINE L
[5]  
[Anonymous], 2017, ARXIV170603235
[6]  
Bahdanau D., 2015, ICLR
[7]  
Bernstein D. S., 2000, P 16 C UNCERTAINTY A, P32
[8]  
Breese J.S., 1998, P C UNC ART INT, P43
[9]   Local Community Mining on Distributed and Dynamic Networks From a Multiagent Perspective [J].
Bu, Zhan ;
Wu, Zhiang ;
Cao, Jie ;
Jiang, Yichuan .
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (04) :986-999
[10]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172