Efficient Communication via Self-Supervised Information Aggregation for Online and Offline Multiagent Reinforcement Learning

Cited by: 0
Authors
Guan, Cong [1 ,2 ]
Chen, Feng [1 ,2 ]
Yuan, Lei [3 ]
Zhang, Zongzhang [1 ,2 ]
Yu, Yang [3 ]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[2] Nanjing Univ, Sch Artificial Intelligence, Nanjing 210023, Peoples R China
[3] Polixir Technol, Nanjing 211106, Peoples R China
Funding
National Science Foundation (United States);
Keywords
Benchmark testing; Reinforcement learning; Observability; Training; Learning (artificial intelligence); Decision making; Data mining; Cooperative multiagent reinforcement learning (MARL); multiagent communication; offline learning; representation learning;
DOI
10.1109/TNNLS.2024.3420791
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Utilizing messages from teammates can improve coordination in cooperative multiagent reinforcement learning (MARL). Previous works typically combine the raw messages of teammates with local information as inputs to the policy. However, neglecting message aggregation makes policy learning significantly inefficient. Motivated by recent advances in representation learning, we argue that efficient message aggregation is essential for good coordination in cooperative MARL. In this article, we propose Multiagent communication via Self-supervised Information Aggregation (MASIA), in which agents aggregate the received messages into compact, highly relevant representations that augment the local policy. Specifically, we design a permutation-invariant message encoder that generates a common information-aggregated representation from the messages, and we optimize it in a self-supervised manner by reconstructing and shooting future information. Each agent then uses the most relevant parts of the aggregated representation for decision-making through a novel message extraction mechanism. Furthermore, considering the potential of offline learning for real-world applications, we build offline benchmarks for multiagent communication, which are, to our knowledge, the first of their kind. Empirical results demonstrate the superiority of our method in both online and offline settings. We also release these offline benchmarks as a testbed for validating communication ability, to facilitate future research in this direction.
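The abstract does not describe how the permutation-invariant encoder is built, so the following is only a minimal sketch of the property itself, using mean pooling as a stand-in. The function name `aggregate_messages` and the list-of-floats message format are assumptions made for illustration and are not taken from MASIA.

```python
import math
from typing import List, Sequence


def aggregate_messages(messages: Sequence[Sequence[float]]) -> List[float]:
    """Mean-pool equal-length message vectors into one vector.

    Hypothetical helper: mean pooling is a stand-in aggregator here,
    not the encoder described in the abstract.
    """
    if not messages:
        raise ValueError("need at least one message")
    dim = len(messages[0])
    if any(len(m) != dim for m in messages):
        raise ValueError("all messages must have the same length")
    n = len(messages)
    # math.fsum returns a correctly rounded sum, so the result does not
    # depend on the order in which the messages arrive.
    return [math.fsum(m[i] for m in messages) / n for i in range(dim)]


# Three teammates' messages, then the same messages in a different order.
msgs = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
shuffled = [msgs[2], msgs[0], msgs[1]]

print(aggregate_messages(msgs))
print(aggregate_messages(shuffled) == aggregate_messages(msgs))
```

Any pooling that is symmetric in its inputs (sum, mean, max) has this order-independence; a learned encoder would typically apply such pooling after a shared per-message transformation.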
Pages: 13