Sparse communication in multi-agent deep reinforcement learning

Cited by: 1

Authors:

Han, Shuai [1]

Dastani, Mehdi [1]

Wang, Shihan [1]

Affiliations:

[1] Univ Utrecht, Princetonpl 5, NL-3584 CC Utrecht, Netherlands

Source:

NEUROCOMPUTING | 2025 / Vol. 625

Keywords:

Multi-agent deep reinforcement learning; Multi-agent system; Communication learning; Message scheduling; Heterogeneous agents;

DOI:

10.1016/j.neucom.2025.129344

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Learning to communicate efficiently is central to multi-agent deep reinforcement learning (MADRL). Existing methods often require agents to exchange messages intensively, which abuses communication channels and leads to high communication overhead. Only a few methods target learning sparse communication, but they allow only limited information to be shared, which affects the efficiency of policy learning. In this work, we propose a multi-agent deep reinforcement learning framework with a decentralized communication scheduling process. The proposed framework, which we call Model-Based Communication (MBC), employs supervised learning to build a message estimation model. Individual agents use this model to decide whether they have to communicate their local information to other agents: agents do not communicate their local information if the intended messages can be properly estimated by others. The MBC framework enables multiple agents to make decisions with sparse communication. We evaluate our framework in a variety of mixed cooperative-competitive environments in both homogeneous and heterogeneous domains. The experimental results show that MBC improves on the performance of the state-of-the-art baselines in both domains and leads to a lower communication overhead than the baselines.


Pages: 14

