Multi-agent reinforcement learning with weak ties☆

被引:0
作者
Wang, Huan [1 ,2 ]
Zhou, Xu [1 ]
Kang, Yu [3 ]
Xue, Jian [4 ]
Yang, Chenguang [5 ]
Liu, Xiaofeng [1 ]
机构
[1] Hohai Univ, Coll Artificial Intelligence & Automat, Changzhou 213022, Peoples R China
[2] Chuzhou Polytechig, Sch Informat Engn, Chuzhou 239000, Peoples R China
[3] Univ Sci & Technol China, Sch Automat Engn, Hefei 230026, Peoples R China
[4] Univ Chinese Acad Sci, Sch Engn Sci, Beijing 100049, Peoples R China
[5] UWE, Bristol Robot Lab, Bristol BS161QY, England
关键词
Weak ties; Graph modeling; Information exchange; Optimal policy; Multi-agent reinforcement learning; COMMUNICATION; STRENGTH; SYSTEMS;
D O I
10.1016/j.inffus.2025.102942
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing multi-agent reinforcement learning (MARL) algorithms focus primarily on maximizing global game gains or encouraging cooperation between agents, often overlooking the weak ties between them. In multi- agent environments, the quality of the information exchanged is crucial for optimal policy learning. To this end, we propose a novel MARL framework that integrates weak-tie theory with graph modeling to forma weak- tie modeling module. We use the distribution of tie strengths and the dominant agent which is computed based on tie graph to control the information exchange between agents. Our method is evaluated against various baseline models indifferent multi-agent environments. Experimental results show that our method significantly improves the adversarial win rates and rewards of agents, and reduces the combat losses of agents in confrontation. Our method provides insights into how to reduce information redundancy in the training of large-scale agents.
引用
收藏
页数:12
相关论文
共 37 条
  • [31] Weixun W., 2020, INT C LEARN REPR ICM, P2048
  • [32] Yang M., 2023, 28 INT C DAT SYST AD
  • [33] Causal inference multi-agent reinforcement learning for traffic signal control
    Yang, Shantian
    Yang, Bo
    Zeng, Zheng
    Kang, Zhongfeng
    [J]. INFORMATION FUSION, 2023, 94 : 243 - 256
  • [34] An inductive heterogeneous graph attention-based multi-agent deep graph infomax algorithm for adaptive traffic signal control
    Yang, Shantian
    Yang, Bo
    [J]. INFORMATION FUSION, 2022, 88 : 249 - 262
  • [35] Yu C., 2021, CoRR abs/2103.01955v2
  • [36] Yuan L, 2022, AAAI CONF ARTIF INTE, P9466
  • [37] Zhang Y, 2021, AAAI CONF ARTIF INTE, V35, P11387