Closely Cooperative Multi-Agent Reinforcement Learning Based on Intention Sharing and Credit Assignment

被引:0
|
作者
Fu, Hao [1 ,2 ]
You, Mingyu [1 ,2 ]
Zhou, Hongjun [1 ,2 ]
He, Bin [1 ,2 ]
机构
[1] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, Coll Elect & Informat Engn, Shanghai 200070, Peoples R China
[2] Frontiers Sci Ctr Intelligent Autonomous Syst, State Key Lab Intelligent Autonomous Syst, Shanghai Key Lab Intelligent Autonomous Syst, Shanghai 201203, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 12期
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Collaboration; Encoding; Training; Multi-agent systems; Autonomous systems; Mutual information; Decision making; Trajectory; Synchronization; MARL; closely collaborative tasks; intention sharing; credit assignment;
D O I
10.1109/LRA.2024.3497661
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Collaborative tasks are important in multi-agent systems. Multi-agent reinforcement learning is a commonly used technique for solving multi-agent cooperative policy learning. The closely collaborative task is a special but common case within cooperative tasks, where the change in the environmental state requires multiple agents to simultaneously perform specific actions. For example, in a box-pushing task where the boxes are heavy and require multiple agents to push simultaneously. The closely cooperative task faces some unique challenges. Firstly, the completion of a closely collaborative task requires agents to synchronize their actions, necessitating a consistent intention among them. Secondly, when some agents' erroneous actions lead to task failure, it becomes a challenge to avoid incorrectly penalizing agents who performed the correct actions. These challenges make most of the existing MARL methods perform poorly on this task. In this letter, we propose a closely collaborative multi-agent reinforcement learning(CC-MARL) algorithm based on intention sharing and credit assignment. We use a two-phase training to learn intention encoding and intention sharing respectively, and decompose joint action values based on counterfactual baseline ideas. We deployed scenarios in both simulated and real environments with various sizes, numbers of boxes, and numbers of agents and compare CC-MARL with various classical MARL algorithms on box-pushing tasks of different map scales in simulation, demonstrating the state-of-the-art of our method.
引用
收藏
页码:11770 / 11777
页数:8
相关论文
共 50 条
  • [31] Centralized reinforcement learning for multi-agent cooperative environments
    Lu, Chengxuan
    Bao, Qihao
    Xia, Shaojie
    Qu, Chongxiao
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 267 - 273
  • [32] Swarm Reinforcement Learning for traffic signal control based on cooperative multi-agent framework
    Tahifa, Mohammed
    Boumhidi, Jaouad
    Yahyaouy, Ali
    2015 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2015,
  • [33] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    Zhang Jiandong
    Yang Qiming
    Shi Guoqing
    Lu Yi
    Wu Yong
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (06) : 1421 - 1438
  • [34] Critic learning in multi agent credit assignment problem
    Rahaie, Zahra
    Beigy, Hamid
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 30 (06) : 3465 - 3480
  • [35] Negotiation agent based on Deep reinforcement learning for multi-agent cooperative distributed predictive control.
    Aponte-Rengifo, O.
    Francisco, M.
    Vega, P.
    IFAC PAPERSONLINE, 2023, 56 (02): : 1496 - 1501
  • [36] A bankruptcy based approach to solving multi-agent credit assignment problem
    Yarahmadi, Hossein
    Shiri, Mohammad Ebrahim
    Navidi, Hamidreza
    Sharifi, Arash
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2021, 12 : 1987 - 2018
  • [37] Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph
    Jing, Gangshan
    Bai, He
    George, Jemin
    Chakrabortty, Aranya
    Sharma, Piyush K.
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3273 - 3278
  • [38] Multi-Agent Reinforcement Learning for Traffic Signal Control: A Cooperative Approach
    Kolat, Mate
    Kovari, Balint
    Becsi, Tamas
    Aradi, Szilard
    SUSTAINABILITY, 2023, 15 (04)
  • [39] A Cooperative Spectrum Sensing With Multi-Agent Reinforcement Learning Approach in Cognitive Radio Networks
    Gao, Ang
    Du, Chengyuan
    Ng, Soon Xin
    Liang, Wei
    IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2604 - 2608
  • [40] Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning
    Liu, Jiaqi
    Xu, Chengkai
    Hang, Peng
    Sun, Jian
    Ding, Mingyu
    Zhan, Wei
    Tomizuka, Masayoshi
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (05): : 4292 - 4299