Closely Cooperative Multi-Agent Reinforcement Learning Based on Intention Sharing and Credit Assignment

被引:0
|
作者
Fu, Hao [1 ,2 ]
You, Mingyu [1 ,2 ]
Zhou, Hongjun [1 ,2 ]
He, Bin [1 ,2 ]
机构
[1] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, Coll Elect & Informat Engn, Shanghai 200070, Peoples R China
[2] Frontiers Sci Ctr Intelligent Autonomous Syst, State Key Lab Intelligent Autonomous Syst, Shanghai Key Lab Intelligent Autonomous Syst, Shanghai 201203, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 12期
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Collaboration; Encoding; Training; Multi-agent systems; Autonomous systems; Mutual information; Decision making; Trajectory; Synchronization; MARL; closely collaborative tasks; intention sharing; credit assignment;
D O I
10.1109/LRA.2024.3497661
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Collaborative tasks are important in multi-agent systems. Multi-agent reinforcement learning is a commonly used technique for solving multi-agent cooperative policy learning. The closely collaborative task is a special but common case within cooperative tasks, where the change in the environmental state requires multiple agents to simultaneously perform specific actions. For example, in a box-pushing task where the boxes are heavy and require multiple agents to push simultaneously. The closely cooperative task faces some unique challenges. Firstly, the completion of a closely collaborative task requires agents to synchronize their actions, necessitating a consistent intention among them. Secondly, when some agents' erroneous actions lead to task failure, it becomes a challenge to avoid incorrectly penalizing agents who performed the correct actions. These challenges make most of the existing MARL methods perform poorly on this task. In this letter, we propose a closely collaborative multi-agent reinforcement learning(CC-MARL) algorithm based on intention sharing and credit assignment. We use a two-phase training to learn intention encoding and intention sharing respectively, and decompose joint action values based on counterfactual baseline ideas. We deployed scenarios in both simulated and real environments with various sizes, numbers of boxes, and numbers of agents and compare CC-MARL with various classical MARL algorithms on box-pushing tasks of different map scales in simulation, demonstrating the state-of-the-art of our method.
引用
收藏
页码:11770 / 11777
页数:8
相关论文
共 50 条
  • [41] Credit-of-Q-value for Multi-Agent Reinforcement Learning
    Li, Shuaibin
    Li, Xiu
    Cui, Jinqiang
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6982 - 6988
  • [42] Addition of Learning to Critic Agent as a Solution to the Multi-Agent Credit Assignment Problem
    Rahaie, Zahra
    Beigy, Hamid
    2009 FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING, COMPUTING WITH WORDS AND PERCEPTIONS IN SYSTEM ANALYSIS, DECISION AND CONTROL, 2010, : 219 - 222
  • [43] Smart Shield: Prevent Aerial Eavesdropping via Cooperative Intelligent Jamming Based on Multi-Agent Reinforcement Learning
    Wang, Qubeijian
    Tang, Shiyue
    Sun, Wen
    Zhang, Yin
    Sun, Geng
    Dai, Hong-Ning
    Guizani, Mohsen
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (04) : 2995 - 3011
  • [44] Hierarchical Multi-Agent Training Based on Reinforcement Learning
    Wang, Guanghua
    Li, Wenjie
    Wu, Zhanghua
    Guo, Xian
    2024 9TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS, ACIRS, 2024, : 11 - 18
  • [45] Rethinking Exploration and Experience Exploitation in Value-Based Multi-Agent Reinforcement Learning
    Borzilov, Anatolii
    Skrynnik, Alexey
    Panov, Aleksandr
    IEEE ACCESS, 2025, 13 : 13770 - 13781
  • [46] Cooperative Reinforcement Learning Algorithm to Distributed Power System Based on Multi-Agent
    Gao, La-mei
    Zeng, Jun
    Wu, Jie
    Li, Min
    2009 3RD INTERNATIONAL CONFERENCE ON POWER ELECTRONICS SYSTEMS AND APPLICATIONS: ELECTRIC VEHICLE AND GREEN ENERGY, 2009, : 53 - 53
  • [47] Multi-agent Reinforcement Learning Based on K-Means Algorithm
    Liu Changan
    Liu Fei
    Liu Chunyang
    Wu Hua
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 414 - 418
  • [48] Reinforcement Learning Approach for Cooperative Control of Multi-Agent Systems
    Javalera-Rincon, Valeria
    Puig Cayuela, Vicenc
    Morcego Seix, Bernardo
    Orduna-Cabrera, Fernando
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 80 - 91
  • [49] Transform networks for cooperative multi-agent deep reinforcement learning
    Hongbin Wang
    Xiaodong Xie
    Lianke Zhou
    Applied Intelligence, 2023, 53 : 9261 - 9269
  • [50] Transform networks for cooperative multi-agent deep reinforcement learning
    Wang, Hongbin
    Xie, Xiaodong
    Zhou, Lianke
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9261 - 9269