A Projection-based Exploration Method for Multi-Agent Coordination

被引:0
|
作者
Tang, Hainan [1 ]
Liu, Juntao [1 ]
Wang, Zhenjie [1 ]
Gao, Ziwen [1 ]
Li, You [2 ]
机构
[1] Wuhan Digital Engn Inst, Wuhan, Hubei, Peoples R China
[2] Hubei Univ, Wuhan, Hubei, Peoples R China
关键词
Projection Exploration; Multi-agent Coordination; Maximum distribution entropy;
D O I
10.1145/3669721.3669723
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-agent reinforcement learning (MARL), states with high exploration value are difficult to be identified and coordinately visited, resulting in low learning efficiency. To this end, a projection-based exploration method for multi-agent coordination (PEMAC) is proposed. Goal states are selected using the count-based approach in the optimal projection space, of which the entropy of state distribution is maximal. Then, by reshaping the rewards in the replay buffer, agents are trained to visit those high-value states in a coordinated manner. In order to verify the effectiveness of the proposed method, comparative experiments are conducted in the multi-particle environment (MPE), in which dense-reward and sparse-reward settings are all both considered. Corresponding results suggest that PEMAC can effectively improve learning efficiency.
引用
收藏
页码:8 / 14
页数:7
相关论文
共 50 条
  • [41] Multi-Agent Coordination with Lagrangian Measurements
    Grushkovskaya, Victoria
    Ebenbauer, Christian
    IFAC PAPERSONLINE, 2016, 49 (22): : 115 - 120
  • [42] Multi-Agent Coordination for DER in MicroGrid
    Logenthiran, T.
    Srinivasan, Dipti
    Wong, David
    2008 IEEE INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY TECHNOLOGIES (ICSET), VOLS 1 AND 2, 2008, : 77 - 82
  • [43] A multi-agent approach to environment exploration
    Maio, D
    Rizzi, S
    INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 1996, 5 (2-3): : 213 - 250
  • [44] A hybrid system for multi-agent exploration
    Leung, C
    Al-Jumaily, A
    2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 209 - 213
  • [45] TRANSFER LEARNING FOR MULTI-AGENT COORDINATION
    Vrancx, Peter
    De Hauwere, Yann-Michael
    Nowe, Ann
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2011, : 263 - 272
  • [46] Coordination in multi-agent RoboCup teams
    Candea, C
    Hu, HS
    Iocchi, L
    Nardi, D
    Piaggio, M
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2001, 36 (02) : 67 - 86
  • [47] Multi-Agent Flag Coordination Games
    Marzagao, David Kohan
    Rivera, Nicolas
    Cooper, Colin
    McBurney, Peter
    Steinhofel, Kathleen
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1442 - 1450
  • [48] Multi-agent Exploration with Reinforcement Learning
    Sygkounas, Alkis
    Tsipianitis, Dimitris
    Nikolakopoulos, George
    Bechlioulis, Charalampos P.
    2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635
  • [49] Coordination in introspective multi-agent systems
    Charif, Yasmine
    Sabouret, Nicolas
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY (IAT 2007), 2007, : 412 - +
  • [50] Intelligent Multi-agent Coordination and Learning
    Chang, Yu-Cheng
    Dostovalova, Anna
    Lin, Chin-Teng
    Kim, Jijoong
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 1431 - 1436