A Projection-based Exploration Method for Multi-Agent Coordination

被引:0
|
作者
Tang, Hainan [1 ]
Liu, Juntao [1 ]
Wang, Zhenjie [1 ]
Gao, Ziwen [1 ]
Li, You [2 ]
机构
[1] Wuhan Digital Engn Inst, Wuhan, Hubei, Peoples R China
[2] Hubei Univ, Wuhan, Hubei, Peoples R China
关键词
Projection Exploration; Multi-agent Coordination; Maximum distribution entropy;
D O I
10.1145/3669721.3669723
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-agent reinforcement learning (MARL), states with high exploration value are difficult to be identified and coordinately visited, resulting in low learning efficiency. To this end, a projection-based exploration method for multi-agent coordination (PEMAC) is proposed. Goal states are selected using the count-based approach in the optimal projection space, of which the entropy of state distribution is maximal. Then, by reshaping the rewards in the replay buffer, agents are trained to visit those high-value states in a coordinated manner. In order to verify the effectiveness of the proposed method, comparative experiments are conducted in the multi-particle environment (MPE), in which dense-reward and sparse-reward settings are all both considered. Corresponding results suggest that PEMAC can effectively improve learning efficiency.
引用
收藏
页码:8 / 14
页数:7
相关论文
共 50 条
  • [31] A Projection-based Hotspot Analysis Method
    Ren, Chao
    Li, Rui
    Li, Meng
    Li, Caihong
    PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 2066 - 2069
  • [32] Projection-based coordination control of automated manufacturing systems
    Sanchez, A.
    Llamas, L.
    Gonzalez, K.
    2007 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING, 2007, : 171 - 174
  • [33] A Projection-Based Method for Shape Measurement
    Nguyen, Thanh Phuong
    Nguyen, Xuan Son
    Borgi, Mohamed Anouar
    Nguyen, M. K.
    JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2020, 62 (04) : 489 - 504
  • [34] A Centralized Strategy for Multi-Agent Exploration
    Gul, Faiza
    Mir, Adnan
    Mir, Imran
    Mir, Suleman
    Islaam, Tauqeer Ul
    Abualigah, Laith
    Forestiero, Agostino
    IEEE Access, 2022, 10 : 126871 - 126884
  • [35] Rational coordination in multi-agent environments
    Gmytrasiewicz, PJ
    Durfee, EH
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2000, 3 (04) : 319 - 350
  • [36] Rational Coordination in Multi-Agent Environments
    Piotr J. Gmytrasiewicz
    Edmund H. Durfee
    Autonomous Agents and Multi-Agent Systems, 2000, 3 : 319 - 350
  • [37] Multi-agent coordination by communication of evaluations
    de Jong, E
    MULTI-AGENT RATIONALITY, 1997, 1237 : 63 - 78
  • [38] MAPS: a system for multi-agent coordination
    Tews, A
    Wyeth, G
    ADVANCED ROBOTICS, 2000, 14 (01) : 37 - 50
  • [39] MAVEN: Multi-Agent Variational Exploration
    Mahajan, Anuj
    Rashid, Tabish
    Samvelyan, Mikayel
    Whiteson, Shimon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [40] A Centralized Strategy for Multi-Agent Exploration
    Gul, Faiza
    Mir, Adnan
    Mir, Imran
    Mir, Suleman
    Ul Islaam, Tauqeer
    Abualigah, Laith
    Forestiero, Agostino
    IEEE ACCESS, 2022, 10 : 126871 - 126884