Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning

被引:0
|
作者
Zhang, Yanqiang [1 ]
Feng, Dawei [1 ]
Ding, Bo [1 ]
机构
[1] Natl Univ Def Technol, Natl Lab Parallel & Distributed Proc, Changsha 410073, Peoples R China
来源
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II | 2024年 / 14448卷
关键词
Multi-agent Systems; Reinforcement Learning; Intrinsic Reward;
D O I
10.1007/978-981-99-8082-6_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-agent reinforcement learning methods have shown significant progress, however, they continue to exhibit exploration problems in complex and challenging environments. To address the above issue, current research has introduced several exploration-enhanced methods for multi-agent reinforcement learning, they are still faced with the issues of inefficient exploration and low performance in challenging tasks that necessitate complex cooperation among agents. This paper proposes the prediction-action Qmix (PQmix) method, an action prediction-based multi-agent intrinsic reward construction approach. The PQmix method employs the joint local observation of agents and the next joint local observation after executing actions to predict the real joint action of agents. The method calculates the action prediction error as the intrinsic reward to measure the novel of the joint state and encourages agents to actively explore the action and state spaces in the environment. We compare PQmix with strong baselines on the MARL benchmark to validate it. The result of experiments demonstrates that PQmix outperforms the state-of-the-art algorithms on the StarCraft Multi-Agent Challenge (SMAC). In the end, the stability of the method is verified by experiments.
引用
收藏
页码:358 / 372
页数:15
相关论文
共 50 条
  • [31] WRFMR: A Multi-Agent Reinforcement Learning Method for Cooperative Tasks
    Liu, Hui
    Zhang, Zhen
    Wang, Dongqing
    IEEE ACCESS, 2020, 8 : 216320 - 216331
  • [32] Multi-agent reinforcement learning: A survey
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1133 - +
  • [33] Multi-agent Reinforcement Learning for Task Allocation in Cooperative Edge Cloud Computing
    Ding, Shiyao
    SERVICE-ORIENTED COMPUTING, ICSOC 2021 WORKSHOPS, 2022, 13236 : 283 - 297
  • [34] Cooperative Action Acquisition Based on Intention Estimation Method in a Multi-agent Reinforcement Learning System
    Tsubakimoto, Tatsuya
    Kobayashi, Kunikazu
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB 2014), 2014, : 122 - 125
  • [35] Cooperative Multi-Agent Jamming of Multiple Rogue Drones Using Reinforcement Learning
    Valianti, Panayiota
    Malialis, Kleanthis
    Kolios, Panayiotis
    Ellinas, Georgios
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 12345 - 12359
  • [36] Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features
    Liu, Wenzhang
    Dong, Lu
    Niu, Dan
    Sun, Changyin
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (09) : 1673 - 1686
  • [37] Enhanced Cooperative Multi-agent Learning Algorithms (ECMLA) using Reinforcement Learning
    Vidhate, Deepak A.
    Kulkarni, Parag
    2016 INTERNATIONAL CONFERENCE ON COMPUTING, ANALYTICS AND SECURITY TRENDS (CAST), 2016, : 556 - 561
  • [38] Cooperative reinforcement learning in topology-based multi-agent systems
    Dan Xiao
    Ah-Hwee Tan
    Autonomous Agents and Multi-Agent Systems, 2013, 26 : 86 - 119
  • [39] Cooperative Multi-Agent Reinforcement Learning with Constraint-Reduced DCOP
    Yi Xie
    Zhongyi Liu
    Zhao Liu
    Yijun Gu
    Journal of Beijing Institute of Technology, 2017, 26 (04) : 525 - 533
  • [40] Distributed cooperative reinforcement learning for multi-agent system with collision avoidance
    Lan, Xuejing
    Yan, Jiapei
    He, Shude
    Zhao, Zhijia
    Zou, Tao
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (01) : 567 - 585