Autonomous Cooperative Hunting with Rule-Based and Self-Learning Control for Multiagent Systems

被引:0
|
作者
Luo, Jiaxiang [1 ,2 ]
Xu, Bozhe [1 ]
Li, Xiangyang [1 ,3 ]
Yao, Zhannan [1 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China
[2] Minist Educ, Engn Ctr Precis Elect Mfg Equipment, Guangzhou, Peoples R China
[3] Minist Educ, Key Lab Autonomous Syst & Networked Control, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Multiagent system; Cooperative control; Reinforcement learning; Imitation learning; Collision avoidance; GROUP-SIZE; PURSUIT; SUCCESS;
D O I
10.1007/s10846-024-02177-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers the problem of autonomous cooperative hunting in an unknown dynamic environment, where a group of mobile agents collaborate to capture a moving target. Due to the decentralized decision-making nature of multi-agent systems and the presence of real-world constraints, it is a challenging task. To solve this problem, an artificial rule based hunting algorithm (AR-HA) is firstly developed based on the principles of attraction and repulsion with heading adjustment, and each agent is controlled by the designed rules. Then, to further enhance the stability of cooperative hunting, a self-learning algorithm based on Twin Delayed Deep Deterministic policy gradient (SL-TD3) is proposed. Each agent is governed by its own SL-TD3 controller and learns independently from its interaction with the environment, taking advantage of the reward function designed based on the control rules of AR-HA. Besides, in order to improve training efficiency, imitation learning is employed to initialize the actor network. Experiments on both virtual and real robots demonstrate the effectiveness of the proposed algorithms for autonomous cooperative hunting.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Self-Triggered DMPC Design or Cooperative Multiagent Systems
    Mi, Xiaoxiao
    Zou, Yuanyuan
    Li, Shaoyuan
    Karimi, Hamid Reza
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (01) : 512 - 520
  • [22] Consensus seeking in multiagent cooperative control systems with bounded control input
    Zhang S.
    Duan G.
    Journal of Control Theory and Applications, 2011, 9 (02): : 210 - 214
  • [23] Self-learning Control for Active Network Management
    Perez-Olvera, Julio
    Green, Tim C.
    Junyent-Ferre, Adria
    2021 IEEE MADRID POWERTECH, 2021,
  • [24] Algorithm for Autonomous Power-Increase Operation Using Deep Reinforcement Learning and a Rule-Based System
    Lee, Daeil
    Arigi, Awwal Mohammed
    Kim, Jonghyun
    IEEE ACCESS, 2020, 8 : 196727 - 196746
  • [25] Combining reinforcement learning with rule-based controllers for transparent and general decision-making in autonomous driving
    Likmeta, Amarildo
    Metelli, Alberto Maria
    Tirinzoni, Andrea
    Giol, Riccardo
    Restelli, Marcello
    Romano, Danilo
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2020, 131 (131)
  • [26] Experience Based Learning in Policy Control of Multiagent System
    Damba, Ariuna
    Watanabe, Shigeyoshi
    2008 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 1209 - 1214
  • [27] A Comparison of Grouping Behaviors on Rule-Based and Learning-Based Multi-agent Systems
    Ueyama, Akihiro
    Isokawa, Teijiro
    Nishimura, Haruhiko
    Matsui, Nobuyuki
    RECENT ADVANCES IN NATURAL COMPUTING, 2016, 14 : 27 - 40
  • [28] Hierarchical Cooperative Control for Multiagent Systems With Switching Directed Topologies
    Hu, Jianqiang
    Cao, Jinde
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) : 2453 - 2463
  • [29] A survey on cooperative fault-tolerant control for multiagent systems
    Zhang, Pu
    Zhao, Di
    Kong, Xiangjie
    Zhang, Jialong
    Li, Lei
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (06): : 1431 - 1448
  • [30] Prescribed Performance Cooperative Control for Multiagent Systems With Input Quantization
    Liang, Hongjing
    Zhang, Yanhui
    Huang, Tingwen
    Ma, Hui
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (05) : 1810 - 1819