Autonomous Cooperative Hunting with Rule-Based and Self-Learning Control for Multiagent Systems

被引:0
|
作者
Luo, Jiaxiang [1 ,2 ]
Xu, Bozhe [1 ]
Li, Xiangyang [1 ,3 ]
Yao, Zhannan [1 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China
[2] Minist Educ, Engn Ctr Precis Elect Mfg Equipment, Guangzhou, Peoples R China
[3] Minist Educ, Key Lab Autonomous Syst & Networked Control, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Multiagent system; Cooperative control; Reinforcement learning; Imitation learning; Collision avoidance; GROUP-SIZE; PURSUIT; SUCCESS;
D O I
10.1007/s10846-024-02177-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers the problem of autonomous cooperative hunting in an unknown dynamic environment, where a group of mobile agents collaborate to capture a moving target. Due to the decentralized decision-making nature of multi-agent systems and the presence of real-world constraints, it is a challenging task. To solve this problem, an artificial rule based hunting algorithm (AR-HA) is firstly developed based on the principles of attraction and repulsion with heading adjustment, and each agent is controlled by the designed rules. Then, to further enhance the stability of cooperative hunting, a self-learning algorithm based on Twin Delayed Deep Deterministic policy gradient (SL-TD3) is proposed. Each agent is governed by its own SL-TD3 controller and learns independently from its interaction with the environment, taking advantage of the reward function designed based on the control rules of AR-HA. Besides, in order to improve training efficiency, imitation learning is employed to initialize the actor network. Experiments on both virtual and real robots demonstrate the effectiveness of the proposed algorithms for autonomous cooperative hunting.
引用
收藏
页数:20
相关论文
共 50 条
  • [11] Multiagent association rules mining in cooperative learning systems
    Alhaj, R
    Kaya, M
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 75 - 87
  • [12] Cooperative Control-Based Task Assignments for Multiagent Systems With Intermittent Communication
    Wang, Bohui
    Chen, Weisheng
    Zhang, Bin
    Zhao, Yu
    Shi, Peng
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (10) : 6697 - 6708
  • [13] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Ana L. C. Bazzan
    Autonomous Agents and Multi-Agent Systems, 2009, 18 : 342 - 375
  • [14] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Bazzan, Ana L. C.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) : 342 - 375
  • [15] A review of rule-based collision avoidance technology for autonomous UAV
    JinWen Hu
    Teng Wang
    HaoZhe Zhang
    Quan Pan
    JianDong Zhang
    Zhao Xu
    Science China Technological Sciences, 2023, 66 : 2481 - 2499
  • [16] Optimization control of UAVs based on self-learning adaptive dynamic programming
    Ye, Shuai
    Zhou, Ying-Jiang
    Jiang, Guo-Ping
    Lin, Qiong
    2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 738 - 743
  • [17] A review of rule-based collision avoidance technology for autonomous UAV
    Hu, Jinwen
    Wang, Teng
    Zhang, Haozhe
    Pan, Quan
    Zhang, Jiandong
    Xu, Zhao
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2023, 66 (09) : 2481 - 2499
  • [18] Persistent rule-based interactive reinforcement learning
    Adam Bignold
    Francisco Cruz
    Richard Dazeley
    Peter Vamplew
    Cameron Foale
    Neural Computing and Applications, 2023, 35 : 23411 - 23428
  • [19] Cooperative Tracking Control of Nonlinear Multiagent Systems Using Self-Structuring Neural Networks
    Chen, Gang
    Song, Yong-Duan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (08) : 1496 - 1507
  • [20] Persistent rule-based interactive reinforcement learning
    Bignold, Adam
    Cruz, Francisco
    Dazeley, Richard
    Vamplew, Peter
    Foale, Cameron
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (32): : 23411 - 23428