Autonomous Cooperative Hunting with Rule-Based and Self-Learning Control for Multiagent Systems

被引：0

作者：

Luo, Jiaxiang ^{[1
,2
]}

Xu, Bozhe ^{[1
]}

Li, Xiangyang ^{[1
,3
]}

Yao, Zhannan ^{[1
]}

机构：

[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China

[2] Minist Educ, Engn Ctr Precis Elect Mfg Equipment, Guangzhou, Peoples R China

[3] Minist Educ, Key Lab Autonomous Syst & Networked Control, Guangzhou, Peoples R China

来源：

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS | 2024年 / 110卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Multiagent system; Cooperative control; Reinforcement learning; Imitation learning; Collision avoidance; GROUP-SIZE; PURSUIT; SUCCESS;

D O I：

10.1007/s10846-024-02177-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper considers the problem of autonomous cooperative hunting in an unknown dynamic environment, where a group of mobile agents collaborate to capture a moving target. Due to the decentralized decision-making nature of multi-agent systems and the presence of real-world constraints, it is a challenging task. To solve this problem, an artificial rule based hunting algorithm (AR-HA) is firstly developed based on the principles of attraction and repulsion with heading adjustment, and each agent is controlled by the designed rules. Then, to further enhance the stability of cooperative hunting, a self-learning algorithm based on Twin Delayed Deep Deterministic policy gradient (SL-TD3) is proposed. Each agent is governed by its own SL-TD3 controller and learns independently from its interaction with the environment, taking advantage of the reward function designed based on the control rules of AR-HA. Besides, in order to improve training efficiency, imitation learning is employed to initialize the actor network. Experiments on both virtual and real robots demonstrate the effectiveness of the proposed algorithms for autonomous cooperative hunting.

引用

页数：20

共 50 条

[11] Multiagent association rules mining in cooperative learning systems
Alhaj, R
Kaya, M
ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 75 - 87
[12] Cooperative Control-Based Task Assignments for Multiagent Systems With Intermittent Communication
Wang, Bohui
Chen, Weisheng
Zhang, Bin
Zhao, Yu
Shi, Peng
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (10) : 6697 - 6708
[13] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
Ana L. C. Bazzan
Autonomous Agents and Multi-Agent Systems, 2009, 18 : 342 - 375
[14] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
Bazzan, Ana L. C.
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) : 342 - 375
[15] A review of rule-based collision avoidance technology for autonomous UAV
JinWen Hu
Teng Wang
HaoZhe Zhang
Quan Pan
JianDong Zhang
Zhao Xu
Science China Technological Sciences, 2023, 66 : 2481 - 2499
[16] Optimization control of UAVs based on self-learning adaptive dynamic programming
Ye, Shuai
Zhou, Ying-Jiang
Jiang, Guo-Ping
Lin, Qiong
2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 738 - 743
[17] A review of rule-based collision avoidance technology for autonomous UAV
Hu, Jinwen
Wang, Teng
Zhang, Haozhe
Pan, Quan
Zhang, Jiandong
Xu, Zhao
SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2023, 66 (09) : 2481 - 2499
[18] Persistent rule-based interactive reinforcement learning
Adam Bignold
Francisco Cruz
Richard Dazeley
Peter Vamplew
Cameron Foale
Neural Computing and Applications, 2023, 35 : 23411 - 23428
[19] Cooperative Tracking Control of Nonlinear Multiagent Systems Using Self-Structuring Neural Networks
Chen, Gang
Song, Yong-Duan
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (08) : 1496 - 1507
[20] Persistent rule-based interactive reinforcement learning
Bignold, Adam
Cruz, Francisco
Dazeley, Richard
Vamplew, Peter
Foale, Cameron
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (32): : 23411 - 23428

← 1 2 3 4 5 →