A Multi-group Multi-agent System Based on Reinforcement Learning and Flocking

被引:5
作者
Wang, Gang [1 ]
Xiao, Jian [2 ,3 ]
Xue, Rui [4 ]
Yuan, Yongting [5 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Ctr Robot, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
[3] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou, Peoples R China
[4] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[5] 31435 Res Inst, Shenyang 110000, Peoples R China
基金
中国国家自然科学基金;
关键词
Distributed cooperative reinforcement learning; flocking; group confrontation; multi-group multi-agent system; SENSOR NETWORKS; MOBILE; ALGORITHMS; COVERAGE;
D O I
10.1007/s12555-021-0170-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present an inter-group confrontation and intra-group cooperation method for a predator group and prey group, and construct a multi-group multi-agent system. We model the motion of the prey group using the flocking control algorithm. The prey group can cooperatively avoid predators and maintain the integrity of the group after the predators have been detected. The autonomous decision-making of the predator group is implemented based on the distributed reinforcement learning algorithm. To efficiently share the learning experience among agents in the predator group, a distributed cooperative reinforcement learning algorithm with variable weights is proposed to accelerate the convergence of the learning algorithm. Simulations show the feasibility of this proposed method.
引用
收藏
页码:2364 / 2378
页数:15
相关论文
共 50 条
[41]   Multi-agent Reinforcement Learning for Control Systems: Challenges and Proposals [J].
Grana, Manuel ;
Fernandez-Gauna, Borja .
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 :18-25
[42]   A Tensor Factorization Approach to Generalization in Multi-Agent Reinforcement Learning [J].
Bromuri, Stefano .
2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 2, 2012, :274-281
[43]   A Flocking Algorithm for Multi-agent Control with Multi-leader Following Strategy [J].
Li, Yang ;
Tang, Gong-You ;
Yang, Xi-Xin ;
Wang, Pei-Dong .
PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, :3493-3497
[44]   Flocking control for multi-agent systems with stream-based obstacle avoidance [J].
Wang, Qiang ;
Chen, Jie ;
Fang, Hao ;
Ma, Qian .
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2014, 36 (03) :391-398
[45]   Flocking of multi-agent dynamical systems based on pseudo-leader mechanism [J].
Zhou, Jin ;
Wu, Xiaoqun ;
Yu, Wenwu ;
Small, Michael ;
Lu, Jun-an .
SYSTEMS & CONTROL LETTERS, 2012, 61 (01) :195-202
[46]   Combined Flocking and Region-Based Shape Control for Multi-Agent Systems [J].
Fang, Wenxin ;
Zhao, Jiabao ;
Pan, Yuchen .
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, :3557-3562
[47]   A distributed adaptive policy gradient method based on momentum for multi-agent reinforcement learning [J].
Shi, Junru ;
Wang, Xin ;
Zhang, Mingchuan ;
Liu, Muhua ;
Zhu, Junlong ;
Wu, Qingtao .
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (05) :7297-7310
[48]   Decentralized graph-based multi-agent reinforcement learning using reward machines [J].
Hu, Jueming ;
Xu, Zhe ;
Wang, Weichang ;
Qu, Guannan ;
Pang, Yutian ;
Liu, Yongming .
NEUROCOMPUTING, 2024, 564
[49]   Train timetabling with the general learning environment and multi-agent deep reinforcement learning [J].
Li, Wenqing ;
Ni, Shaoquan .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2022, 157 :230-251
[50]   Flocking of multi-agent systems with multiplicative and independent measurement noises [J].
Sun, Yongzheng ;
Wang, Yajun ;
Zhao, Donghua .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2015, 440 :81-89