Learning cooperative strategies in multi-agent encirclement games with faster prey using prior knowledge

被引：0

作者：

Li T. ^{[1
]}

Shi D. ^{[1
,2
]}

Wang Z. ^{[1
]}

Yang H. ^{[3
]}

Chen Y. ^{[4
]}

Shi Y. ^{[3
]}

机构：

[1] Intelligent Game and Decision Lab (IGDL), Defense Technology Innovation Institute, Beijing

[2] Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin

[3] National University of Defense Technology, Changsha

[4] Department of Computer Science, Peking University, Beijing

来源：

Neural Computing and Applications | 2024年 / 36卷 / 25期

基金：

中国国家自然科学基金;

关键词：

Apollonius circle; Multi-agent deep reinforcement learning; Multi-agent encirclement with collision avoidance; Prior knowledge; Variance descriptor;

D O I：

10.1007/s00521-024-09727-6

中图分类号：

学科分类号：

摘要：

Multi-agent encirclement with collision avoidance constitutes a common challenge in the multi-agent confrontation domain, wherein the focus lies in the development of cooperative strategies among agents. Previous studies encountered difficulties in addressing the dynamic encirclement of faster prey in obstacles environment. This paper introduces a novel multi-agent deep reinforcement learning approach based on prior knowledge. It is dedicated to exploring the multi-agent encirclement with collision avoidance task involving slower multiple pursuers collaboratively encircling faster prey in an obstacles environment. Firstly, the utilization of the classic Apollonius circle theory as prior knowledge guides agent action selection, narrows the exploratory action space, and accelerates the learning of strategies. Subsequently, the variance descriptor restricts the motion direction of pursuers, thus ensuring that pursuers continuously narrow the encirclement until the prey is successfully encircled. Finally, experiments in an obstacles environment were conducted to validate the proposed method. The results indicate that our method can acquire an effective encirclement strategy, with an encirclement success rate exceeding that of previous methods by more than 10%, and simulation experiment results demonstrate the effectiveness and practicability of our method. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

引用

页码：15829 / 15842

页数：13

共 33 条

[1]

Turetsky V., Shima T., Target evasion from a missile performing multiple switches in guidance law, J Guid Control Dyn, 39, 10, pp. 2364-2373, (2016)

[2]

Perelman A., Shima T., Rusnak I., Cooperative differential games strategies for active aircraft protection from a homing missile, J Guid Control Dyn, 34, 3, pp. 761-773, (2011)

[3]

Camci E., Kayacan E., Game of drones: UAV pursuit-evasion game with type-2 fuzzy logic controllers tuned by reinforcement learning, . In: 2016 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pp. 618-625, (2016)

[4]

Sun Z., Wu H., Shi Y., Yu X., Gao Y., Pei W., Yang Z., Piao H., Hou Y., Multi-agent air combat with two-stage graph-attention communication, Neural Comput Appl, 35, pp. 1-17, (2023)

[5]

Du W., Guo T., Chen J., Li B., Zhu G., Cao X., Cooperative pursuit of unauthorized UAVS in urban airspace via multi-agent reinforcement learning, Transp Res Part C Emerg Technol, 128, (2021)

[6]

Wan K., Wu D., Zhai Y., Li B., Gao X., Hu Z., An improved approach towards multi-agent pursuit-evasion game decision-making using deep reinforcement learning, Entropy, 23, 11, (2021)

[7]

Alexopoulos A., Schmidt T., Badreddin E., Cooperative pursue in pursuit-evasion games with unmanned aerial vehicles, In: 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, pp. 4538-4543, (2015)

[8]

Peng K., Rong H., Qian Y., Agrcnet: communicate by attentional graph relations in multi-agent reinforcement learning for traffic signal control, Neural Comput Appl, 35, 28, pp. 21007-21022, (2023)

[9]

Wishart D., Differential games: a mathematical theory with applications to warfare and pursuit, control and optimization, Phys Bull, 17, 2, (1966)

[10]

Sun W., Tsiotras P., Lolla T., Subramani D.N., Lermusiaux P.F., Multiple-pursuer/one-evader pursuit-evasion game in dynamic flowfields, J Guid Control Dyn, 40, 7, pp. 1627-1637, (2017)

← 1 2 3 4 →