UAV Swarm Cooperative Target Search: A Multi-Agent Reinforcement Learning Approach

Cited by: 28

Authors:

Hou, Yukai [1]

Zhao, Jin [1]

Zhang, Rongqing [1]

Cheng, Xiang [2]

Yang, Liuqing [3,4]

Affiliations:

[1] Tongji Univ, Sch Software Engn, Shanghai 200092, Peoples R China

[2] Peking Univ, Sch Elect, Beijing 100871, Peoples R China

[3] Hong Kong Univ Sci & Technol Guangzhou, Internet Things Thrust & Intelligent Transporta T, Guangzhou 510000, Peoples R China

[4] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024 / Vol. 9 / No. 01

Keywords:

Autonomous aerial vehicles; Task analysis; Search problems; Collaboration; Scalability; Real-time systems; Machine learning algorithms; Unmanned aerial vehicles; multi-agent reinforcement learning; distributed search algorithm; Markov decision process; OPTIMIZATION; ALGORITHM;

DOI:

10.1109/TIV.2023.3316196

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The development of machine learning and artificial intelligence algorithms, together with progress in unmanned aerial vehicle swarm technology, has significantly enhanced the intelligence and autonomy of unmanned aerial vehicles in search missions, resulting in greater efficiency when searching unknown areas. However, as search scenarios become more complex, existing unmanned aerial vehicle swarm search methods lack scalability and efficient cooperation. Furthermore, as the scale of search scenarios increases, the accuracy and real-time performance of global information are difficult to ensure, which makes local information necessary. This paper focuses on the large-scale search scenario and splits it to provide both local and global information for running unmanned aerial vehicle swarm search algorithms. Since the search environment is often unknown, dynamic, and complex, it requires adaptive decision-making in a constantly changing environment, which is suitable for modeling as a Markov decision process. Considering the sequential nature of the scenario, we propose a distributed collaborative search method based on a multi-agent reinforcement learning algorithm, which can operate efficiently in complex and large-scale scenarios. Additionally, the proposed method uses a convolutional neural network to process high-dimensional map data with almost no loss of structural information. Experimental results demonstrate that the proposed method can collaboratively search unknown areas, avoid collisions and repeated searches, and find all targets faster than the benchmarks.
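As a rough illustration of the abstract's idea of giving each vehicle both local and global information from one large map, the sketch below crops a small window around a vehicle and builds a coarse summary of the whole grid. It is not taken from the paper: the function names `local_view` and `global_view`, the padding value, the window radius, and the block-averaging scheme are all assumptions chosen for the example.

```python
from typing import List

Grid = List[List[float]]


def local_view(grid: Grid, row: int, col: int, radius: int, pad: float = -1.0) -> Grid:
    """Crop a (2*radius+1)-square window centred on (row, col).

    Cells that fall outside the map are filled with `pad` (an assumed
    marker for "out of bounds", not something specified by the paper).
    """
    rows, cols = len(grid), len(grid[0])
    return [
        [grid[r][c] if 0 <= r < rows and 0 <= c < cols else pad
         for c in range(col - radius, col + radius + 1)]
        for r in range(row - radius, row + radius + 1)
    ]


def global_view(grid: Grid, block: int) -> Grid:
    """Average non-overlapping block x block tiles into a coarse map."""
    rows, cols = len(grid), len(grid[0])
    coarse: Grid = []
    for r0 in range(0, rows, block):
        coarse_row = []
        for c0 in range(0, cols, block):
            cells = [grid[r][c]
                     for r in range(r0, min(r0 + block, rows))
                     for c in range(c0, min(c0 + block, cols))]
            coarse_row.append(sum(cells) / len(cells))
        coarse.append(coarse_row)
    return coarse


# Hypothetical 4x4 map: 1 marks a cell that has already been searched.
searched = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 1],
]
near = local_view(searched, row=0, col=0, radius=1)   # detailed, vehicle-centred
far = global_view(searched, block=2)                  # coarse, whole-map summary
```

In a setup like the one the abstract describes, two such arrays could be stacked as input channels for a convolutional network, which keeps the spatial layout of the map intact; how the paper actually builds its inputs is not stated in this record.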


Pages: 568-578

Page count: 11

References: 36 in total

