UAV Swarm Cooperative Target Search: A Multi-Agent Reinforcement Learning Approach

Cited by: 28

Authors:

Hou, Yukai [1]

Zhao, Jin [1]

Zhang, Rongqing [1]

Cheng, Xiang [2]

Yang, Liuqing [3,4]

Affiliations:

[1] Tongji Univ, Sch Software Engn, Shanghai 200092, Peoples R China

[2] Peking Univ, Sch Elect, Beijing 100871, Peoples R China

[3] Hong Kong Univ Sci & Technol Guangzhou, Internet Things Thrust & Intelligent Transporta T, Guangzhou 510000, Peoples R China

[4] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024 / Vol. 9 / No. 1

Keywords:

Autonomous aerial vehicles; Task analysis; Search problems; Collaboration; Scalability; Real-time systems; Machine learning algorithms; Unmanned aerial vehicles; multi-agent reinforcement learning; distributed search algorithm; Markov decision process; OPTIMIZATION; ALGORITHM;

DOI:

10.1109/TIV.2023.3316196

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The development of machine learning and artificial intelligence algorithms, as well as the progress of unmanned aerial vehicle swarm technology, has significantly enhanced the intelligence and autonomy of unmanned aerial vehicles in search missions, resulting in greater efficiency when searching unknown areas. However, as search scenarios become more complex, existing unmanned aerial vehicle swarm search methods lack scalability and efficient cooperation. Furthermore, due to the increasing scale of search scenarios, the accuracy and real-time performance of global information are difficult to ensure, necessitating the provision of local information. This paper focuses on the large-scale search scenario and splits it to provide both local and global information for running unmanned aerial vehicle swarm search algorithms. Since the search environment is often unknown, dynamic, and complex, it requires adaptive decision-making in a constantly changing environment, which is suitable for modeling as a Markov decision process. Considering the sequential nature of the scenario, we propose a distributed collaborative search method based on a multi-agent reinforcement learning algorithm, which can operate efficiently in complex and large-scale scenarios. Additionally, the proposed method can utilize a convolutional neural network to process high-dimensional map data with almost no loss of structural information. Experimental results demonstrate that the proposed method can collaboratively search unknown areas, avoid collisions and repetitions, and find all targets faster than the benchmarks.
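The abstract describes splitting a large search map so that each UAV receives both local and global information. The record does not say how the split is done, so the following is only a minimal sketch under stated assumptions: the map is a 2-D grid of searched/unsearched cells, the local information is a square window around the UAV, and the global information is a coarse block-averaged summary. The names `local_view`, `global_view`, `radius`, `block`, and `pad` are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# Assumed encoding (not from the paper): 1 = cell already searched, 0 = not yet searched.
Grid = List[List[int]]


def local_view(grid: Grid, pos: Tuple[int, int], radius: int, pad: int = -1) -> Grid:
    """Return the (2*radius+1)-sided square window centred on pos.

    Cells that fall outside the map are filled with `pad`.
    """
    rows, cols = len(grid), len(grid[0])
    r0, c0 = pos
    return [
        [grid[r][c] if 0 <= r < rows and 0 <= c < cols else pad
         for c in range(c0 - radius, c0 + radius + 1)]
        for r in range(r0 - radius, r0 + radius + 1)
    ]


def global_view(grid: Grid, block: int) -> List[List[float]]:
    """Return a coarse map: the fraction of searched cells in each block x block region."""
    rows, cols = len(grid), len(grid[0])
    coarse = []
    for r in range(0, rows, block):
        coarse_row = []
        for c in range(0, cols, block):
            cells = [grid[i][j]
                     for i in range(r, min(r + block, rows))
                     for j in range(c, min(c + block, cols))]
            coarse_row.append(sum(cells) / len(cells))
        coarse.append(coarse_row)
    return coarse


grid = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 1],
]
local = local_view(grid, pos=(0, 0), radius=1)   # 3x3 window, padded at the map edge
coarse = global_view(grid, block=2)              # 2x2 summary of the 4x4 map
```

In a setup like this, both outputs keep their 2-D layout, which is the property that would let a convolutional network consume them without flattening away the spatial structure.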


Pages: 568-578

Page count: 11
