Deep reinforcement learning for time-critical wilderness search and rescue using drones

被引：0

作者：

Ewers, Jan-Hendrik ^{[1
]}

Anderson, David ^{[1
]}

Thomson, Douglas ^{[1
]}

机构：

[1] Univ Glasgow, Autonomous Syst & Connect, Glasgow City, Scotland

来源：

FRONTIERS IN ROBOTICS AND AI | 2025年 / 11卷

基金：

英国工程与自然科学研究理事会;

关键词：

reinforcement learning; search planning; mission planning; autonomous systems; wilderness search and rescue; unmanned aerial vehicle; machine learning;

D O I：

10.3389/frobt.2024.1527095

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial for effective operations. This paper proposes a novel algorithm using deep reinforcement learning to create efficient search paths for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the policy to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method achieves a significant improvement in search times compared to traditional coverage planning and search planning algorithms by over 160 % , a difference that can mean life or death in real-world search operations Additionally, unlike previous work, our approach incorporates a continuous action space enabled by cubature, allowing for more nuanced flight patterns.

引用

页数：10

共 50 条

[11] PBRL-TChain: A performance-enhanced permissioned blockchain for time-critical applications based on reinforcement learning
Zhang, Yiguang
Lin, Junxiong
Lu, Zhihui
Duan, Qiang
Huang, Shih-Chia
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 301 - 313
[12] Computationally Efficient DNN Mapping Search Heuristic using Deep Reinforcement Learning
Bakshi, Suyash
Johnsson, Lennart
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (05)
[13] A Search-Based Testing Approach for Deep Reinforcement Learning Agents
Zolfagharian, Amirhossein
Abdellatif, Manel
Briand, Lionel C.
Bagherzadeh, Mojtaba
Ramesh, S.
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (07) : 3715 - 3735
[14] Deep Reinforcement Learning Using Optimized Monte Carlo Tree Search in EWN
Zhang, Yixian
Li, Zhuoxuan
Cao, Yiding
Zhao, Xuan
Cao, Jinde
IEEE TRANSACTIONS ON GAMES, 2024, 16 (03) : 544 - 555
[15] A novel modified search and rescue optimization algorithm based on reinforcement learning for UAV path planning
Zhou W.-J.
Zhang C.-Q.
Tang W.-D.
Yi Y.-H.
Liu W.-W.
Qin W.-D.
Kongzhi yu Juece/Control and Decision, 2024, 39 (04): : 1203 - 1211
[16] Efficient Novelty Search Through Deep Reinforcement Learning
Shi, Longxiang
Li, Shijian
Zheng, Qian
Yao, Min
Pan, Gang
IEEE ACCESS, 2020, 8 : 128809 - 128818
[17] Real Time Path Planning of Robot using Deep Reinforcement Learning
Raajan, Jeevan
Srihari, P., V
Satya, Jayadev P.
Bhikkaji, B.
Pasumarthy, Ramkrishna
IFAC PAPERSONLINE, 2020, 53 (02): : 15602 - 15607
[18] Automated Vulnerability Exploitation Using Deep Reinforcement Learning
Almajali, Anas
Al-Abed, Loiy
Yousef, Khalil M. Ahmad
Mohd, Bassam J.
Samamah, Zaid
Abu Shhadeh, Anas
APPLIED SCIENCES-BASEL, 2024, 14 (20):
[19] Constrained attractor selection using deep reinforcement learning
Wang, Xue-She
Turner, James D.
Mann, Brian P.
JOURNAL OF VIBRATION AND CONTROL, 2021, 27 (5-6) : 502 - 514
[20] Decentralized Multi-Agent Deep Reinforcement Learning in Swarms of Drones for Flood Monitoring
Baldazo, David
Parras, Juan
Zazo, Santiago
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,

← 1 2 3 4 5 →