Deep reinforcement learning for time-critical wilderness search and rescue using drones

Authors
Ewers, Jan-Hendrik [1]
Anderson, David [1]
Thomson, Douglas [1]
Affiliations
[1] Univ Glasgow, Autonomous Syst & Connect, Glasgow City, Scotland
Source
FRONTIERS IN ROBOTICS AND AI | 2025 / Vol. 11
Funding
UK Engineering and Physical Sciences Research Council (EPSRC);
Keywords
reinforcement learning; search planning; mission planning; autonomous systems; wilderness search and rescue; unmanned aerial vehicle; machine learning;
DOI
10.3389/frobt.2024.1527095
Chinese Library Classification (CLC) number
TP24 [Robotics];
Discipline classification codes
080202; 1405;
Abstract
Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial for effective operations. This paper proposes a novel algorithm using deep reinforcement learning to create efficient search paths for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the policy to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method improves search times by over 160% compared to traditional coverage planning and search planning algorithms, a difference that can mean life or death in real-world search operations. Additionally, unlike previous work, our approach incorporates a continuous action space enabled by cubature, allowing for more nuanced flight patterns.
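The idea of scoring a flight path against a probability distribution map can be illustrated with a minimal sketch. This is not the authors' implementation: the isotropic Gaussian map, the square sensor footprint, the tensor-product Gauss-Legendre rule standing in for "cubature", and all function names (`gaussian_pdf`, `seen_probability`, `path_reward`) are assumptions made purely for illustration.

```python
import numpy as np


def gaussian_pdf(x, y, mean, sigma):
    """Isotropic 2-D Gaussian density (assumed stand-in for a probability map)."""
    dx, dy = x - mean[0], y - mean[1]
    return np.exp(-(dx**2 + dy**2) / (2.0 * sigma**2)) / (2.0 * np.pi * sigma**2)


def seen_probability(center, half_width, mean, sigma, order=5):
    """Probability mass inside a square footprint centred on `center`.

    Uses a tensor-product Gauss-Legendre rule: nodes on [-1, 1] are scaled
    to the footprint, and half_width**2 is the Jacobian of that scaling.
    """
    nodes, weights = np.polynomial.legendre.leggauss(order)
    xs = center[0] + half_width * nodes
    ys = center[1] + half_width * nodes
    grid_x, grid_y = np.meshgrid(xs, ys, indexing="ij")
    grid_w = np.outer(weights, weights)
    return float(half_width**2 * np.sum(grid_w * gaussian_pdf(grid_x, grid_y, mean, sigma)))


def path_reward(waypoints, half_width, mean, sigma):
    """Sum of footprint masses along a path.

    Simplification: overlapping footprints are counted more than once.
    """
    return sum(seen_probability(p, half_width, mean, sigma) for p in waypoints)


if __name__ == "__main__":
    mean, sigma = (0.0, 0.0), 1.0
    direct = [(0.0, 0.0), (2.0, 0.0)]   # passes over the most likely location
    distant = [(4.0, 4.0), (6.0, 4.0)]  # stays far from it
    print(path_reward(direct, 1.0, mean, sigma))
    print(path_reward(distant, 1.0, mean, sigma))
```

Because waypoints are continuous coordinates rather than grid cells, a scoring function of this kind can be evaluated for any position, which is one way a continuous action space could be paired with a probability map.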
Pages: 10