On Automatic Person-in-Water Detection for Marine Search and Rescue Operations

被引:4
作者
Taipalmaa, Jussi [1 ]
Raitoharju, Jenni [2 ]
Queralta, Jorge Pena [3 ,4 ]
Westerlund, Tomi [3 ]
Gabbouj, Moncef [1 ]
机构
[1] Tampere Univ, Dept Comp Sci, Tampere 33014, Finland
[2] Univ Jyvaskyla, Fac Informat Technol, Jyvaskyla 40014, Finland
[3] Univ Turku, Turku Intelligent Embedded & Robot Syst Grp, Turku 20014, Finland
[4] Swiss Fed Inst Technol, Swiss Fed Sch Technol Zurich, SCAI Lab, SPZ, CH-8092 Zurich, Switzerland
基金
芬兰科学院;
关键词
Search and rescue (SAR); person-in-water; unmanned aerial vehicle (UAV); object detection; deep learning (DL); dataset;
D O I
10.1109/ACCESS.2024.3386640
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In marine search and rescue missions, the objective is to find a missing person in the water. Time is a critical factor in the identification of the missing person, as any delay in locating them can have life-threatening consequences. Autonomous unmanned aerial vehicles (UAVs) possess the potential to help in the search task by providing a bird's-eye view helping to cover larger areas faster. Therefore, it is very important that UAVs can efficiently and accurately detect persons in the water. This work studies automatic person detection in the water from a UAV. We performed experiments on both lakes and sea near Turku, Finland, and captured videos of people in the water from various altitudes and different viewing angles. Our person-in-water detection tests focus on important factors that have not received sufficient attention in prior studies: evaluation metrics and detection thresholds, the impact and use of different bounding box sizes, multi-frame detection and performance in unseen environments. We provide analysis of the suitability of different approaches for the person detection task and we also publish our training and testing data that includes over 72000 frames. To the best of our knowledge, this is the largest publicly available person-in-water detection dataset.
引用
收藏
页码:52428 / 52438
页数:11
相关论文
共 42 条
[1]  
[Anonymous], 2022, Labelbox
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   Surpassing the Human Accuracy: Detecting Gallbladder Cancer from USG Images with Curriculum Learning [J].
Basu, Soumen ;
Gupta, Mayank ;
Rana, Pratyaksha ;
Gupta, Pankaj ;
Arora, Chetan .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :20854-20864
[4]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
[5]  
Bovcon B., 2019, PROC COMPUT VIS WINT, P1
[6]  
Bovcon B, 2018, IEEE INT C INT ROBOT, P5807, DOI 10.1109/IROS.2018.8594238
[7]   Stereo obstacle detection for unmanned surface vehicles by IMU-assisted semantic segmentation [J].
Bovcon, Borja ;
Mandeljc, Rok ;
Pers, Janez ;
Kristan, Matej .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 104 :1-13
[8]  
Bovcon B, 2017, INT SYMP IMAGE SIG, P1, DOI 10.1109/ISPA.2017.8073559
[9]   Human Detection in Drone Images Using YOLO for Search-and-Rescue Operations [J].
Caputo, Sergio ;
Castellano, Giovanna ;
Greco, Francesco ;
Mencar, Corrado ;
Petti, Niccolo ;
Vessio, Gennaro .
AIXIA 2021 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13196 :326-337
[10]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851