Automatic Person Detection in Search and Rescue Operations Using Deep CNN Detectors

被引:64
作者
Sambolek, Sasa [1 ]
Ivasic-Kos, Marina [2 ,3 ]
机构
[1] High Sch Tina Ujevica, Kutina 44320, Croatia
[2] Univ Rijeka, Dept Informat, Rijeka 51000, Croatia
[3] Univ Rijeka, Ctr Artificial Intelligence & Cybersecur, Rijeka 51000, Croatia
关键词
Convolutional neural networks; object detector; person detection; search and rescue operations; UAV; YOLO; UNMANNED AERIAL VEHICLES; UAVS;
D O I
10.1109/ACCESS.2021.3063681
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to a growing number of people who carry out various adrenaline activities or adventure tourism and stay in the mountains and other inaccessible places, there is an increasing need to organize a search and rescue operation (SAR) to provide assistance and health care to the injured. The goal of SAR operation is to search the largest area of the territory in the shortest time possible and find a lost or injured person. Today, drones (UAVs or drones) are increasingly involved in search operations, as they can capture a large, controlled area in a short amount of time. However, a detailed examination of a large amount of recorded material remains a problem. Even for an expert, it is not easy to find searched people who are relatively small considering the area where they are, often sheltered by vegetation or merged with the ground and in unusual positions due to falls, injuries, or exhaustion. Therefore, the automatic detection of persons and objects in images/videos taken by drones in these operations is very significant. In this paper, the reliability of existing state-of-the-art detectors such as Faster R-CNN, YOLOv4, RetinaNet, and Cascade R-CNN on a VisDrone benchmark and custom-made dataset SARD build to simulate rescue scenes was investigated. After training the models on selected datasets, detection results were compared. Because of the high speed and accuracy and the small number of false detections, the YOLOv4 detector was chosen for further examination. YOLOv4 model results related to different network sizes, different detection accuracies, and transfer learning settings were analyzed. The model robustness to weather conditions and motion blur were also investigated. The paper proposes a model that can be used in SAR operations because of the excellent results in detecting people in search and rescue scenarios.
引用
收藏
页码:37905 / 37922
页数:18
相关论文
共 71 条
  • [21] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [22] He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]
  • [23] Howard AG, ARXIV E PRINTS ARXIV
  • [24] Searching for MobileNetV3
    Howard, Andrew
    Sandler, Mark
    Chu, Grace
    Chen, Liang-Chieh
    Chen, Bo
    Tan, Mingxing
    Wang, Weijun
    Zhu, Yukun
    Pang, Ruoming
    Vasudevan, Vijay
    Le, Quoc V.
    Adam, Hartwig
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1314 - 1324
  • [25] Drone-based Object Counting by Spatially Regularized Regional Proposal Network
    Hsieh, Meng-Ru
    Lin, Yen-Liang
    Hsu, Winston H.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4165 - 4173
  • [26] Ivasic-Kos M., 2019, P SAI INT SYST C
  • [27] Ivasic-Kos M, 2018, EUR W VIS INF PROCES
  • [28] Human Detection in Thermal Imaging Using YOLO
    Ivasic-Kos, Marina
    Kristo, Mate
    Pobar, Miran
    [J]. PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND TECHNOLOGY APPLICATIONS (ICCTA 2019), 2019, : 20 - 24
  • [29] A knowledge-based multi-layered image annotation system
    Ivasic-Kos, Marina
    Ipsic, Ivo
    Ribaric, Slobodan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (24) : 9539 - 9553
  • [30] Jin P., 2020, ARXIV200111394