Deep Learning Approach in Aerial Imagery for Supporting Land Search and Rescue Missions

被引：1

作者：

Dunja Božić-Štulić

Željko Marušić

Sven Gotovac

机构：

[1] University of Split,Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture

[2] University of Mostar,Faculty of Science and Education

来源：

International Journal of Computer Vision | 2019年 / 127卷

关键词：

Convolutional neural networks; RCNN; Salient object detection; Unmanned aerial vehicles (UAV); Search and rescue; SAR image database;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, we propose a novel approach to person detection in UAV aerial images for search and rescue tasks in Mediterranean and Sub-Mediterranean landscapes. Person detection in very high spatial resolution images involves target objects that are relatively small and often camouflaged within the environment; thus, such detection is a challenging and demanding task. The proposed method starts by reducing the search space through a visual attention algorithm that detects the salient or most prominent segments in the image. To reduce the number of non-relevant salient regions, we selected those regions most likely to contain a person using pre-trained and fine-tuned convolutional neural networks (CNNs) for detection. We established a special database called HERIDAL to train and test our model. This database was compiled for training purposes, and it contains over 68,750 image patches of wilderness acquired from an aerial perspective as well as approximately 500 labelled full-size real-world images intended for testing purposes. The proposed method achieved a detection rate of 88.9% and a precision of 34.8%, which demonstrates better effectiveness than the system currently used by Croatian Mountain search and rescue (SAR) teams (IPSAR), which is based on mean-shift segmentation. We also used the HERIDAL database to train and test a state-of-the-art region proposal network, Faster R-CNN (Ren et al. in Faster R-CNN: towards real-time object detection with region proposal networks, 2015. CoRR arXiv:1506.01497), which achieved comparable but slightly worse results than those of our proposed method.

引用

页码：1256 / 1278

页数：22

共 14 条

[1] Enzweiler M(2009)Monocular pedestrian detection: Survey and experiments IEEE Transactions on Pattern Analysis and Machine Intelligence 31 2179-2195
[2] Gavrila DM(2013)A saliency detection model using low-level features based on wavelet transform IEEE Transactions on Multimedia 15 96-105
[3] Imamoglu N(1980)A feature-integration theory of attention Cognit Psychol 12 97-136
[4] Lin W(2010)Two-stage segmentation of aerial images for search and rescue Information Technology and Control 39 138-145
[5] Fang Y(2017)How good is my test data? Introducing safety analysis for computer vision International Journal of Computer Vision 125 95-109
[6] Treisman AM(undefined)undefined undefined undefined undefined-undefined
[7] Gelade G(undefined)undefined undefined undefined undefined-undefined
[8] Turić H(undefined)undefined undefined undefined undefined-undefined
[9] Dujmić H(undefined)undefined undefined undefined undefined-undefined
[10] Papić V(undefined)undefined undefined undefined undefined-undefined

← 1 2 →