Automated Recognition of Submerged Body-like Objects in Sonar Images Using Convolutional Neural Networks

被引:0
作者
Nga, Yan Zun [1 ]
Rymansaib, Zuhayr [1 ]
Treloar, Alfie Anthony [1 ]
Hunter, Alan [1 ]
机构
[1] Univ Bath, Fac Engn & Design, Bath BA2 7AY, England
基金
英国工程与自然科学研究理事会;
关键词
underwater search; automation; robotics; sidescan sonar (SSS); automated target recognition (ATR); machine learning; convolutional neural networks (CNN); SIDE-SCAN SONAR; CLASSIFICATION; ALGORITHM;
D O I
10.3390/rs16214036
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The Police Robot for Inspection and Mapping of Underwater Evidence (PRIME) is an uncrewed surface vehicle (USV) currently being developed for underwater search and recovery teams to assist in crime scene investigation. The USV maps underwater scenes using sidescan sonar (SSS). Test exercises use a clothed mannequin lying on the seafloor as a target object to evaluate system performance. A robust, automated method for detecting human body-shaped objects is required to maximise operational functionality. The use of a convolutional neural network (CNN) for automatic target recognition (ATR) is proposed. SSS image data acquired from four different locations during previous missions were used to build a dataset consisting of two classes, i.e., a binary classification problem. The target object class consisted of 166 196 x 196 pixel image snippets of the underwater mannequin, whereas the non-target class consisted of 13,054 examples. Due to the large class imbalance in the dataset, CNN models were trained with six different imbalance ratios. Two different pre-trained models (ResNet-50 and Xception) were compared, and trained via transfer learning. This paper presents results from the CNNs and details the training methods used. Larger datasets are shown to improve CNN performance despite class imbalance, achieving average F1 scores of 97% in image classification. Average F1 scores for target vs background classification with unseen data are only 47% but the end result is enhanced by combining multiple weak classification results in an ensemble average. The combined output, represented as a georeferenced heatmap, accurately indicates the target object location with a high detection confidence and one false positive of low confidence. The CNN approach shows improved object detection performance when compared to the currently used ATR method.
引用
收藏
页数:16
相关论文
共 59 条
  • [1] A Data Augmentation-Based Framework to Handle Class Imbalance Problem for Alzheimer's Stage Detection
    Afzal, Sitara
    Maqsood, Muazzam
    Nazir, Faria
    Khan, Umair
    Aadil, Farhan
    Awan, Khalid M.
    Mehmood, Irfan
    Song, Oh-Young
    [J]. IEEE ACCESS, 2019, 7 : 115528 - 115539
  • [2] [Anonymous], 2014, P 2 INT C EXH UND AC
  • [3] Geophysical Methods for Wreck-Site Monitoring: the Rapid Archaeological Site Surveying and Evaluation (RASSE) programme
    Bates, C. Richard
    Lawrence, Mark
    Dean, Martin
    Robertson, Philip
    [J]. INTERNATIONAL JOURNAL OF NAUTICAL ARCHAEOLOGY, 2011, 40 (02) : 404 - 416
  • [4] Becker RF, 2013, Underwater Forensic Investigation
  • [5] Brown S., Dorset Live
  • [6] Chapple PB, 2009, OCEANS-IEEE, P980
  • [7] Chollet F., 2015, KERAS
  • [8] Xception: Deep Learning with Depthwise Separable Convolutions
    Chollet, Francois
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
  • [9] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [10] A Comparative Study of Different CNN Models and Transfer Learning Effect for Underwater Object Classification in Side-Scan Sonar Images
    Du, Xing
    Sun, Yongfu
    Song, Yupeng
    Sun, Huifeng
    Yang, Lei
    [J]. REMOTE SENSING, 2023, 15 (03)