Automated Recognition of Submerged Body-like Objects in Sonar Images Using Convolutional Neural Networks

Citations: 0
Authors
Nga, Yan Zun [1]
Rymansaib, Zuhayr [1]
Treloar, Alfie Anthony [1]
Hunter, Alan [1]
Affiliations
[1] Univ Bath, Fac Engn & Design, Bath BA2 7AY, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
underwater search; automation; robotics; sidescan sonar (SSS); automated target recognition (ATR); machine learning; convolutional neural networks (CNN); SIDE-SCAN SONAR; CLASSIFICATION; ALGORITHM;
DOI
10.3390/rs16214036
Chinese Library Classification
X [Environmental Science, Safety Science];
Discipline classification code
08; 0830;
Abstract
The Police Robot for Inspection and Mapping of Underwater Evidence (PRIME) is an uncrewed surface vehicle (USV) currently being developed to assist underwater search and recovery teams in crime scene investigation. The USV maps underwater scenes using sidescan sonar (SSS). Test exercises use a clothed mannequin lying on the seafloor as a target object to evaluate system performance. A robust, automated method for detecting human body-shaped objects is required to maximise operational functionality. The use of a convolutional neural network (CNN) for automatic target recognition (ATR) is proposed. SSS image data acquired from four different locations during previous missions were used to build a two-class dataset for binary classification. The target class consisted of 166 image snippets of the underwater mannequin, each 196 x 196 pixels, whereas the non-target class consisted of 13,054 examples. Because of this large class imbalance, CNN models were trained with six different imbalance ratios. Two pre-trained models (ResNet-50 and Xception) were trained via transfer learning and compared. This paper presents results from the CNNs and details the training methods used. Larger datasets are shown to improve CNN performance despite class imbalance, achieving average F1 scores of 97% in image classification. Average F1 scores for target vs background classification with unseen data are only 47%, but the end result is enhanced by combining multiple weak classification results in an ensemble average. The combined output, represented as a georeferenced heatmap, accurately indicates the target object location with high detection confidence, alongside one false positive of low confidence. The CNN approach shows improved object detection performance compared with the currently used ATR method.
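The two quantities the abstract relies on, the F1 score and the ensemble average of weak classification results, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the `(easting, northing, score)` detection format, and the fixed square grid used to stand in for a georeferenced heatmap are all assumptions made for illustration.

```python
from collections import defaultdict


def f1_score(tp, fp, fn):
    """F1 is the harmonic mean of precision and recall."""
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def ensemble_heatmap(detections, cell_size):
    """Average classifier scores that fall in the same grid cell.

    detections: iterable of (easting, northing, score) tuples
                (hypothetical format, one tuple per classified snippet).
    cell_size:  edge length of a square grid cell, in the same units
                as the coordinates (assumed, not taken from the paper).
    Returns a dict mapping (column, row) cell indices to the mean score.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for x, y, score in detections:
        cell = (int(x // cell_size), int(y // cell_size))
        sums[cell] += score
        counts[cell] += 1
    return {cell: sums[cell] / counts[cell] for cell in sums}


if __name__ == "__main__":
    # Two overlapping looks at one location and a single look elsewhere.
    looks = [(1.0, 1.0, 0.9), (1.5, 1.2, 0.7), (7.0, 1.0, 0.4)]
    print(ensemble_heatmap(looks, cell_size=2.0))
    print(f1_score(tp=8, fp=2, fn=2))
```

Under these assumptions, a location seen in several overlapping snippets accumulates several scores, so a consistent target stands out in the averaged map even when each individual classification is weak.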
Pages: 16