A Multi-Domain Collaborative Transfer Learning Method with Multi-Scale Repeated Attention Mechanism for Underwater Side-Scan Sonar Image Classification

Cited by: 31
Authors
Cheng, Zhen [1]
Huo, Guanying [1]
Li, Haisen [2]
Affiliations
[1] Hohai Univ, Coll Internet Things Engn, Changzhou 213022, Jiangsu, Peoples R China
[2] Harbin Engn Univ, Coll Underwater Acoust Engn, Harbin 150001, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
side-scan sonar image classification; multi-domain collaborative transfer learning; multi-scale repeated attention mechanism; multi-domain datasets; feature representation; SEDIMENT CLASSIFICATION; SVM; CNN;
DOI
10.3390/rs14020355
Chinese Library Classification number
X [Environmental Science, Safety Science];
Subject classification code
08; 0830;
Abstract
Recognizing and classifying underwater targets in side-scan sonar (SSS) images is challenging because strong speckle noise caused by seabed reverberation makes it difficult to extract discriminative, noise-free features of a target. Moreover, unlike classification of optical images, which can use a large dataset to train the classifier, classification of SSS images usually has to rely on a very small training dataset, which may cause the classifier to overfit. Compared with traditional feature extraction methods that use descriptors such as Haar, SIFT, and LBP, deep learning-based methods are more powerful in capturing discriminative features. After training on a large optical dataset, e.g., ImageNet, direct fine-tuning improves sonar image classification with a small SSS image dataset. However, because optical images and sonar images have different statistical characteristics, transfer learning methods such as fine-tuning lack cross-domain adaptability and therefore cannot achieve fully satisfactory results. In this paper, a multi-domain collaborative transfer learning (MDCTL) method with a multi-scale repeated attention mechanism (MSRAM) is proposed to improve the accuracy of underwater sonar image classification. In the MDCTL method, the low-level characteristic similarity between SSS images and synthetic aperture radar (SAR) images and the high-level representation similarity between SSS images and optical images are used together to enhance the feature extraction ability of the deep learning model. By using the different characteristics of multi-domain data to efficiently capture useful features for sonar image classification, MDCTL offers a new approach to transfer learning. MSRAM effectively combines multi-scale features so that the proposed model pays more attention to the shape details of the target while excluding noise.
Classification experiments show that, when using multi-domain datasets, the proposed method is more stable, with an overall accuracy of 99.21%, an improvement of 4.54% over the fine-tuned VGG19. Results from diverse visualization methods also demonstrate that MDCTL and MSRAM make the method more powerful in feature representation.
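The abstract does not give the internals of MSRAM, so the following is only a minimal sketch of the general idea of attention-weighted fusion of multi-scale features, not the authors' method. It assumes each scale has already been pooled to a feature vector of the same length, and it scores each scale by its mean activation as a stand-in for a learned scoring layer. The function names `softmax` and `fuse_multiscale` and the sample values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_multiscale(features):
    """Fuse same-length feature vectors from several scales.

    Each scale receives one attention weight. The score used here is the
    mean activation of the scale (an assumption for illustration; a real
    model would learn this score).
    """
    scores = [sum(f) / len(f) for f in features]
    weights = softmax(scores)
    dim = len(features[0])
    # Weighted sum across scales, computed per feature dimension.
    fused = [sum(w * f[i] for w, f in zip(weights, features)) for i in range(dim)]
    return fused, weights


# Hypothetical pooled features at three scales (values are made up).
scales = [[0.2, 0.1, 0.4], [0.9, 0.8, 1.0], [0.3, 0.3, 0.3]]
fused, weights = fuse_multiscale(scales)
```

Because the weights come from a softmax, they sum to one, so each fused value is a convex combination of the per-scale values and the scale with the strongest response contributes most.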
Pages: 25
Related papers
47 in total
  • [1] Semisynthetic Versus Real-World Sonar Training Data for the Classification of Mine-Like Objects
    Barngrover, Christopher
    Kastner, Ryan
    Belongie, Serge
    [J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 2015, 40 (01) : 48 - 56
  • [2] Albumentations: Fast and Flexible Image Augmentations
    Buslaev, Alexander
    Iglovikov, Vladimir I.
    Khvedchenya, Eugene
    Parinov, Alex
    Druzhinin, Mikhail
    Kalinin, Alexandr A.
    [J]. INFORMATION, 2020, 11 (02)
  • [3] Celebi A. T., 2011, 2011 IEEE 19th Signal Processing and Communications Applications Conference (SIU 2011), P916, DOI 10.1109/SIU.2011.5929801
  • [4] Speckle noise reduction in SAS imagery
    Chaillan, Fabien
    Fraschini, Christophe
    Courmontagne, Philippe
    [J]. SIGNAL PROCESSING, 2007, 87 (04) : 762 - 781
  • [5] Automatic detection and classification of man-made targets in side scan sonar images
    Chew, Ai Ling
    Tong, Poh Bee
    Chia, Chin Swee
    [J]. 2007 SYMPOSIUM ON UNDERWATER TECHNOLOGY AND WORKSHOP ON SCIENTIFIC USE OF SUBMARINE CABLES AND RELATED TECHNOLOGIES, VOLS 1 AND 2, 2007, : 126 - 132
  • [6] Automated detection/classification of sea mines in sonar imagery
    Dobeck, GJ
    Hyland, JC
    Smedley, L
    [J]. DETECTION AND REMEDIATION TECHNOLOGIES FOR MINES AND MINELIKE TARGETS II, 1997, 3079 : 90 - 110
  • [7] Side-Scan Sonar Image Classification Based on Style Transfer and Pre-Trained Convolutional Neural Networks
    Ge, Qiang
    Ruan, Fengxue
    Qiao, Baojun
    Zhang, Qian
    Zuo, Xianyu
    Dang, Lanxue
    [J]. ELECTRONICS, 2021, 10 (15)
  • [8] Gerg I.D., 2021, IEEE T GEOSCI REMOTE, V60, P1, DOI 10.1109/TGRS.2020.3045649
  • [9] A Modified Local Binary Pattern Descriptor for SAR Image Matching
    Ghannadi, Mohammad Amin
    Saadatseresht, Mohammad
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (04) : 568 - 572
  • [10] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587