A Multi-Domain Collaborative Transfer Learning Method with Multi-Scale Repeated Attention Mechanism for Underwater Side-Scan Sonar Image Classification

Cited by: 31

Authors:

Cheng, Zhen [1]

Huo, Guanying [1]

Li, Haisen [2]

Affiliations:

[1] Hohai Univ, Coll Internet Things Engn, Changzhou 213022, Jiangsu, Peoples R China

[2] Harbin Engn Univ, Coll Underwater Acoust Engn, Harbin 150001, Peoples R China

Source:

REMOTE SENSING | 2022 / Vol. 14 / Issue 02

Funding:

National Natural Science Foundation of China;

Keywords:

side-scan sonar image classification; multi-domain collaborative transfer learning; multi-scale repeated attention mechanism; multi-domain datasets; feature representation; SEDIMENT CLASSIFICATION; SVM; CNN;

DOI:

10.3390/rs14020355

Chinese Library Classification number:

X [Environmental Science, Safety Science];

Discipline classification code:

08 ; 0830 ;

Abstract:

Strong speckle noise caused by seabed reverberation makes it difficult to extract discriminative, noise-free features of a target, so recognizing and classifying underwater targets in side-scan sonar (SSS) images is a major challenge. Moreover, unlike classification of optical images, which can use a large dataset to train the classifier, classification of SSS images usually has to rely on a very small training dataset, which may cause the classifier to overfit. Compared with traditional feature extraction methods that use descriptors such as Haar, SIFT, and LBP, deep learning-based methods are more powerful in capturing discriminative features. After training on a large optical dataset, e.g., ImageNet, direct fine-tuning improves sonar image classification on a small SSS image dataset. However, because optical images and sonar images have different statistical characteristics, transfer learning methods such as fine-tuning lack cross-domain adaptability and therefore cannot achieve fully satisfactory results. In this paper, a multi-domain collaborative transfer learning (MDCTL) method with a multi-scale repeated attention mechanism (MSRAM) is proposed to improve the accuracy of underwater sonar image classification. In the MDCTL method, the low-level characteristic similarity between SSS images and synthetic aperture radar (SAR) images and the high-level representation similarity between SSS images and optical images are used together to enhance the feature extraction ability of the deep learning model. By using the different characteristics of multi-domain data to efficiently capture features useful for sonar image classification, MDCTL offers a new approach to transfer learning. MSRAM effectively combines multi-scale features so that the proposed model pays more attention to the shape details of the target while excluding the noise. Classification experiments show that, when using multi-domain datasets, the proposed method is more stable, with an overall accuracy of 99.21%, an improvement of 4.54% over the fine-tuned VGG19. Results from diverse visualization methods also demonstrate that MDCTL and MSRAM make the method more powerful in feature representation.

Pages: 25
