Joint Representation and Recognition for Ship-Radiated Noise Based on Multimodal Deep Learning

被引:34
作者
Yuan, Fei [1 ]
Ke, Xiaoquan [1 ]
Cheng, En [1 ]
机构
[1] Xiamen Univ, Minist Educ, Key Lab Underwater Acoust Commun & Marine Informa, Xiamen 361005, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
ship-radiated noise recognition; pattern recognition; multimodal deep learning; canonical correlation analysis; UNDERWATER; FUSION;
D O I
10.3390/jmse7110380
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Ship recognition based on ship-radiated noise is one of the most important and challenging subjects in underwater acoustic signal processing. The recognition methods for ship-radiated noise recognition include traditional methods and deep learning (DL) methods. Developing from the DL methods and inspired by audio-video speech recognition (AVSR), the paper further introduces multimodal deep learning (multimodal-DL) methods for the recognition of ship-radiated noise. In this paper, ship-radiated noise (acoustics modality) and visual observation of the ships (visual modality) are two different modalities that the multimodal-DL methods model on. The paper specially designs a multimodal-DL framework, the multimodal convolutional neural networks (multimodal-CNNs) for the recognition of ship-radiated noise. Then the paper proposes a strategy based on canonical correlation analysis (CCA-based strategy) to build a joint representation and recognition on the two different single-modality (acoustics modality and visual modality). The multimodal-CNNs and the CCA-based strategy are tested on real ship-radiated noise data recorded. Experimental results show that, using the CCA-based strategy, strong-discriminative information can be built from weak-discriminative information provided from a single-modality. Experimental results also show that as long as any one of the single-modalities can provide information for the recognition, the multimodal-DL methods can have a much better multiclass recognition performance than the DL methods. The paper also discusses the advantages and superiorities of the multimodal-Dl methods over the traditional methods for ship-radiated noise recognition.
引用
收藏
页数:17
相关论文
共 21 条
  • [1] Acoustic detection and classification of river boats
    Averbuch, Amir
    Zheludev, Valery
    Neittaanmaki, Pekka
    Wartiainen, Pekka
    Huoman, Kari
    Janson, Kim
    [J]. APPLIED ACOUSTICS, 2011, 72 (01) : 22 - 34
  • [2] Underwater target classification using wavelet packets and neural networks
    Azimi-Sadjadi, MR
    Yao, D
    Huang, Q
    Dobeck, GJ
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03): : 784 - 794
  • [3] Convolutional Neural Network With Second-Order Pooling for Underwater Target Classification
    Cao, Xu
    Togneri, Roberto
    Zhang, Xiaomin
    Yu, Yang
    [J]. IEEE SENSORS JOURNAL, 2019, 19 (08) : 3058 - 3066
  • [4] Cao X, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), P89, DOI 10.1109/ICDSP.2016.7868522
  • [5] Cui GL, 2014, IEEE RAD CONF, P144, DOI 10.1109/RADAR.2014.6875573
  • [6] Discriminant Correlation Analysis: Real-Time Feature Level Fusion for Multimodal Biometric Recognition
    Haghighat, Mohammad
    Abdel-Mottaleb, Mohamed
    Alhalabi, Wadee
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (09) : 1984 - 1996
  • [7] Jiang YL, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), P75
  • [8] Kamal S, 2013, OCEAN ELECTR, P48
  • [9] Underwater Acoustic Target Recognition Based on Supervised Feature-Separation Algorithm
    Ke, Xiaoquan
    Yuan, Fei
    Cheng, En
    [J]. SENSORS, 2018, 18 (12)
  • [10] Mroueh Y, 2015, INT CONF ACOUST SPEE, P2130, DOI 10.1109/ICASSP.2015.7178347