Joint Representation and Recognition for Ship-Radiated Noise Based on Multimodal Deep Learning

被引：34

作者：

Yuan, Fei ^{[1
]}

Ke, Xiaoquan ^{[1
]}

Cheng, En ^{[1
]}

机构：

[1] Xiamen Univ, Minist Educ, Key Lab Underwater Acoust Commun & Marine Informa, Xiamen 361005, Fujian, Peoples R China

来源：

JOURNAL OF MARINE SCIENCE AND ENGINEERING | 2019年 / 7卷 / 11期

基金：

中国国家自然科学基金;

关键词：

ship-radiated noise recognition; pattern recognition; multimodal deep learning; canonical correlation analysis; UNDERWATER; FUSION;

D O I：

10.3390/jmse7110380

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

Ship recognition based on ship-radiated noise is one of the most important and challenging subjects in underwater acoustic signal processing. The recognition methods for ship-radiated noise recognition include traditional methods and deep learning (DL) methods. Developing from the DL methods and inspired by audio-video speech recognition (AVSR), the paper further introduces multimodal deep learning (multimodal-DL) methods for the recognition of ship-radiated noise. In this paper, ship-radiated noise (acoustics modality) and visual observation of the ships (visual modality) are two different modalities that the multimodal-DL methods model on. The paper specially designs a multimodal-DL framework, the multimodal convolutional neural networks (multimodal-CNNs) for the recognition of ship-radiated noise. Then the paper proposes a strategy based on canonical correlation analysis (CCA-based strategy) to build a joint representation and recognition on the two different single-modality (acoustics modality and visual modality). The multimodal-CNNs and the CCA-based strategy are tested on real ship-radiated noise data recorded. Experimental results show that, using the CCA-based strategy, strong-discriminative information can be built from weak-discriminative information provided from a single-modality. Experimental results also show that as long as any one of the single-modalities can provide information for the recognition, the multimodal-DL methods can have a much better multiclass recognition performance than the DL methods. The paper also discusses the advantages and superiorities of the multimodal-Dl methods over the traditional methods for ship-radiated noise recognition.

引用

页数：17

共 21 条

[1] Acoustic detection and classification of river boats
Averbuch, Amir
Zheludev, Valery
Neittaanmaki, Pekka
Wartiainen, Pekka
Huoman, Kari
Janson, Kim
[J]. APPLIED ACOUSTICS, 2011, 72 (01) : 22 - 34
[2] Underwater target classification using wavelet packets and neural networks
Azimi-Sadjadi, MR
Yao, D
Huang, Q
Dobeck, GJ
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03): : 784 - 794
[3] Convolutional Neural Network With Second-Order Pooling for Underwater Target Classification
Cao, Xu
Togneri, Roberto
Zhang, Xiaomin
Yu, Yang
[J]. IEEE SENSORS JOURNAL, 2019, 19 (08) : 3058 - 3066
[4] Cao X, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), P89, DOI 10.1109/ICDSP.2016.7868522
[5] Cui GL, 2014, IEEE RAD CONF, P144, DOI 10.1109/RADAR.2014.6875573
[6] Discriminant Correlation Analysis: Real-Time Feature Level Fusion for Multimodal Biometric Recognition
Haghighat, Mohammad
Abdel-Mottaleb, Mohamed
Alhalabi, Wadee
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (09) : 1984 - 1996
[7] Jiang YL, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), P75
[8] Kamal S, 2013, OCEAN ELECTR, P48
[9] Underwater Acoustic Target Recognition Based on Supervised Feature-Separation Algorithm
Ke, Xiaoquan
Yuan, Fei
Cheng, En
[J]. SENSORS, 2018, 18 (12)
[10] Mroueh Y, 2015, INT CONF ACOUST SPEE, P2130, DOI 10.1109/ICASSP.2015.7178347

← 1 2 3 →