A Spatio-temporal Deep Learning Approach for Underwater Acoustic Signals Classification

被引：8

作者：

Alouani, Zakaria ^{[1
,3
]}

Hmamouche, Youssef ^{[1
]}

El Khamlichi, Btissam ^{[1
]}

Seghrouchni, Amal El Fallah ^{[1
,2
]}

机构：

[1] Mohammed VI Polytechn Univ, AI Movement Int Artificial Intelligence Ctr Moro, Rabat, Morocco

[2] Sorbonne Univ, LIP6, UMR 7606, CNRS, Paris, France

[3] Natl Inst Stat & Appl Econ, Rabat, Morocco

来源：

2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022) | 2022年

关键词：

D O I：

10.1109/AVSS56176.2022.9959247

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Target recognition from underwater acoustic signals is a major challenge in surveillance systems, especially in military and defense fields. Deep learning models are increasingly used for the automatic classification of underwater signals, but many challenges remain due to the complexity of sound navigation and ranging networks, the noise present in the signals, and the difficulty of collecting large amounts of data for efficient training. In this paper, we propose two new architectures for underwater signal classification based on Spatio-temporal modeling. In experiments, evaluations on two real datasets show that the proposed approach achieves a classification accuracy of 98% which outperforms the state-of-the-art methods. In addition, the proposed end-to-end network is considerably faster than MFCC-based networks such as Yamnet and VGGish.

引用

页数：7

共 18 条

[1] Deep Transfer Learning for Machine Diagnosis: From Sound and Music Recognition to Bearing Fault Detection [J].

Brusa, Eugenio ;

Delprete, Cristiana ;

Di Maggio, Luigi Gianpio .

APPLIED SCIENCES-BASEL, 2021, 11 (24)

[2] Multimodal Emotion Recognition Using Transfer Learning on Audio and Text Data [J].

Deng, James J. ;

Leung, Clement H. C. ;

Li, Yuanxi .

COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT III, 2021, 12951 :552-563

[3]

Gemmeke JF, 2017, INT CONF ACOUST SPEE, P776, DOI 10.1109/ICASSP.2017.7952261

[4] Audio Signal Classification Using Linear Predictive Coding and Random Forests [J].

Grama, Lacrimioara ;

Rusu, Corneliu .

2017 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2017,

[5]

Hershey S, 2017, INT CONF ACOUST SPEE, P131, DOI 10.1109/ICASSP.2017.7952132

[6] DeepShip: An underwater acoustic benchmark dataset and a separable convolution based autoencoder for classification [J].

Irfan, Muhammad ;

Zheng Jiangbin ;

Ali, Shahid ;

Iqbal, Muhammad ;

Masood, Zafar ;

Hamid, Umar .

EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183

[7] Passive sonar automated target classifier for shallow waters using end-to-end learnable deep convolutional LSTMs [J].

Kamal, Suraj ;

Chandran, Satheesh C. ;

Supriya, M. H. .

ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2021, 24 (04) :860-871

[8]

Kang CY, 2004, LECT NOTES COMPUT SC, V3173, P930

[9] Underwater target recognition using convolutional recurrent neural networks with 3-D Mel-spectrogram and data augmentation [J].

Liu, Feng ;

Shen, Tongsheng ;

Luo, Zailei ;

Zhao, Dexin ;

Guo, Shaojun .

APPLIED ACOUSTICS, 2021, 178

[10] ShipsEar: An underwater vessel noise database [J].

Santos-Dominguez, David ;

Torres-Guijarro, Soledad ;

Cardenal-Lopez, Antonio ;

Pena-Gimenez, Antonio .

APPLIED ACOUSTICS, 2016, 113 :64-69

← 1 2 →