Denoising odontocete echolocation clicks using a hybrid model with convolutional neural network and long short-term memory network

Cited by: 3
Authors
Yang, Wuyi [1]
Chang, Wenlei [1]
Song, Zhongchang [1,3]
Niu, Fuqiang [2,4]
Wang, Xianyan [2]
Zhang, Yu [1]
Affiliations
[1] Xiamen Univ, Minist Educ, Coll Ocean & Earth Sci, Key Lab Underwater Acoust Commun & Marine Informat, Xiamen, Peoples R China
[2] Minist Nat Resources, Inst Oceanog 3, Lab Marine Biol & Ecol, Xiamen, Peoples R China
[3] Xiamen Univ, Coll Environm & Ecol, State Key Lab Marine Environm Sci, Xiamen, Peoples R China
[4] Xiamen Ocean Vocat Coll, Xiamen, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation;
Keywords
NEOPHOCAENA-PHOCAENOIDES-SUNAMERI; ACOUSTIC LOCALIZATION; CLASSIFICATION; BEHAVIOR; SOUNDS; GULF;
DOI
10.1121/10.0020560
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
Ocean noise negatively influences the recording of odontocete echolocation clicks. In this study, a hybrid model based on the convolutional neural network (CNN) and long short-term memory (LSTM) network, called a hybrid CNN-LSTM model, was proposed to denoise echolocation clicks. To learn the model parameters, the echolocation clicks were partially corrupted by adding ocean noise, and the model was trained to recover the original echolocation clicks. It can be difficult to collect large numbers of echolocation clicks free of ambient sea noise for training networks. Data augmentation and transfer learning were employed to address this problem. Based on Gabor functions, simulated echolocation clicks were generated to pre-train the network models, and the parameters of the networks were then fine-tuned using odontocete echolocation clicks. Finally, the performance of the proposed model was evaluated using synthetic data. The experimental results demonstrated the effectiveness of the proposed model for denoising two typical types of echolocation clicks, namely narrowband high-frequency and broadband echolocation clicks. The denoising performance of hybrid models with different numbers of convolutional and LSTM layers was evaluated. Consequently, hybrid models with one convolutional layer and multiple LSTM layers are recommended, which can be adopted for denoising both types of echolocation clicks. © 2023 Acoustical Society of America.
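The abstract's data-preparation idea (simulate a click from a Gabor function, then corrupt it with additive noise so that a model can be trained to recover the clean click) can be illustrated with a minimal sketch. This is not the authors' code. The sampling rate, centre frequency, envelope width, and the use of white Gaussian noise in place of recorded ocean noise are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math
import random


def gabor_click(fs_hz, center_hz, sigma_s, duration_s):
    """Gaussian-windowed cosine (a Gabor function) sampled at fs_hz."""
    n = int(round(duration_s * fs_hz))
    t0 = duration_s / 2.0  # centre the envelope in the window
    samples = []
    for i in range(n):
        t = i / fs_hz
        envelope = math.exp(-((t - t0) ** 2) / (2.0 * sigma_s ** 2))
        samples.append(envelope * math.cos(2.0 * math.pi * center_hz * (t - t0)))
    return samples


def power(x):
    """Mean squared amplitude of a sequence."""
    return sum(v * v for v in x) / len(x)


def add_noise_at_snr(clean, noise, target_snr_db):
    """Scale noise so 10*log10(P_clean / P_noise) equals target_snr_db, then add it."""
    scale = math.sqrt(power(clean) / (power(noise) * 10.0 ** (target_snr_db / 10.0)))
    return [c + scale * n for c, n in zip(clean, noise)]


def measure_snr_db(clean, noisy):
    """SNR of a noisy signal relative to its known clean version, in dB."""
    residual = [y - c for c, y in zip(clean, noisy)]
    return 10.0 * math.log10(power(clean) / power(residual))


# Assumed, illustrative parameters (not taken from the paper).
rng = random.Random(0)
clean = gabor_click(fs_hz=500_000, center_hz=125_000, sigma_s=20e-6, duration_s=200e-6)
noise = [rng.gauss(0.0, 1.0) for _ in clean]  # white Gaussian stand-in for ocean noise
noisy = add_noise_at_snr(clean, noise, target_snr_db=0.0)
```

Each `(noisy, clean)` pair would serve as one input/target example for a denoising network. Varying the Gabor parameters and the target SNR gives a simple form of data augmentation for a pre-training stage.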
Pages: 938-947
Page count: 10