Denoising odontocete echolocation clicks using a hybrid model with convolutional neural network and long short-term memory network

Cited by: 3
Authors
Yang, Wuyi [1]
Chang, Wenlei [1]
Song, Zhongchang [1, 3]
Niu, Fuqiang [2, 4]
Wang, Xianyan [2]
Zhang, Yu [1]
Affiliations
[1] Xiamen Univ, Minist Educ, Coll Ocean & Earth Sci, Key Lab Underwater Acoust Commun & Marine Informat, Xiamen, Peoples R China
[2] Minist Nat Resources, Inst Oceanog 3, Lab Marine Biol & Ecol, Xiamen, Peoples R China
[3] Xiamen Univ, Coll Environm & Ecol, State Key Lab Marine Environm Sci, Xiamen, Peoples R China
[4] Xiamen Ocean Vocat Coll, Xiamen, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
NEOPHOCAENA-PHOCAENOIDES-SUNAMERI; ACOUSTIC LOCALIZATION; CLASSIFICATION; BEHAVIOR; SOUNDS; GULF;
DOI
10.1121/10.0020560
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206 ; 082403 ;
Abstract
Ocean noise degrades recordings of odontocete echolocation clicks. In this study, a hybrid model based on the convolutional neural network (CNN) and the long short-term memory (LSTM) network, called a hybrid CNN-LSTM model, was proposed to denoise echolocation clicks. To learn the model parameters, the echolocation clicks were partially corrupted by adding ocean noise, and the model was trained to recover the original echolocation clicks. It can be difficult to collect large numbers of echolocation clicks free of ambient sea noise for training networks, so data augmentation and transfer learning were employed to address this problem. Simulated echolocation clicks were generated from Gabor functions to pre-train the network models, and the network parameters were then fine-tuned using odontocete echolocation clicks. Finally, the performance of the proposed model was evaluated using synthetic data. The experimental results demonstrated the effectiveness of the proposed model for denoising two typical types of echolocation clicks, namely narrowband high-frequency and broadband echolocation clicks. The denoising performance of hybrid models with different numbers of convolutional and LSTM layers was also evaluated. Based on this evaluation, hybrid models with one convolutional layer and multiple LSTM layers are recommended, as they can be adopted for denoising both types of echolocation clicks. © 2023 Acoustical Society of America.
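The training-data construction described in the abstract (a Gabor-function click corrupted by additive noise) can be illustrated with a minimal NumPy sketch. This is not the authors' code: the function names `gabor_click` and `mix_at_snr`, the sampling rate, centre frequency, envelope width, and the use of white Gaussian noise as a stand-in for recorded ocean noise are all assumptions chosen for illustration.

```python
import numpy as np


def gabor_click(fs, n_samples, fc, sigma, phase=0.0):
    """Gaussian-windowed cosine (a Gabor function) centred in the window.

    fs: sampling rate in Hz, fc: centre frequency in Hz,
    sigma: standard deviation of the Gaussian envelope in seconds.
    """
    t = np.arange(n_samples) / fs
    t0 = t[n_samples // 2]  # place the envelope peak on a sample
    envelope = np.exp(-((t - t0) ** 2) / (2.0 * sigma ** 2))
    return envelope * np.cos(2.0 * np.pi * fc * (t - t0) + phase)


def mix_at_snr(clean, noise, snr_db):
    """Scale `noise` so the clean-to-noise power ratio is snr_db, then add."""
    p_clean = np.mean(clean ** 2)
    p_noise = np.mean(noise ** 2)
    scale = np.sqrt(p_clean / (p_noise * 10.0 ** (snr_db / 10.0)))
    return clean + scale * noise


# Illustrative values only (assumed, not taken from the paper).
fs = 500_000            # sampling rate in Hz
n_samples = 512         # samples per training example
rng = np.random.default_rng(0)

clean = gabor_click(fs, n_samples, fc=125_000, sigma=20e-6)
noise = rng.standard_normal(n_samples)  # white-noise stand-in for ocean noise
noisy = mix_at_snr(clean, noise, snr_db=0.0)

# (noisy, clean) would form one input/target pair for a denoising network.
measured_snr_db = 10.0 * np.log10(np.mean(clean ** 2) / np.mean((noisy - clean) ** 2))
```

Varying `fc`, `sigma`, `phase`, and `snr_db` across examples is one plausible way to produce a large simulated set for pre-training before fine-tuning on recorded clicks; the ranges actually used in the paper are not given in this record.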
Pages: 938-947
Number of pages: 10