A transfer deep residual shrinkage network for bird sound recognition

被引:0
作者
Chen, Xiao [1 ,2 ]
Zeng, Zhaoyou [1 ]
Xu, Tong [1 ]
机构
[1] Nanjing Univ Informat Sci Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Peoples R China
来源
ELECTRONIC RESEARCH ARCHIVE | 2025年 / 33卷 / 07期
关键词
bioacoustics; bird sound recognition; audio signal processing; machine learning; deep learning; transfer learning; deep residual shrinkage network; ecological monitoring; FRACTIONAL DERIVATIVE METHOD; LAMB WAVE SIGNALS; NOISE;
D O I
10.3934/era.2025185
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Bird sound recognition has important applications in bird monitoring and ecological protection. However, in complicated environments, noise and insufficient sample data are the major factors affecting recognition accuracy. We proposed a bird sound recognition method based on a developed transfer deep residual shrinkage network. First, a deep residual shrinkage network with noise resistance was constructed based on the structural characteristics of the residual shrinkage module, multi-scale operations, and the characteristics of bird sound Mel spectrograms. Then, the deep residual shrinkage network was pre-trained using a bird sound dataset, applying an unfreezing finetuning strategy, to mitigate the impact of insufficient training data. A transfer learning alleviated the problem of data scarcity by utilizing pre-trained models, while the deep residual shrinkage network enhanced the performance of the model in a noisy environment by optimizing the network structure. Experimental results showed that this method achieves high recognition accuracy under noise and small data sets. It has advantages over the compared methods and is suitable for ecological monitoring fields such as bird population monitoring. The method has good application prospects.
引用
收藏
页码:4135 / 4150
页数:16
相关论文
共 27 条
[1]   Automated Bird Species Identification using Audio Signal Processing and Neural Networks [J].
Chandu, B. ;
Munikoti, Akash ;
Murthy, Karthik S. ;
Murthy, Ganesh, V ;
Nagaraj, Chaitra .
2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2020,
[2]  
[陈莎莎 Chen Shasha], 2014, [计算机应用与软件, Computer Applications and Software], V31, P154
[3]   Video super resolution based on deformable 3D convolutional group fusion [J].
Chen, Xiao ;
Jing, Ruyun .
SCIENTIFIC REPORTS, 2025, 15 (01)
[4]   Bird sound recognition based on adaptive frequency cepstral coefficient and improved support vector machine using a hunter-prey optimizer [J].
Chen, Xiao ;
Zeng, Zhaoyou .
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (11) :19438-19453
[5]   Land scene classification for remote sensing images with an improved capsule network [J].
Chen, Xiao ;
Zhu, Juan .
JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (02)
[6]   Attention mechanism feedback network for image super-resolution [J].
Chen, Xiao ;
Jing, Ruyun ;
Suna, Chaowen .
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
[7]   Multiscale Recursive Feedback Network for Image Super-Resolution [J].
Chen, Xiao ;
Sun, Chaowen .
IEEE ACCESS, 2022, 10 :6393-6406
[8]   Effect of Transducer Shadowing of Ultrasonic Anemometers on Wind Velocity Measurement [J].
Chen, Xiao ;
Zhan, Wei .
IEEE SENSORS JOURNAL, 2021, 21 (04) :4731-4738
[9]  
Chen X, 2015, J VIBROENG, V17, P4211
[10]   Tsallis Distribution-Based Fractional Derivative Method for Lamb Wave Signal Recovery [J].
Chen, Xiao ;
Wang, Chenlong .
RESEARCH IN NONDESTRUCTIVE EVALUATION, 2015, 26 (03) :174-188