Automatic rain and cicada chorus filtering of bird acoustic data

Cited by: 30
Authors
Brown, Alexander [1]
Garg, Saurabh [1]
Montgomery, James [1]
Affiliations
[1] Univ Tasmania, Sch Technol Environm & Design, Hobart, Tas, Australia
Keywords
Bioacoustics; Acoustic indices; Noise filtering; Sound classification; Environmental sound recognition; Identification; Recordings; Transform; Noise
DOI
10.1016/j.asoc.2019.105501
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recording and analysing environmental audio has become a common approach to monitoring the environment. It has several advantages over other approaches, such as reducing costs by avoiding the need for experts to be present in the area of interest. A current problem in analysing environmental recordings is interference from noise that can mask vocalisations of interest, which makes detecting these vocalisations more difficult and can require additional resources. While some work has been done to remove stationary noise from environmental recordings, there has been little effort to remove noise from non-stationary sources, such as rain, wind, engines, and animal vocalisations that are not of interest. This work addresses the challenge of filtering noise from rain and cicada choruses out of recordings containing bird sound. The use of acoustic indices and Mel Frequency Cepstral Coefficients (MFCCs) with machine learning classifiers is investigated to find the most effective filters. Hyperparameters for several classification approaches are investigated to fine-tune models for the best results. The approach enables users to set thresholds that increase or decrease the sensitivity of classification, based on the prediction probability output by the classifiers. A novel approach to removing cicada choruses using bandpass filters is also proposed. A threshold-based approach (Multi-Layer Perceptron with acoustic indices and MFCCs) for rain detection is derived, which achieves an AUC of 0.9911 and is more accurate than existing approaches when set to the same sensitivities. Cicada choruses in the training set are classified with 100% accuracy under 10-fold cross-validation by a Support Vector Machine (SVM) classifier with MFCCs. The cicada filtering approach greatly increased the median signal-to-noise ratio of affected recordings, from 0.53 for unfiltered audio to 1.86 for audio filtered by both the cicada filter and a common stationary noise filter. © 2019 Elsevier B.V. All rights reserved.
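The abstract's idea of letting users tune sensitivity through a threshold on the classifier's prediction probability can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `flag_rain` and `sensitivity_specificity`, the probabilities, and the labels are all hypothetical and chosen only for illustration.

```python
from typing import List, Tuple


def flag_rain(probabilities: List[float], threshold: float = 0.5) -> List[bool]:
    """Flag a clip as rain when its predicted rain probability meets the threshold."""
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must be in [0, 1]")
    return [p >= threshold for p in probabilities]


def sensitivity_specificity(flags: List[bool], labels: List[bool]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) of the flags against ground-truth labels."""
    true_pos = sum(1 for f, l in zip(flags, labels) if f and l)
    true_neg = sum(1 for f, l in zip(flags, labels) if not f and not l)
    positives = sum(1 for l in labels if l)
    negatives = len(labels) - positives
    return true_pos / positives, true_neg / negatives


# Hypothetical classifier outputs (probability that each clip contains rain)
# and hypothetical ground truth; neither comes from the paper.
probs = [0.95, 0.80, 0.40, 0.65, 0.10, 0.30]
labels = [True, True, True, False, False, False]

# A lower threshold flags more clips (higher sensitivity, lower specificity);
# a higher threshold does the opposite.
for threshold in (0.3, 0.5, 0.7):
    sens, spec = sensitivity_specificity(flag_rain(probs, threshold), labels)
    print(f"threshold={threshold:.1f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

In practice the probabilities would come from a trained classifier rather than a hand-written list, and the threshold would be chosen to match the sensitivity a given monitoring task needs.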
Pages: 15