Using multi-stream hierarchical deep neural network to extract deep audio feature for acoustic event detection

被引:0
作者
Yanxiong Li
Xue Zhang
Hai Jin
Xianku Li
Qin Wang
Qianhua He
Qian Huang
机构
[1] South China University of Technology,School of Electronic and Information Engineering
来源
Multimedia Tools and Applications | 2018年 / 77卷
关键词
Audio feature; Acoustic event; Deep neural network;
D O I
暂无
中图分类号
学科分类号
摘要
Extraction of effective audio features from acoustic events definitely influences the performance of Acoustic Event Detection (AED) system, especially in adverse audio situations. In this study, we propose a framework for extracting Deep Audio Feature (DAF) using multi-stream hierarchical Deep Neural Network (DNN). The DAF outputted from the proposed framework fuses the potential complementary information of multiple input feature streams and thus could be more discriminative than those input features for AED. We take two input feature streams and the hierarchical DNNs with two stages as an example for showing the extraction of DAF. The effectiveness of different audio features for AED is evaluated on two audio corpora, i.e. BBC (British Broadcasting Corporation) audio dataset and TV audio dataset with different signal-to-noise ratios. Experimental results show that DAF outperforms other features for AED under several experimental conditions.
引用
收藏
页码:897 / 916
页数:19
相关论文
共 14 条
[1]  
Hinton GE(2006)A fast learning algorithm for deep belief nets Neural Comput 18 1527-1554
[2]  
Osindero S(2003)Gabor analysis of auditory midbrain receptive fields: spectro-temporal and binaural composition J Neurophysiol 90 456-476
[3]  
Teh YW(2012)Spectro-temporal modulation subspace-spanning filter bank features for robust automatic speech recognition J Acoust Soc Am 131 4134-4151
[4]  
Qiu A(2015)Magnetic resonance brain image classification based on weighted-type fractional Fourier transform and nonparallel support vector machine Int J Imaging Syst Technol 25 317-327
[5]  
Schreiner C(undefined)undefined undefined undefined undefined-undefined
[6]  
Escabi M(undefined)undefined undefined undefined undefined-undefined
[7]  
Schädler MR(undefined)undefined undefined undefined undefined-undefined
[8]  
Meyer BT(undefined)undefined undefined undefined undefined-undefined
[9]  
Kollmeier B(undefined)undefined undefined undefined undefined-undefined
[10]  
Zhang Y(undefined)undefined undefined undefined undefined-undefined