A NOVEL APPROACH FOR AUTOMATIC ACOUSTIC NOVELTY DETECTION USING A DENOISING AUTOENCODER WITH BIDIRECTIONAL LSTM NEURAL NETWORKS

Cited by: 0
Authors
Marchi, Erik [1]
Vesperini, Fabio [2]
Eyben, Florian [1]
Squartini, Stefano [2]
Schuller, Bjoern [1, 3, 4]
Affiliations
[1] Tech Univ Munich, Machine Intelligence & Signal Proc Grp, Munich, Germany
[2] Univ Politecn Marche, A3LAB, Dept Informat Engn, Ancona, Italy
[3] Univ Passau, Chair Complex & Intelligent Syst, Passau, Germany
[4] Imperial Coll London, Dept Comp, London, England
Source
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015
Keywords
Acoustic Novelty Detection; Denoising Autoencoder; Bidirectional LSTM; Recurrent Neural Networks; Classification
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Acoustic novelty detection aims at identifying abnormal or novel acoustic signals that differ from the reference (normal) data on which the system was trained. In this paper we present a novel unsupervised approach based on a denoising autoencoder. In our approach, auditory spectral features are processed by a denoising autoencoder with bidirectional Long Short-Term Memory recurrent neural networks. We use the reconstruction error between the input and the output of the autoencoder as an activation signal to detect novel events. The autoencoder is trained on a public database that contains recordings of typical in-home situations such as talking, watching television, playing, and eating. The evaluation was performed on more than 260 different abnormal events. We compare our results with state-of-the-art methods and conclude that our approach significantly outperforms existing methods, achieving up to 93.4% F-Measure.
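
The detection step described in the abstract (using the reconstruction error as an activation signal) might be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the autoencoder output is replaced by hand-written toy values, and the function names, the Euclidean distance, and the threshold rule (a multiple `beta` of the median error) are all hypothetical choices that this record does not specify.

```python
import math
from statistics import median


def reconstruction_error(inputs, outputs):
    """Per-frame Euclidean distance between input and reconstructed features.

    The distance measure is an assumption; the record only says that the
    error between autoencoder input and output is used.
    """
    return [
        math.sqrt(sum((x - y) ** 2 for x, y in zip(f_in, f_out)))
        for f_in, f_out in zip(inputs, outputs)
    ]


def detect_novelty(errors, beta=2.0):
    """Flag frames whose error exceeds beta * median(errors).

    Both the rule and the default beta are assumed for illustration.
    """
    threshold = beta * median(errors)
    return [e > threshold for e in errors]


def f_measure(predicted, reference):
    """Harmonic mean of precision and recall over boolean frame labels."""
    tp = sum(p and r for p, r in zip(predicted, reference))
    fp = sum(p and not r for p, r in zip(predicted, reference))
    fn = sum(r and not p for p, r in zip(predicted, reference))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy feature frames (hypothetical values, two features per frame).
frames = [[0.1, 0.2], [0.1, 0.3], [0.2, 0.2], [0.9, 0.8], [0.1, 0.2]]
# Stand-in for the autoencoder output: frame 3 is poorly reconstructed,
# as one would expect for a sound unlike the training data.
reconstructed = [[0.1, 0.2], [0.1, 0.2], [0.2, 0.2], [0.2, 0.2], [0.1, 0.1]]
reference = [False, False, False, True, False]

errors = reconstruction_error(frames, reconstructed)
flags = detect_novelty(errors)
print(flags, f_measure(flags, reference))
```

In a real system the `reconstructed` frames would come from a trained network, and the threshold would need tuning on held-out data.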
Pages: 1996-2000
Number of pages: 5