Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection

被引:49
|
作者
Marchi, Erik [1 ,2 ,3 ]
Vesperini, Fabio [4 ]
Squartini, Stefano [4 ]
Schuller, Bjoern [2 ,3 ,5 ]
机构
[1] Tech Univ Munich, Machine Intelligence & Signal Proc Grp, Munich, Germany
[2] audEERING GmbH, Gilching, Germany
[3] Univ Passau, Chair Complex & Intelligent Syst, Passau, Germany
[4] Univ Politecn Marche, Dept Informat Engn, A3LAB, Ancona, Italy
[5] Imperial Coll London, Dept Comp, London, England
关键词
CLASSIFICATION; RECOGNITION; LSTM;
D O I
10.1155/2017/4694860
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the emerging field of acoustic novelty detection, most research efforts are devoted to probabilistic approaches such as mixture models or state-space models. Only recent studies introduced (pseudo-) generative models for acoustic novelty detection with recurrent neural networks in the form of an autoencoder. In these approaches, auditory spectral features of the next short term frame are predicted from the previous frames by means of Long-Short Term Memory recurrent denoising autoencoders. The reconstruction error between the input and the output of the autoencoder is used as activation signal to detect novel events. There is no evidence of studies focused on comparing previous efforts to automatically recognize novel events from audio signals and giving a broad and in depth evaluation of recurrent neural network-based autoencoders. The present contribution aims to consistently evaluate our recent novel approaches to fill this white spot in the literature and provide insight by extensive evaluations carried out on three databases: A3Novelty, PASCAL CHiME, and PROMETHEUS. Besides providing an extensive analysis of novel and state-of-the-art methods, the article shows how RNN-based autoencoders outperform statistical approaches up to an absolute improvement of 16.4% average F-measure over the three databases.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A Survey: Neural Network-Based Deep Learning for Acoustic Event Detection
    Xia, Xianjun
    Togneri, Roberto
    Sohel, Ferdous
    Zhao, Yuanjun
    Huang, Defeng
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (08) : 3433 - 3453
  • [2] A Survey: Neural Network-Based Deep Learning for Acoustic Event Detection
    Xianjun Xia
    Roberto Togneri
    Ferdous Sohel
    Yuanjun Zhao
    Defeng Huang
    Circuits, Systems, and Signal Processing, 2019, 38 : 3433 - 3453
  • [3] Novelty detection for a neural network-based online adaptive system
    Liu, Y
    Cukic, B
    Fuller, E
    Gururajan, S
    Yerramalla, S
    Proceedings of the 29th Annual International Computer Software and Applications Conference, Workshops and Fast Abstracts, 2005, : 117 - 122
  • [4] Acoustic Novelty Detection with Adversarial Autoencoders
    Principi, Emanuele
    Vesperini, Fabio
    Squartini, Stefano
    Piazza, Francesco
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3324 - 3330
  • [5] Deep Recurrent Neural Network-Based Identification of Precursor microRNAs
    Park, Seunghyun
    Min, Seonwoo
    Choi, Hyun-Soo
    Yoon, Sungroh
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [6] FEEDBACK CONNECTION FOR DEEP NEURAL NETWORK-BASED ACOUSTIC MODELING
    Tran, Dung T.
    Delcroix, Marc
    Ogawa, Atsunori
    Huemmer, Christian
    Nakatani, Tomohiro
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5240 - 5244
  • [7] DISCRIMINATIVE ACOUSTIC WORD EMBEDDINGS: RECURRENT NEURAL NETWORK-BASED APPROACHES
    Settle, Shane
    Livescu, Karen
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 503 - 510
  • [8] A Recurrent Neural Network-based Malicious Code Detection Technology
    Tang, Yongwang
    Liu, Xin
    Jin, Yanqing
    Wei, Han
    Deng, Qizheng
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 1737 - 1742
  • [9] Deep Gaussian Process autoencoders for novelty detection
    Rémi Domingues
    Pietro Michiardi
    Jihane Zouaoui
    Maurizio Filippone
    Machine Learning, 2018, 107 : 1363 - 1383
  • [10] RECURRENT NEURAL NETWORKS WITH STOCHASTIC LAYERS FOR ACOUSTIC NOVELTY DETECTION
    Duong Nguyen
    Kirsebom, Oliver S.
    Frazao, Fabio
    Fablet, Ronan
    Matwin, Stan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 765 - 769