Hierarchic ConvNets Framework for Rare Sound Event Detection

被引:0
作者
Vesperini, Fabio [1 ]
Droghini, Diego [1 ]
Principi, Emanuele [1 ]
Gabrielli, Leonardo [1 ]
Squartini, Stefano [1 ]
机构
[1] Univ Politecn Marche, Dept Informat Engn, Ancona, Italy
来源
2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2018年
关键词
Convolutional Neural Network; Sound Event Detection; DCASE2017; Linear Prediction; Discrete Wavelet Transform;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a system for rare sound event detection using a hierarchical and multi-scaled approach based on Convolutional Neural Networks (CNN). The task consists on detection of event onsets from artificially generated mixtures. Spectral features are extracted from frames of the acoustic signals, then a first event detection stage operates as binary classifier at frame-rate and it proposes to the second stage contiguous blocks of frames which are assumed to contain a sound event. The second stage refines the event detection of the prior network, discarding blocks that contain background sounds wrongly classified by the first stage. Finally, the effective onset time of the active event is obtained. The performance of the algorithm has been assessed with the material provided for the second task of the IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE) 2017. The achieved overall error rate and F-measure, resulting respectively equal to 0.22 and 88.50% on the evaluation dataset, significantly outperforms the challenge baseline and the system guarantees improved generalization performance with a reduced number of free network parameters w.r.t. other competitive algorithms.
引用
收藏
页码:1497 / 1501
页数:5
相关论文
共 27 条
  • [1] [Anonymous], 2012, CoRR
  • [2] [Anonymous], TECH REP
  • [3] [Anonymous], PROC 39TH IEEE INTER
  • [4] [Anonymous], 2017, arXiv preprint arXiv:1708.03211
  • [5] Bergstra J, 2012, J MACH LEARN RES, V13, P281
  • [6] Cakir E., 2017, P DET CLASS AC SCEN, P27
  • [7] Chollet F., 2015, about us
  • [8] Clavel C, 2005, 2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, P1307
  • [9] Droghini D., 2017, P WIRN VIETR SUL MAR
  • [10] Reliable detection of audio events in highly noisy environments
    Foggia, Pasquale
    Petkov, Nicolai
    Saggese, Alessia
    Strisciuglio, Nicola
    Vento, Mario
    [J]. PATTERN RECOGNITION LETTERS, 2015, 65 : 22 - 28