Acoustic Features for Deep Learning-Based Models for Emergency Siren Detection: an Evaluation Study

被引:8
作者
Cantarini, Michela [1 ]
Brocanelli, Anna [1 ]
Gabrielli, Leonardo [1 ]
Squartini, Stefano [1 ]
机构
[1] Univ Politecn Marche, Dept Informat Engn, Ancona, Italy
来源
PROCEEDINGS OF THE 12TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2021) | 2021年
关键词
Emergency Siren Detection; Deep Learning; Acoustic Features;
D O I
10.1109/ISPA52656.2021.9552140
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emergency Siren Detection is a topic of great importance for road safety. Nowadays, the design of cars with every comfort has improved the quality of driving, but distractions have also increased. Hence the usefulness of implementing an Emergency Vehicle Detection System: if installed inside the car, it alerts the driver of its approach, and if installed outdoors in strategic locations, it automatically activates reserved lanes. In this paper, we perform Emergency Siren Detection with a Convolutional Neural Network-based deep learning model. We investigate acoustic features to propose a low computational cost algorithm. We employ Short-Time Fourier Transform spectrograms as features and improve the classification performance by applying a harmonic percussive source separation technique. The enhancement of the harmonic components of the spectrograms gives better results than more computationally complex features. We also demonstrate the relevance of the siren harmonic contents in the classification task. The reduction of the network hyperparameters decreases the computational load of the algorithm and facilitates its implementation in real-time embedded systems.
引用
收藏
页码:47 / 53
页数:7
相关论文
共 31 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] [Anonymous], 2009, GAMMATONE LIKE SPECT
  • [3] An automatic emergency signal recognition system for the hearing impaired
    Beritelli, F.
    Casale, S.
    Russo, A.
    Serrano, S.
    [J]. 2006 IEEE 12TH DIGITAL SIGNAL PROCESSING WORKSHOP & 4TH IEEE SIGNAL PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, 2006, : 179 - 182
  • [4] Buitinck L., 2013, ECML PKDD WORKSHOP L, P108
  • [5] Cantarini Michela, 2020, Intelligent Computing Theories and Application. 16th International Conference, ICIC 2020. Proceedings. Lecture Notes in Computer Science (LNCS 12463), P207, DOI 10.1007/978-3-030-60799-9_18
  • [6] Carmel D, 2017, EUR SIGNAL PR CONF, P1839, DOI 10.23919/EUSIPCO.2017.8081527
  • [7] Towards a system for automatic traffic sound event detection
    Chavdar, Marko
    Gerazov, Branislav
    Ivanovski, Zoran
    Kartalov, Tomislav
    [J]. 2020 28TH TELECOMMUNICATIONS FORUM (TELFOR), 2020, : 209 - 212
  • [8] Chollet F., 2018, Astrophysics Source Code Library ascl:1806.022
  • [9] Driedger J., 2014, P 15 C INT SOC MUS I, P611, DOI DOI 10.5281/ZENODO.1415226
  • [10] Ebizuka Y, 2019, IEEE INT C INTELL TR, P4431, DOI 10.1109/ITSC.2019.8917028