Acoustic Features for Deep Learning-Based Models for Emergency Siren Detection: an Evaluation Study

被引:8
作者
Cantarini, Michela [1 ]
Brocanelli, Anna [1 ]
Gabrielli, Leonardo [1 ]
Squartini, Stefano [1 ]
机构
[1] Univ Politecn Marche, Dept Informat Engn, Ancona, Italy
来源
PROCEEDINGS OF THE 12TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2021) | 2021年
关键词
Emergency Siren Detection; Deep Learning; Acoustic Features;
D O I
10.1109/ISPA52656.2021.9552140
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emergency Siren Detection is a topic of great importance for road safety. Nowadays, the design of cars with every comfort has improved the quality of driving, but distractions have also increased. Hence the usefulness of implementing an Emergency Vehicle Detection System: if installed inside the car, it alerts the driver of its approach, and if installed outdoors in strategic locations, it automatically activates reserved lanes. In this paper, we perform Emergency Siren Detection with a Convolutional Neural Network-based deep learning model. We investigate acoustic features to propose a low computational cost algorithm. We employ Short-Time Fourier Transform spectrograms as features and improve the classification performance by applying a harmonic percussive source separation technique. The enhancement of the harmonic components of the spectrograms gives better results than more computationally complex features. We also demonstrate the relevance of the siren harmonic contents in the classification task. The reduction of the network hyperparameters decreases the computational load of the algorithm and facilitates its implementation in real-time embedded systems.
引用
收藏
页码:47 / 53
页数:7
相关论文
共 31 条
  • [11] Fatimah B., 2020, P 2020 11 INT C COMP, P1
  • [12] Fitzgerald D., 2010, P INT C DIGITAL AUDI, V13
  • [13] Font F., 2013, P 21 ACM INT C MULT, P411, DOI DOI 10.1145/2502081.2502245
  • [14] Kingma DP, 2014, ADV NEUR IN, V27
  • [15] Kiran SL, 2017, PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES FOR SMART NATION (SMARTTECHCON), P127, DOI 10.1109/SmartTechCon.2017.8358355
  • [16] levine s. n., 1998, J AUDIO ENG SOC
  • [17] Recognition of the Ambulance Siren Sound in Taiwan by the Longest Common Subsequence
    Liaw, Jiun-Jian
    Wang, Wen-Shen
    Chu, Hung-Chi
    Huang, Meng-Sian
    Lu, Chuan-Pin
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 3825 - 3828
  • [18] Marchegiani L., 2018, arXiv
  • [19] McFee B., 2015, P PYTH SCI C, P18, DOI [10.25080/Majora-7b98e3ed-003, 10. 25080/Majora-7b98e3ed-003]
  • [20] Meucci F., 2008, 16 EUR SIGN PROC C, P1