Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments

被引:5
作者
Hajihashemi, Vahid [1 ]
Alavigharahbagh, Abdorreza [1 ]
Oliveira, Hugo S. [1 ]
Cruz, Pedro Miguel [2 ]
Tavares, Joao Manuel R. S. [3 ]
机构
[1] Univ Porto, Fac Engn, Porto, Portugal
[2] Bosch Secur Syst SA, Ovar, Portugal
[3] Univ Porto, Dept Engn Mecan, Fac Engn, Porto, Portugal
来源
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2021 | 2021年 / 12702卷
关键词
Signal processing; Wavelet transform; Machine learning; Event detection;
D O I
10.1007/978-3-030-93420-0_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Usually, Sound event detection systems that classify different events from sound data have two main blocks. In the first block, sound events are separated from sound background and in next block, different events are classified. In recent years, this research area has become increasingly popular in a wide range of applications, such as in surveillance and city patterns learning and recognition, mainly when combined with imaging sensors. However, it still poses challenging problems due to existent noise, complexity of the events, poor microphone(s) quality, bad microphone location(s), or events occurring simultaneously. This research aimed to compare accurate signal processing and classification methods to suggest a novel method for detecting sound events from sound background in urban scenes. Using wavelet and Mel-frequency cepstral coefficients, the analysis of the effect of classification methods and minimization of the number of train data are some of the advantages of the proposed method. The proposed methods' application to a standard sounds database led to an accuracy of about 99% in event detection.
引用
收藏
页码:402 / 416
页数:15
相关论文
共 39 条
  • [1] [Anonymous], 2016, P DETECTION CLASSIFI
  • [2] Atrey P.K., 2006, 2006 IEEE INT C ACOU, V5
  • [3] Decoding speech in the presence of other sources
    Barker, JP
    Cooke, MP
    Ellis, DPW
    [J]. SPEECH COMMUNICATION, 2005, 45 (01) : 5 - 25
  • [4] Soundscapes and environmental noise management
    Brown, A. L.
    [J]. NOISE CONTROL ENGINEERING JOURNAL, 2010, 58 (05) : 493 - 500
  • [5] Cotton CV, 2011, 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), P69, DOI 10.1109/ASPAA.2011.6082331
  • [6] Crockett B.G., 2019, [No title captured], Patent No. [10,523,169, 10523169]
  • [7] Derakhshan M, 2019, Tabriz J. Electr. Eng., V49, P565
  • [8] Hidden Markov models
    Eddy, SR
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) : 361 - 365
  • [9] VITERBI ALGORITHM
    FORNEY, GD
    [J]. PROCEEDINGS OF THE IEEE, 1973, 61 (03) : 268 - 278
  • [10] Gelfand S. A., 2017, Hearing: An introduction to psychological and physiological acoustics