Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments

被引:5
|
作者
Hajihashemi, Vahid [1 ]
Alavigharahbagh, Abdorreza [1 ]
Oliveira, Hugo S. [1 ]
Cruz, Pedro Miguel [2 ]
Tavares, Joao Manuel R. S. [3 ]
机构
[1] Univ Porto, Fac Engn, Porto, Portugal
[2] Bosch Secur Syst SA, Ovar, Portugal
[3] Univ Porto, Dept Engn Mecan, Fac Engn, Porto, Portugal
关键词
Signal processing; Wavelet transform; Machine learning; Event detection;
D O I
10.1007/978-3-030-93420-0_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Usually, Sound event detection systems that classify different events from sound data have two main blocks. In the first block, sound events are separated from sound background and in next block, different events are classified. In recent years, this research area has become increasingly popular in a wide range of applications, such as in surveillance and city patterns learning and recognition, mainly when combined with imaging sensors. However, it still poses challenging problems due to existent noise, complexity of the events, poor microphone(s) quality, bad microphone location(s), or events occurring simultaneously. This research aimed to compare accurate signal processing and classification methods to suggest a novel method for detecting sound events from sound background in urban scenes. Using wavelet and Mel-frequency cepstral coefficients, the analysis of the effect of classification methods and minimization of the number of train data are some of the advantages of the proposed method. The proposed methods' application to a standard sounds database led to an accuracy of about 99% in event detection.
引用
收藏
页码:402 / 416
页数:15
相关论文
共 50 条
  • [1] Ecological Sound Events Classification Based on Time-Frequency Features
    Ming, Li
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2016, : 345 - 348
  • [2] Environmental Sound Recognition With Time-Frequency Audio Features
    Chu, Selina
    Narayanan, Shrikanth
    Kuo, C. -C. Jay
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (06): : 1142 - 1158
  • [3] Background Sound Classification in Speech Audio Segments
    Singh, Janvijay
    Joshi, Raviraj
    2019 10TH INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2019,
  • [4] A novel spread spectrum audio watermarking scheme based on time-frequency characteristics
    Esmaili, S
    Krishnan, S
    Raahemifar, K
    CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 1963 - 1966
  • [5] Time-frequency analysis of heart sound based on HHT
    Zhao, ZD
    Zhao, ZJ
    Chen, YQ
    2005 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, VOLS 1 AND 2, PROCEEDINGS: VOL 1: COMMUNICATION THEORY AND SYSTEMS, 2005, : 926 - 929
  • [6] Environmental Sound Classification based on Time-frequency Representation
    Thwe, Khine Zar
    War, Nu
    2017 18TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNDP 2017), 2017, : 251 - 255
  • [7] Time-frequency multipliers for sound synthesis
    Depalle, Philipe
    Kronland-Martinet, Richard
    Torresani, Bruno
    WAVELETS XII, PTS 1 AND 2, 2007, 6701
  • [8] Time-frequency analysis of the heart sound
    Shengwu Yixue Gongchengxue Zazhi, 1 (41-46):
  • [9] Heart sound cancellation from lung sound recordings using time-frequency filtering
    Pourazad, M. T.
    Moussavi, Z.
    Thomas, G.
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2006, 44 (03) : 216 - 225
  • [10] Heart sound cancellation from lung sound recordings using time-frequency filtering
    M. T. Pourazad
    Z. Moussavi
    G. Thomas
    Medical and Biological Engineering and Computing, 2006, 44 : 216 - 225