Frame-wise dynamic threshold based polyphonic acoustic event detection

被引:8
作者
Xia, Xianjun [1 ]
Togneri, Roberto [1 ]
Sohel, Ferdous [2 ]
Huang, David [1 ]
机构
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Nedlands, WA, Australia
[2] Murdoch Univ, Sch Engn & Informat Technol, Murdoch, WA, Australia
来源
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年
关键词
acoustic event detection; multi-label classification; dynamic threshold; NEURAL-NETWORKS;
D O I
10.21437/Interspeech.2017-746
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been widely applied in many real-world applications. Many works adopt multi-label classification techniques to perform the polyphonic acoustic event detection with a global threshold to detect the active acoustic events. However, the global threshold has to be set manually and is highly dependent on the database being tested. To deal with this, we replaced the fixed threshold method with a frame-wise dynamic threshold approach in this paper. Two novel approaches, namely contour and regressor based dynamic threshold approaches are proposed in this work. Experimental results on the popular TUT Acoustic Scenes 2016 database of polyphonic events demonstrated the superior performance of the proposed approaches.
引用
收藏
页码:474 / 478
页数:5
相关论文
共 43 条
  • [11] Feature extraction strategies in deep learning based acoustic event detection
    Espi, Miguel
    Fujimoto, Masakiyo
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2922 - 2926
  • [12] Global statistical features-based approach for Acoustic Event Detection
    Jayalakshmi, S. L.
    Chandrakala, S.
    Nedunchelian, R.
    [J]. APPLIED ACOUSTICS, 2018, 139 : 113 - 118
  • [13] Prediction of the acoustic comfort of a dwelling based on automatic sound event detection
    Bonet-Sola, Daniel
    Vidana-Vila, Ester
    Alsina-Pages, Rosa Ma
    [J]. NOISE MAPPING, 2023, 10 (01)
  • [14] RANDOM FOREST REGRESSION BASED ACOUSTIC EVENT DETECTION WITH BOTTLENECK FEATURES
    Xia, Xianjun
    Togneri, Roberto
    Sohel, Ferdous
    Huang, David
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 157 - 162
  • [15] SPARSE REPRESENTATION BASED ON A BAG OF SPECTRAL EXEMPLARS FOR ACOUSTIC EVENT DETECTION
    Lu, Xugang
    Tsao, Yu
    Matsuda, Shigeki
    Hori, Chiori
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [16] Dynamic Threshold Based Spectrum Detection in Cognitive Radio Systems
    Yu, Gui-cai
    Shao, Yu-bin
    Long, Hua
    Yue, Guang-xin
    [J]. 2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 1447 - +
  • [17] Improvement of shot detection methods based on dynamic threshold selection
    Ardebilian, M
    Tu, XW
    Chen, LM
    [J]. MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II, 1997, 3229 : 14 - 22
  • [18] A Statistical Threat Detection Method Based on Dynamic Time Threshold
    Tian, Jian-Wei
    Qiao, Hong
    Li, Xi
    Tian, Zhen
    [J]. 2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1087 - 1090
  • [19] SEMI-SUPERVISED ACOUSTIC EVENT DETECTION BASED ON TRI-TRAINING
    Shi, Bowen
    Sun, Ming
    Kao, Chieh-Chi
    Rozgic, Viktor
    Matsoukas, Spyros
    Wang, Chao
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 750 - 754
  • [20] A Survey: Neural Network-Based Deep Learning for Acoustic Event Detection
    Xianjun Xia
    Roberto Togneri
    Ferdous Sohel
    Yuanjun Zhao
    Defeng Huang
    [J]. Circuits, Systems, and Signal Processing, 2019, 38 : 3433 - 3453