CONFIDENCE BASED ACOUSTIC EVENT DETECTION

被引:0
|
作者
Xia, Xianjun [1 ]
Togneri, Roberto [1 ]
Sohel, Ferdous [2 ]
Huang, David [1 ]
机构
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Nedlands, WA, Australia
[2] Murdoch Univ, Sch Engn & Informat Technol, Murdoch, WA, Australia
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年
关键词
acoustic event detection; multi-label classification; confidence; multi-variable regression;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been widely applied in many real-world applications. Many works adopt the multi-label classification technique to perform the polyphonic acoustic event detection with a global threshold to detect the active acoustic events. However, the manually labeled boundaries are error-prone and cannot always be accurate, especially when the frame length is too short to be accurately labeled by human annotators. To deal with this, a confidence is assigned to each frame and acoustic event detection is performed using a multi-variable regression approach in this paper. Experimental results on the latest TUT sound event 2017 database of polyphonic events demonstrate the superior performance of the proposed approach compared to the multi-label classification based AED method.
引用
收藏
页码:306 / 310
页数:5
相关论文
共 50 条
  • [21] On Learning Disentangled Representation for Acoustic Event Detection
    Gao, Lijian
    Mao, Qirong
    Dong, Ming
    Jing, Yu
    Chinnam, Ratna
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2006 - 2014
  • [22] Feature analysis and selection for acoustic event detection
    Zhuang, Xiaodan
    Zhou, Xi
    Huang, Thomas S.
    Hasegawa-Johnson, Mark
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 17 - 20
  • [23] Exploiting spectro-temporal locality in deep learning based acoustic event detection
    Miquel Espi
    Masakiyo Fujimoto
    Keisuke Kinoshita
    Tomohiro Nakatani
    EURASIP Journal on Audio, Speech, and Music Processing, 2015
  • [24] Exploiting spectro-temporal locality in deep learning based acoustic event detection
    Espi, Miquel
    Fujimoto, Masakiyo
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015,
  • [25] Acoustic event detection in meeting-room environments
    Temko, Andrey
    Nadeu, Climent
    PATTERN RECOGNITION LETTERS, 2009, 30 (14) : 1281 - 1288
  • [26] ACOUSTIC EVENT DETECTION FOR MULTIPLE OVERLAPPING SIMILAR SOURCES
    Stowell, Dan
    Clayton, David
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [27] Random Regression Forests for Acoustic Event Detection and Classification
    Huy Phan
    Maass, Marco
    Mazur, Radoslaw
    Mertins, Alfred
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (01) : 20 - 31
  • [28] EAR-TUKE: The Acoustic Event Detection System
    Lojka, Martin
    Pleva, Matus
    Kiktova, Eva
    Juhar, Jozef
    Cizmar, Anton
    MULTIMEDIA COMMUNICATIONS, SERVICES AND SECURITY, MCSS 2014, 2014, 429 : 137 - 148
  • [29] Temporal attentive pooling for acoustic event detection ocr
    Lu, Xugang
    Shen, Peng
    Li, Sheng
    Tsao, Yu
    Kawai, Hisashi
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1354 - 1357
  • [30] A DATABASE AND CHALLENGE FOR ACOUSTIC SCENE CLASSIFICATION AND EVENT DETECTION
    Giannoulis, Dimitrios
    Stowell, Dan
    Benetos, Emmanouil
    Rossignol, Mathias
    Lagrange, Mathieu
    Plumbley, Mark D.
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,