CONFIDENCE BASED ACOUSTIC EVENT DETECTION

被引：0

作者：

Xia, Xianjun ^{[1
]}

Togneri, Roberto ^{[1
]}

Sohel, Ferdous ^{[2
]}

Huang, David ^{[1
]}

机构：

[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Nedlands, WA, Australia

[2] Murdoch Univ, Sch Engn & Informat Technol, Murdoch, WA, Australia

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

关键词：

acoustic event detection; multi-label classification; confidence; multi-variable regression;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been widely applied in many real-world applications. Many works adopt the multi-label classification technique to perform the polyphonic acoustic event detection with a global threshold to detect the active acoustic events. However, the manually labeled boundaries are error-prone and cannot always be accurate, especially when the frame length is too short to be accurately labeled by human annotators. To deal with this, a confidence is assigned to each frame and acoustic event detection is performed using a multi-variable regression approach in this paper. Experimental results on the latest TUT sound event 2017 database of polyphonic events demonstrate the superior performance of the proposed approach compared to the multi-label classification based AED method.

引用

页码：306 / 310

页数：5

共 50 条

[21] On Learning Disentangled Representation for Acoustic Event Detection
Gao, Lijian
Mao, Qirong
Dong, Ming
Jing, Yu
Chinnam, Ratna
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2006 - 2014
[22] Feature analysis and selection for acoustic event detection
Zhuang, Xiaodan
Zhou, Xi
Huang, Thomas S.
Hasegawa-Johnson, Mark
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 17 - 20
[23] Exploiting spectro-temporal locality in deep learning based acoustic event detection
Miquel Espi
Masakiyo Fujimoto
Keisuke Kinoshita
Tomohiro Nakatani
EURASIP Journal on Audio, Speech, and Music Processing, 2015
[24] Exploiting spectro-temporal locality in deep learning based acoustic event detection
Espi, Miquel
Fujimoto, Masakiyo
Kinoshita, Keisuke
Nakatani, Tomohiro
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015,
[25] Acoustic event detection in meeting-room environments
Temko, Andrey
Nadeu, Climent
PATTERN RECOGNITION LETTERS, 2009, 30 (14) : 1281 - 1288
[26] ACOUSTIC EVENT DETECTION FOR MULTIPLE OVERLAPPING SIMILAR SOURCES
Stowell, Dan
Clayton, David
2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
[27] Random Regression Forests for Acoustic Event Detection and Classification
Huy Phan
Maass, Marco
Mazur, Radoslaw
Mertins, Alfred
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (01) : 20 - 31
[28] EAR-TUKE: The Acoustic Event Detection System
Lojka, Martin
Pleva, Matus
Kiktova, Eva
Juhar, Jozef
Cizmar, Anton
MULTIMEDIA COMMUNICATIONS, SERVICES AND SECURITY, MCSS 2014, 2014, 429 : 137 - 148
[29] Temporal attentive pooling for acoustic event detection ocr
Lu, Xugang
Shen, Peng
Li, Sheng
Tsao, Yu
Kawai, Hisashi
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1354 - 1357
[30] A DATABASE AND CHALLENGE FOR ACOUSTIC SCENE CLASSIFICATION AND EVENT DETECTION
Giannoulis, Dimitrios
Stowell, Dan
Benetos, Emmanouil
Rossignol, Mathias
Lagrange, Mathieu
Plumbley, Mark D.
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,

← 1 2 3 4 5 →