CROWDSOURCING STRONG LABELS FOR SOUND EVENT DETECTION

Cited by: 2

Authors:

Martin-Morato, Irene [1]

Harju, Manu [1]

Mesaros, Annamaria [1]

Affiliations:

[1] Tampere Univ, Comp Sci, Korkeakoulunkatu 7, Tampere 33720, Finland

Source:

2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2021

Funding:

Academy of Finland;

Keywords:

Strong labels; Sound event detection; Crowd-sourcing; Multi-annotator data;

DOI:

10.1109/WASPAA52581.2021.9632761

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Strong labels are a necessity for evaluation of sound event detection methods, but often scarcely available due to the high resources required by the annotation task. We present a method for estimating strong labels using crowdsourced weak labels, through a process that divides the annotation task into simple unit tasks. Based on estimations of annotators' competence, aggregation and processing of the weak labels results in a set of objective strong labels. The experiment uses synthetic audio in order to verify the quality of the resulting annotations through comparison with ground truth. The proposed method produces labels with high precision, though not all event instances are recalled. Detection metrics comparing the produced annotations with the ground truth show 80% F-score in 1 s segments, and up to 89.5% intersection-based F1-score calculated according to the polyphonic sound detection score metrics.
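The abstract reports an F-score "in 1 s segments". As a rough illustration of what such a segment-based score measures, the sketch below marks each (segment, class) pair as active or inactive and micro-averages over all pairs. This record does not give the paper's exact evaluation setup, so the function names, the event tuple layout, and the toy annotations here are assumptions for illustration only.

```python
import math
from typing import List, Set, Tuple

# Hypothetical event format: (onset in seconds, offset in seconds, class label).
Event = Tuple[float, float, str]


def active_segments(events: List[Event], seg_len: float = 1.0) -> Set[Tuple[int, str]]:
    """Return the set of (segment index, label) pairs touched by any event."""
    active = set()
    for onset, offset, label in events:
        if offset <= onset:
            continue  # skip empty or malformed events
        first = int(math.floor(onset / seg_len))
        last = int(math.ceil(offset / seg_len))  # exclusive upper bound
        for idx in range(first, last):
            active.add((idx, label))
    return active


def segment_f_score(reference: List[Event], estimated: List[Event],
                    seg_len: float = 1.0) -> float:
    """Micro-averaged F-score over fixed-length segments (assumed definition)."""
    ref = active_segments(reference, seg_len)
    est = active_segments(estimated, seg_len)
    tp = len(ref & est)   # active in both reference and estimate
    fp = len(est - ref)   # active only in the estimate
    fn = len(ref - est)   # active only in the reference
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy data (not from the paper): the estimate is precise but misses events.
reference = [(0.0, 2.0, "dog"), (3.0, 4.0, "car")]
estimated = [(0.0, 1.0, "dog")]
score = segment_f_score(reference, estimated)
```

In the toy data every estimated segment is correct (precision 1.0) while two of the three reference segments are missed (recall 1/3), which mirrors the high-precision, lower-recall behaviour the abstract describes.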


Pages: 246-250

Page count: 5
