CROWDSOURCING STRONG LABELS FOR SOUND EVENT DETECTION

被引：2

作者：

Martin-Morato, Irene ^{[1
]}

Harju, Manu ^{[1
]}

Mesaros, Annamaria ^{[1
]}

机构：

[1] Tampere Univ, Comp Sci, Korkeakoulunkatu 7, Tampere 33720, Finland

来源：

2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2021年

基金：

芬兰科学院;

关键词：

Strong labels; Sound event detection; Crowd-sourcing; Multi-annotator data;

D O I：

10.1109/WASPAA52581.2021.9632761

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Strong labels are a necessity for evaluation of sound event detection methods, but often scarcely available due to the high resources required by the annotation task. We present a method for estimating strong labels using crowdsourced weak labels, through a process that divides the annotation task into simple unit tasks. Based on estimations of annotators' competence, aggregation and processing of the weak labels results in a set of objective strong labels. The experiment uses synthetic audio in order to verify the quality of the resulting annotations through comparison with ground truth. The proposed method produces labels with high precision, though not all event instances are recalled. Detection metrics comparing the produced annotations with the ground truth show 80% F-score in 1 s segments, and up to 89.5% intersection-based F1-score calculated according to the polyphonic sound detection score metrics.

引用

页码：246 / 250

页数：5

共 50 条

[31] HYPERNETWORKS FOR SOUND EVENT DETECTION: A PROOF-OF-CONCEPT
Singh, Shubhr
Huy Phan
Benetos, Emmanouil
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 429 - 433
[32] Human–machine collaboration based sound event detection
Shengtong Ge
Zhiwen Yu
Fan Yang
Jiaqi Liu
Liang Wang
CCF Transactions on Pervasive Computing and Interaction, 2022, 4 : 158 - 171
[33] FRAMEWORK FOR EVALUATION OF SOUND EVENT DETECTION IN WEB VIDEOS
Badlani, Rohan
Shah, Ankit
Elizalde, Benjamin
Kumar, Anurag
Raj, Bhiksha
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 3096 - 3100
[34] AFFINITY MIXUP FOR WEAKLY SUPERVISED SOUND EVENT DETECTION
Izadi, Mohammad Rasool
Stevenson, Robert
Kloepper, Laura
2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
[35] THRESHOLD INDEPENDENT EVALUATION OF SOUND EVENT DETECTION SCORES
Ebbers, Janek
Haeb-Umbach, Reinhold
Serizel, Romain
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1021 - 1025
[36] Frequency-Aware Convolution for Sound Event Detection
Song, Tao
Zhang, Wenwen
MULTIMEDIA MODELING, MMM 2025, PT I, 2025, 15520 : 415 - 426
[37] DUAL KNOWLEDGE DISTILLATION FOR EFFICIENT SOUND EVENT DETECTION
Xiao, Yang
Das, Rohan Kumar
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 690 - 694
[38] Investigating Crowdsourcing as a Method to Collect Emotion Labels for Images
Korovina, Olga
Baez, Marcos
Casati, Fabio
Berestneva, Olga
Nielek, Radoslaw
CHI 2018: EXTENDED ABSTRACTS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2018,
[39] A formalized framework for incorporating expert labels in crowdsourcing environment
Hu, Qingyang
He, Qinming
Huang, Hao
Chiew, Kevin
Liu, Zhenguang
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 47 (03) : 403 - 425
[40] A formalized framework for incorporating expert labels in crowdsourcing environment
Qingyang Hu
Qinming He
Hao Huang
Kevin Chiew
Zhenguang Liu
Journal of Intelligent Information Systems, 2016, 47 : 403 - 425

← 1 2 3 4 5 →