CROWDSOURCING STRONG LABELS FOR SOUND EVENT DETECTION

Cited by: 2
Authors
Martin-Morato, Irene [1]
Harju, Manu [1]
Mesaros, Annamaria [1]
Affiliations
[1] Tampere Univ, Comp Sci, Korkeakoulunkatu 7, Tampere 33720, Finland
Source
2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021
Funding
Academy of Finland;
Keywords
Strong labels; Sound event detection; Crowd-sourcing; Multi-annotator data;
DOI
10.1109/WASPAA52581.2021.9632761
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Strong labels are a necessity for evaluation of sound event detection methods, but often scarcely available due to the high resources required by the annotation task. We present a method for estimating strong labels using crowdsourced weak labels, through a process that divides the annotation task into simple unit tasks. Based on estimations of annotators' competence, aggregation and processing of the weak labels results in a set of objective strong labels. The experiment uses synthetic audio in order to verify the quality of the resulting annotations through comparison with ground truth. The proposed method produces labels with high precision, though not all event instances are recalled. Detection metrics comparing the produced annotations with the ground truth show 80% F-score in 1 s segments, and up to 89.5% intersection-based F1-score calculated according to the polyphonic sound detection score metrics.
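The segment-based F-score mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function names, the event tuple layout, and the example events below are assumptions made purely for illustration. Each annotation is mapped onto fixed 1 s segments, and precision and recall are combined over (segment, label) pairs.

```python
import math
from typing import Iterable, Set, Tuple

# Hypothetical event representation: (onset in seconds, offset in seconds, label).
Event = Tuple[float, float, str]


def active_segments(events: Iterable[Event], segment_len: float = 1.0) -> Set[Tuple[int, str]]:
    """Return the (segment index, label) pairs in which an event is active."""
    active = set()
    for onset, offset, label in events:
        if offset <= onset:
            continue  # skip empty or malformed events
        first = math.floor(onset / segment_len)
        last = math.ceil(offset / segment_len)  # exclusive upper bound
        for idx in range(first, last):
            active.add((idx, label))
    return active


def segment_f1(reference: Iterable[Event], estimated: Iterable[Event],
               segment_len: float = 1.0) -> float:
    """Segment-based F1 between reference and estimated annotations."""
    ref = active_segments(reference, segment_len)
    est = active_segments(estimated, segment_len)
    tp = len(ref & est)   # segments active in both
    fp = len(est - ref)   # active only in the estimate
    fn = len(ref - est)   # active only in the reference
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Made-up example: the estimate covers only the first two of three reference seconds,
# so precision is perfect while recall is not.
reference = [(0.0, 3.0, "dog_bark")]
estimated = [(0.0, 2.0, "dog_bark")]
score = segment_f1(reference, estimated)
```

In this toy case every estimated segment is correct but one reference segment is missed, which mirrors the high-precision, lower-recall behavior the abstract reports. The intersection-based score from the polyphonic sound detection score metrics follows a different matching rule and is not covered by this sketch.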
Pages: 246-250
Number of pages: 5
Related papers
50 in total
  • [31] HYPERNETWORKS FOR SOUND EVENT DETECTION: A PROOF-OF-CONCEPT
    Singh, Shubhr
    Huy Phan
    Benetos, Emmanouil
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 429 - 433
  • [32] Human–machine collaboration based sound event detection
    Shengtong Ge
    Zhiwen Yu
    Fan Yang
    Jiaqi Liu
    Liang Wang
    CCF Transactions on Pervasive Computing and Interaction, 2022, 4 : 158 - 171
  • [33] FRAMEWORK FOR EVALUATION OF SOUND EVENT DETECTION IN WEB VIDEOS
    Badlani, Rohan
    Shah, Ankit
    Elizalde, Benjamin
    Kumar, Anurag
    Raj, Bhiksha
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 3096 - 3100
  • [34] AFFINITY MIXUP FOR WEAKLY SUPERVISED SOUND EVENT DETECTION
    Izadi, Mohammad Rasool
    Stevenson, Robert
    Kloepper, Laura
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [35] THRESHOLD INDEPENDENT EVALUATION OF SOUND EVENT DETECTION SCORES
    Ebbers, Janek
    Haeb-Umbach, Reinhold
    Serizel, Romain
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1021 - 1025
  • [36] Frequency-Aware Convolution for Sound Event Detection
    Song, Tao
    Zhang, Wenwen
    MULTIMEDIA MODELING, MMM 2025, PT I, 2025, 15520 : 415 - 426
  • [37] DUAL KNOWLEDGE DISTILLATION FOR EFFICIENT SOUND EVENT DETECTION
    Xiao, Yang
    Das, Rohan Kumar
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 690 - 694
  • [38] Investigating Crowdsourcing as a Method to Collect Emotion Labels for Images
    Korovina, Olga
    Baez, Marcos
    Casati, Fabio
    Berestneva, Olga
    Nielek, Radoslaw
    CHI 2018: EXTENDED ABSTRACTS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2018,
  • [39] A formalized framework for incorporating expert labels in crowdsourcing environment
    Hu, Qingyang
    He, Qinming
    Huang, Hao
    Chiew, Kevin
    Liu, Zhenguang
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 47 (03) : 403 - 425