CROWDSOURCING STRONG LABELS FOR SOUND EVENT DETECTION

被引:2
|
作者
Martin-Morato, Irene [1 ]
Harju, Manu [1 ]
Mesaros, Annamaria [1 ]
机构
[1] Tampere Univ, Comp Sci, Korkeakoulunkatu 7, Tampere 33720, Finland
来源
2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2021年
基金
芬兰科学院;
关键词
Strong labels; Sound event detection; Crowd-sourcing; Multi-annotator data;
D O I
10.1109/WASPAA52581.2021.9632761
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Strong labels are a necessity for evaluation of sound event detection methods, but often scarcely available due to the high resources required by the annotation task. We present a method for estimating strong labels using crowdsourced weak labels, through a process that divides the annotation task into simple unit tasks. Based on estimations of annotators' competence, aggregation and processing of the weak labels results in a set of objective strong labels. The experiment uses synthetic audio in order to verify the quality of the resulting annotations through comparison with ground truth. The proposed method produces labels with high precision, though not all event instances are recalled. Detection metrics comparing the produced annotations with the ground truth show 80% F-score in 1 s segments, and up to 89.5% intersection-based F1-score calculated according to the polyphonic sound detection score metrics.
引用
收藏
页码:246 / 250
页数:5
相关论文
共 50 条
  • [1] Strong Labeling of Sound Events Using Crowdsourced Weak Labels and Annotator Competence Estimation
    Martin-Morato, Irene
    Mesaros, Annamaria
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 902 - 914
  • [2] SOUND EVENT DETECTION BY MULTITASK LEARNING OF SOUND EVENTS AND SCENES WITH SOFT SCENE LABELS
    Imoto, Keisuke
    Tonami, Noriyuki
    Koizumi, Yuma
    Yasuda, Masahiro
    Yamanishi, Ryosuke
    Yamashita, Yoichi
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 621 - 625
  • [3] Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions
    Khandelwal, Tanmay
    Das, Rohan Kumar
    Koh, Andrew
    Chng, Eng Siong
    2023 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP, SSP, 2023, : 329 - 333
  • [4] Dynamic Thresholding on FixMatch with Weak and Strong Data Augmentations for Sound Event Detection
    Khandelwal, Tanmay
    Das, Rohan Kumar
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 428 - 432
  • [5] Regional Traffic Event Detection Using Data Crowdsourcing
    Kim, Yuna
    Song, Sangho
    Lee, Hyeonbyeong
    Choi, Dojin
    Lim, Jongtae
    Bok, Kyoungsoo
    Yoo, Jaesoo
    APPLIED SCIENCES-BASEL, 2023, 13 (16):
  • [6] Event Specific Attention for Polyphonic Sound Event Detection
    Sundar, Harshavardhan
    Sun, Ming
    Wang, Chao
    INTERSPEECH 2021, 2021, : 566 - 570
  • [7] Active Learning for Sound Event Detection
    Shuyang Zhao
    Heittola, Toni
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2895 - 2905
  • [8] Detecting Sound Events Using Convolutional Macaron Net With Pseudo Strong Labels
    Chan, Teck Kai
    Chin, Cheng Siong
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [9] Environmental Sound Classification for Flood Event Detection
    Basnyat, Bipendra
    Roy, Nirmalya
    Gangopadhyay, Aryya
    Raglin, Adrienne
    2022 18TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS (IE), 2022,
  • [10] INCREMENTAL LEARNING ALGORITHM FOR SOUND EVENT DETECTION
    Koh, Eunjeong
    Saki, Fatemeh
    Guo, Yinyi
    Hung, Cheng-Yu
    Visser, Erik
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,