Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information

被引:15
|
作者
Xia, Xianjun [1 ]
Togneri, Roberto [1 ]
Sohel, Ferdous [2 ]
Zhao, Yuanjun [1 ]
Huang, Defeng [1 ]
机构
[1] Univ Western Australia, Dept Elect Elect & Comp Engn, Perth, WA 6009, Australia
[2] Murdoch Univ, Coll Sci Hlth Engn & Educ, Perth, WA 6150, Australia
关键词
Acoustics; Task analysis; Neural networks; Event detection; Training; Indexes; Hidden Markov models; Acoustic event detection; multi-label classification; joint learning; multi-task; CLASSIFICATION; SCENES;
D O I
10.1109/TMM.2019.2933330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Acoustic event detection deals with the acoustic signals to determine the sound type and to estimate the audio event boundaries. Multi-label classification based approaches are commonly used to detect the frame wise event types with a median filter applied to determine the happening acoustic events. However, the multi-label classifiers are trained only on the acoustic event types ignoring the frame position within the audio events. To deal with this, this paper proposes to construct a joint learning based multi-task system. The first task performs the acoustic event type detection and the second task is to predict the frame position information. By sharing representations between the two tasks, we can enable the acoustic models to generalize better than the original classifier by averaging respective noise patterns to be implicitly regularized. Experimental results on the monophonic UPC-TALP and the polyphonic TUT Sound Event datasets demonstrate the superior performance of the joint learning method by achieving lower error rate and higher F-score compared to the baseline AED system.
引用
收藏
页码:569 / 578
页数:10
相关论文
共 50 条
  • [1] A Scene-Dependent Sound Event Detection Approach Using Multi-Task Learning
    Liang, Han
    Ji, Wanting
    Wang, Ruili
    Ma, Yaxiong
    Chen, Jincai
    Chen, Min
    IEEE SENSORS JOURNAL, 2022, 22 (18) : 17483 - 17489
  • [2] Event Detection via Context Understanding Based on Multi-task Learning
    Xia, Jing
    Li, Xiaolong
    Tan, Yongbin
    Zhang, Wu
    Li, Dajun
    Xiong, Zhengkun
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
  • [3] A MULTI-TASK LEARNING METHOD FOR WEAKLY SUPERVISED SOUND EVENT DETECTION
    Liu, Sichen
    Yang, Feiran
    Kang, Fang
    Yang, Jun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8802 - 8806
  • [4] Event-Based Multi-Task Facial Landmark and Blink Detection
    Kielty, Paul
    Ryan, Cian
    Shariff, Waseem
    Lemley, Joe
    Corcoran, Peter
    IEEE ACCESS, 2025, 13 : 45609 - 45622
  • [5] POLYPHONIC SOUND EVENT AND SOUND ACTIVITY DETECTION: A MULTI-TASK APPROACH
    Pankajakshan, Arjun
    Bear, Helen L.
    Benetos, Emmanouil
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 323 - 327
  • [6] Multi-Task Learning for Improved Recognition of Multiple Types of Acoustic Information
    Kim, Jae-Won
    Park, Hochong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (10): : 1762 - 1765
  • [7] Frame-wise dynamic threshold based polyphonic acoustic event detection
    Xia, Xianjun
    Togneri, Roberto
    Sohel, Ferdous
    Huang, David
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 474 - 478
  • [8] A Survey on Multi-Task Learning
    Zhang, Yu
    Yang, Qiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (12) : 5586 - 5609
  • [9] UNIFYING ISOLATED AND OVERLAPPING AUDIO EVENT DETECTION WITH MULTI-LABEL MULTI-TASK CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Huy Phan
    Chen, Oliver Y.
    Koch, Philipp
    Pham, Lam
    McLoughlin, Ian
    Mertins, Alfred
    De Vos, Maarten
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 51 - 55
  • [10] Deep Multi-Task Learning for Spatio-Temporal Incomplete Qualitative Event Forecasting
    Chowdhury, Tanmoy
    Gao, Yuyang
    Zhao, Liang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 7913 - 7926