Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information

被引:15
|
作者
Xia, Xianjun [1 ]
Togneri, Roberto [1 ]
Sohel, Ferdous [2 ]
Zhao, Yuanjun [1 ]
Huang, Defeng [1 ]
机构
[1] Univ Western Australia, Dept Elect Elect & Comp Engn, Perth, WA 6009, Australia
[2] Murdoch Univ, Coll Sci Hlth Engn & Educ, Perth, WA 6150, Australia
关键词
Acoustics; Task analysis; Neural networks; Event detection; Training; Indexes; Hidden Markov models; Acoustic event detection; multi-label classification; joint learning; multi-task; CLASSIFICATION; SCENES;
D O I
10.1109/TMM.2019.2933330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Acoustic event detection deals with the acoustic signals to determine the sound type and to estimate the audio event boundaries. Multi-label classification based approaches are commonly used to detect the frame wise event types with a median filter applied to determine the happening acoustic events. However, the multi-label classifiers are trained only on the acoustic event types ignoring the frame position within the audio events. To deal with this, this paper proposes to construct a joint learning based multi-task system. The first task performs the acoustic event type detection and the second task is to predict the frame position information. By sharing representations between the two tasks, we can enable the acoustic models to generalize better than the original classifier by averaging respective noise patterns to be implicitly regularized. Experimental results on the monophonic UPC-TALP and the polyphonic TUT Sound Event datasets demonstrate the superior performance of the joint learning method by achieving lower error rate and higher F-score compared to the baseline AED system.
引用
收藏
页码:569 / 578
页数:10
相关论文
共 50 条
  • [41] Survey on Multi-Task Learning in Smart Transportation
    Alzahrani, Mohammed
    Wang, Qianlong
    Liao, Weixian
    Chen, Xuhui
    Yu, Wei
    IEEE ACCESS, 2024, 12 : 17023 - 17044
  • [42] A novel learning-based frame pooling method for event detection
    Wang, Lan
    Gao, Chenqiang
    Liu, Jiang
    Meng, Deyu
    SIGNAL PROCESSING, 2017, 140 : 45 - 52
  • [43] Multi-task learning for arousal and sleep stage detection using fully convolutional networks
    Zan, Hasan
    Yildiz, Abdulnasir
    JOURNAL OF NEURAL ENGINEERING, 2023, 20 (05)
  • [44] Detecting frog calling activity based on acoustic event detection and multi-label learning
    Xie, Jie
    Michael, Towsey
    Zhang, Jinglan
    Roe, Paul
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 627 - 638
  • [45] Task Variance Regularized Multi-Task Learning
    Mao, Yuren
    Wang, Zekai
    Liu, Weiwei
    Lin, Xuemin
    Hu, Wenbin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 8615 - 8629
  • [46] Dermoscopic attributes classification using deep learning and multi-task learning
    Saitov, Irek
    Polevaya, Tatyana
    Filchenkov, Andrey
    9TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE, YSC2020, 2020, 178 : 328 - 336
  • [47] Multimodal Sentiment Analysis With Two-Phase Multi-Task Learning
    Yang, Bo
    Wu, Lijun
    Zhu, Jinhua
    Shao, Bo
    Lin, Xiaola
    Liu, Tie-Yan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2015 - 2024
  • [48] Multi-Task Reinforcement Learning With Attention-Based Mixture of Experts
    Cheng, Guangran
    Dong, Lu
    Cai, Wenzhe
    Sun, Changyin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (06) : 3811 - 3818
  • [49] Multimedia Event Detection Using Event-Driven Multiple Instance Learning
    Phan, Sang
    Le, Duy-Dinh
    Satoh, Shin'ichi
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1255 - 1258
  • [50] Multi-Task Deep Learning for Image Segmentation Using Recursive Approximation Tasks
    Ke, Rihuan
    Bugeau, Aurelie
    Papadakis, Nicolas
    Kirkland, Mark
    Schuetz, Peter
    Schonlieb, Carola-Bibiane
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3555 - 3567