Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information

被引:15
|
作者
Xia, Xianjun [1 ]
Togneri, Roberto [1 ]
Sohel, Ferdous [2 ]
Zhao, Yuanjun [1 ]
Huang, Defeng [1 ]
机构
[1] Univ Western Australia, Dept Elect Elect & Comp Engn, Perth, WA 6009, Australia
[2] Murdoch Univ, Coll Sci Hlth Engn & Educ, Perth, WA 6150, Australia
关键词
Acoustics; Task analysis; Neural networks; Event detection; Training; Indexes; Hidden Markov models; Acoustic event detection; multi-label classification; joint learning; multi-task; CLASSIFICATION; SCENES;
D O I
10.1109/TMM.2019.2933330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Acoustic event detection deals with the acoustic signals to determine the sound type and to estimate the audio event boundaries. Multi-label classification based approaches are commonly used to detect the frame wise event types with a median filter applied to determine the happening acoustic events. However, the multi-label classifiers are trained only on the acoustic event types ignoring the frame position within the audio events. To deal with this, this paper proposes to construct a joint learning based multi-task system. The first task performs the acoustic event type detection and the second task is to predict the frame position information. By sharing representations between the two tasks, we can enable the acoustic models to generalize better than the original classifier by averaging respective noise patterns to be implicitly regularized. Experimental results on the monophonic UPC-TALP and the polyphonic TUT Sound Event datasets demonstrate the superior performance of the joint learning method by achieving lower error rate and higher F-score compared to the baseline AED system.
引用
收藏
页码:569 / 578
页数:10
相关论文
共 50 条
  • [21] Acoustic Event Detection with Classifier Chains
    Komatsu, Tatsuya
    Watanabe, Shinji
    Miyazaki, Koichi
    Hayashi, Tomoki
    INTERSPEECH 2021, 2021, : 601 - 605
  • [22] A Unified Multi-Task Learning Architecture for Fast and Accurate Pedestrian Detection
    Zhou, Chengju
    Wu, Meiqing
    Lam, Siew-Kei
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 982 - 996
  • [23] CONFIDENCE BASED ACOUSTIC EVENT DETECTION
    Xia, Xianjun
    Togneri, Roberto
    Sohel, Ferdous
    Huang, David
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 306 - 310
  • [24] A brief review on multi-task learning
    Thung, Kim-Han
    Wee, Chong-Yaw
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29705 - 29725
  • [25] Multi-task Learning for Stance and Early Rumor Detection
    Chen, Yongheng
    Yin, Chunyan
    Zuo, Wanli
    OPTICAL MEMORY AND NEURAL NETWORKS, 2021, 30 (02) : 131 - 139
  • [26] Multi-task learning for object keypoints detection and classification
    Xu, Jie
    Zhao, Lin
    Zhang, Shanshan
    Gong, Chen
    Yang, Jian
    PATTERN RECOGNITION LETTERS, 2020, 130 : 182 - 188
  • [27] Multi-task Learning for Stance and Early Rumor Detection
    Chunyan Yongheng Chen
    Wanli Yin
    Optical Memory and Neural Networks, 2021, 30 : 131 - 139
  • [28] Multi-task Learning for Transit Service Disruption Detection
    Ji, Taoran
    Fu, Kaiqun
    Self, Nathan
    Lu, Chang-Tien
    Ramakrishnan, Naren
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 634 - 641
  • [29] Hateful Memes Detection Based on Multi-Task Learning
    Ma, Zhiyu
    Yao, Shaowen
    Wu, Liwen
    Gao, Song
    Zhang, Yunqi
    MATHEMATICS, 2022, 10 (23)
  • [30] Multi-Task Learning for Recommendation Over Heterogeneous Information Network
    Li, Hui
    Wang, Yanlin
    Lyu, Ziyu
    Shi, Jieming
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (02) : 789 - 802