Self-supervised learning representation for abnormal acoustic event detection based on attentional contrastive learning

Cited by: 1
Authors
Wei, Juan [1]
Zhang, Qian [1]
Ning, Weichen [2]
Affiliations
[1] Xidian Univ, Sch Commun Engn, Xian 710071, Peoples R China
[2] Hong Kong Polytech Univ, Fac Engn, Dept Comp, Hong Kong 100872, Peoples R China
Keywords
Contrastive learning; Self-supervised learning; Attention mechanism; Abnormal acoustic event detection; FUSION;
DOI
10.1016/j.dsp.2023.104199
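Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract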
Most abnormal acoustic event detection (AAED) systems are trained with supervised deep learning, but manually labeled samples are costly and scarce. To address this problem, this work proposes a self-supervised learning representation for AAED based on contrastive learning. Auditory and visual data augmentations are applied simultaneously to create positive sample pairs. An attention mechanism is introduced into the encoder during self-supervised pre-training. Features fused by discriminant correlation analysis are compared with a single feature to verify how well the self-supervised pre-trained model captures features. The pre-training is completed on a noisy abnormal acoustic dataset. The results show that the self-supervised pre-trained model achieves an accuracy of 87.72% in linear evaluation and 88.70% in the downstream task on a small, clean AAED dataset, exceeding the results of supervised learning. This work eases the demand for abnormal acoustic event labels. (c) 2023 Published by Elsevier Inc.
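The abstract says that positive sample pairs are built from two augmented views and trained contrastively, but it does not name the loss. As a rough illustration only, the sketch below assumes the NT-Xent loss popularised by SimCLR, which is a common choice for this setup and not necessarily the one used in the paper. The function name `nt_xent_loss`, the temperature of 0.5, and the random stand-in embeddings are all hypothetical.

```python
import math
import random


def _normalise(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def nt_xent_loss(view_a, view_b, temperature=0.5):
    """NT-Xent loss for N positive pairs (view_a[i], view_b[i]).

    Assumption: this is the SimCLR-style loss, used here only as an example
    of contrastive training; the abstract does not specify the loss.
    """
    n = len(view_a)
    z = [_normalise(v) for v in list(view_a) + list(view_b)]
    total = 0.0
    for i in range(2 * n):
        pos = (i + n) % (2 * n)  # index of the other view of the same sample
        sims = [sum(p * q for p, q in zip(z[i], z[k])) / temperature
                for k in range(2 * n)]
        # Log-sum-exp over every embedding except the anchor itself.
        others = [sims[k] for k in range(2 * n) if k != i]
        peak = max(others)
        log_denom = peak + math.log(sum(math.exp(s - peak) for s in others))
        total += log_denom - sims[pos]
    return total / (2 * n)


# Stand-in embeddings: 8 samples, 16 dimensions (hypothetical data).
rng = random.Random(0)
anchor = [[rng.gauss(0, 1) for _ in range(16)] for _ in range(8)]
close_view = [[x + 0.01 * rng.gauss(0, 1) for x in row] for row in anchor]
unrelated = [[rng.gauss(0, 1) for _ in range(16)] for _ in range(8)]

loss_aligned = nt_xent_loss(anchor, close_view)
loss_unrelated = nt_xent_loss(anchor, unrelated)
```

With embeddings like these, the loss for the nearly identical views should come out lower than the loss for unrelated vectors, which is the behaviour pre-training relies on: the encoder is rewarded for mapping two augmentations of the same clip close together and other clips further apart.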
Pages: 9
Related papers
50 in total
  • [21] Partial contrastive point cloud self-supervised representation learning
    Zijun Cheng
    Yiguo Wang
    Scientific Reports, 15 (1)
  • [22] Motion Sensitive Contrastive Learning for Self-supervised Video Representation
    Ni, Jingcheng
    Zhou, Nan
    Qin, Jie
    Wu, Qian
    Liu, Junqi
    Li, Boxun
    Huang, Di
    COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 457 - 474
  • [23] Self-supervised Segment Contrastive Learning for Medical Document Representation
    Abro, Waheed Ahmed
    Kteich, Hanane
    Bouraoui, Zied
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PT I, AIME 2024, 2024, 14844 : 312 - 321
  • [24] FEDERATED SELF-SUPERVISED LEARNING FOR ACOUSTIC EVENT CLASSIFICATION
    Feng, Meng
    Kao, Chieh-Chi
    Tang, Qingming
    Sun, Ming
    Rozgic, Viktor
    Matsoukas, Spyros
    Wang, Chao
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 481 - 485
  • [25] Image classification framework based on contrastive self-supervised learning
    Zhao H.-W.
    Zhang J.-R.
    Zhu J.-P.
    Li H.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (08) : 1850 - 1856
  • [26] DimCL: Dimensional Contrastive Learning for Improving Self-Supervised Learning
    Nguyen, Thanh
    Pham, Trung Xuan
    Zhang, Chaoning
    Luu, Tung M.
    Vu, Thang
    Yoo, Chang D.
    IEEE ACCESS, 2023, 11 : 21534 - 21545
  • [27] SELF-SUPERVISED CONTRASTIVE LEARNING FOR CROSS-DOMAIN HYPERSPECTRAL IMAGE REPRESENTATION
    Lee, Hyungtae
    Kwon, Heesung
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3239 - 3243
  • [28] TimeCLR: A self-supervised contrastive learning framework for univariate time series representation
    Yang, Xinyu
    Zhang, Zhenguo
    Cui, Rongyi
    KNOWLEDGE-BASED SYSTEMS, 2022, 245
  • [29] Generative Variational-Contrastive Learning for Self-Supervised Point Cloud Representation
    Wang, Bohua
    Tian, Zhiqiang
    Ye, Aixue
    Wen, Feng
    Du, Shaoyi
    Gao, Yue
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6154 - 6166
  • [30] Attentive spatial-temporal contrastive learning for self-supervised video representation
    Yang, Xingming
    Xiong, Sixuan
    Wu, Kewei
    Shan, Dongfeng
    Xie, Zhao
    IMAGE AND VISION COMPUTING, 2023, 137