Self-supervised learning representation for abnormal acoustic event detection based on attentional contrastive learning

Cited by: 1

Authors:

Wei, Juan [1]

Zhang, Qian [1]

Ning, Weichen [2]

Affiliations:

[1] Xidian Univ, Sch Commun Engn, Xian 710071, Peoples R China

[2] Hong Kong Polytech Univ, Fac Engn, Dept Comp, Hong Kong 100872, Peoples R China

Source:

DIGITAL SIGNAL PROCESSING | 2023 / Vol. 142

Keywords:

Contrastive learning; Self-supervised learning; Attention mechanism; Abnormal acoustic event detection; FUSION;

DOI:

10.1016/j.dsp.2023.104199

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Most abnormal acoustic event detection (AAED) relies on supervised training of deep learning models, but manually labeled samples are costly and scarce. To overcome this problem, this work proposes a self-supervised learning representation for AAED based on contrastive learning. Auditory and visual data augmentations are applied simultaneously to create positive sample pairs. An attention mechanism is introduced into the encoder during self-supervised pre-training. Features fused by discriminant correlation analysis are compared with a single feature to verify the feature-capturing ability of the self-supervised pre-trained model. The pre-training is completed on an abnormal acoustic dataset with noise. The results show that the self-supervised pre-trained model achieves an accuracy of 87.72% in linear evaluation and 88.70% in the downstream task on a small, pure AAED dataset, exceeding the results of supervised learning. This work eases the demand for abnormal acoustic event labels. (c) 2023 Published by Elsevier Inc.
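The abstract does not state which contrastive objective is used. As a rough illustration of how positive sample pairs from two augmentations can drive training, the sketch below assumes the NT-Xent loss popularized by SimCLR. The function names, the temperature value, and the plain-list embedding format are all hypothetical, and the embeddings are assumed to be non-zero vectors.

```python
import math


def _normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def nt_xent_loss(views_a, views_b, temperature=0.5):
    """NT-Xent loss (an assumed objective, not confirmed by the abstract).

    views_a[i] and views_b[i] are embeddings of two augmentations of the
    same clip (a positive pair); every other embedding in the batch acts
    as a negative.
    """
    n = len(views_a)
    z = [_normalize(v) for v in list(views_a) + list(views_b)]
    total = 0.0
    for i in range(2 * n):
        pos = (i + n) % (2 * n)  # index of the other view of the same clip
        # Temperature-scaled cosine similarity to every other embedding.
        logits = {
            j: sum(a * b for a, b in zip(z[i], z[j])) / temperature
            for j in range(2 * n)
            if j != i
        }
        # Numerically stable log-sum-exp over positives and negatives.
        m = max(logits.values())
        log_denom = m + math.log(sum(math.exp(s - m) for s in logits.values()))
        total += log_denom - logits[pos]
    return total / (2 * n)
```

Under this assumed objective, the loss falls when the two views of each clip sit closer to each other than to the rest of the batch, which is the property a pre-trained encoder would then carry into linear evaluation.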


Pages: 9
