Anomalous Sound Detection Using Self-Supervised Classification Deep Hierarchical Reconstruction Network with Symmetric Fusion Attention

Citations: 0
Authors
Wang, Hui [1]
Shen, Kuan [1]
Wang, Fuquan [1]
Affiliations
[1] Chongqing Univ, Coll Optoelect Engn, Chongqing 400044, Peoples R China
Keywords
Anomalous sound detection; Generative model; Discriminative model; Attention mechanism; Center loss
DOI
10.1007/s00034-025-03064-2
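Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification codes
0808; 0809
Abstract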
The main objective of unsupervised anomalous sound detection (ASD) is to identify anomalous sound events among normal sound samples. Existing ASD methods rely primarily on generative or discriminative models. Among generative approaches, the autoencoder (AE) is widely used for anomaly detection. However, because of the 'shortcut' problem, it often misclassifies abnormal samples as normal. Discriminative methods perform well but often suffer from poor stability. This research introduces the self-supervised classification deep hierarchical reconstruction network (SCDHR), an architecture that combines generative and discriminative model structures. The network processes the input with convolutional kernels of different sizes in different branches, aiming to extract more discriminative features. A symmetric fusion attention (SFA) module is also introduced. It integrates time, frequency, and coordinate attention mechanisms across the branches, which improves the model's ability to select relevant features. Furthermore, a one-class center loss is combined with the standard center loss to obtain more compact feature representations, thereby enhancing the model's ability to distinguish anomalous samples. Finally, the proposed method is validated on the DCASE 2023 Task 2 dataset, achieving a harmonic mean of AUC and pAUC of 65.17% on the Development Dataset and 68.16% on the Evaluation Dataset, outperforming state-of-the-art methods.
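The reported score is a harmonic mean of AUC and pAUC values. The abstract does not state how the individual values are pooled, so the following is only a minimal sketch, assuming that all per-machine-type AUC and pAUC values are combined in a single harmonic mean. The function name `overall_score` and the input numbers are hypothetical and are not results from the paper.

```python
from statistics import harmonic_mean


def overall_score(auc_scores, pauc_scores):
    """Harmonic mean over all AUC and pAUC values (fractions in (0, 1]).

    Assumption: AUC and pAUC values are pooled into one list before
    averaging; the abstract does not specify the exact aggregation.
    """
    return harmonic_mean(list(auc_scores) + list(pauc_scores))


# Hypothetical per-machine-type scores, NOT taken from the paper.
auc = [0.70, 0.62, 0.81]
pauc = [0.55, 0.52, 0.60]

score = overall_score(auc, pauc)
print(f"harmonic mean of AUC and pAUC: {score:.4f}")
```

Because the harmonic mean never exceeds the arithmetic mean and is pulled toward the smallest inputs, a single weak machine type or a low pAUC lowers the overall score more than it would under a plain average.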
Pages: 26