Anomalous Sound Detection Using Self-Supervised Classification Deep Hierarchical Reconstruction Network with Symmetric Fusion Attention

被引：0

作者：

Wang, Hui ^{[1
]}

Shen, Kuan ^{[1
]}

Wang, Fuquan ^{[1
]}

机构：

[1] Chongqing Univ, Coll Optoelect Engn, Chongqing 400044, Peoples R China

来源：

CIRCUITS SYSTEMS AND SIGNAL PROCESSING | 2025年

关键词：

Anomalous sound detection; Generative model; Discriminative model; Attention mechanism; Center loss;

D O I：

10.1007/s00034-025-03064-2

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The main objective of unsupervised anomalous sound detection (ASD) is to identify anomalous sound events among normal sound samples. Existing ASD methods primarily rely on generative and discriminative models. Among them, the autoencoder (AE) based on generative models is widely used for anomaly detection. However, due to the 'shortcut' problem, it often misclassifies abnormal samples as normal. In contrast, discriminative model-based methods, while exhibiting good performance, often suffer from poor stability. This research introduces an architecture named the self-supervised classification deep hierarchical reconstruction network (SCDHR), which combines generative and discriminative model structures. The system uses convolutional kernels of varying sizes across different branches to process input data, aiming to extract more discriminative features. Additionally, a module called symmetric fusion attention (SFA) is introduced. This module enhances the model's ability to process input by integrating attention mechanisms for time, frequency, and coordinate across different branches. As a result, the model's ability to select relevant features is improved. Furthermore, the one class center loss is incorporated and combined with the standard center loss to obtain more compact feature representations, thereby enhancing the model's ability to distinguish anomalous samples. Finally, the proposed method is validated on the DCASE 2023 TASK 2 dataset, achieving a harmonic mean of 65.17% for AUC and PAUC on the Development Dataset and 68.16% on the Evaluation Dataset, outperforming state-of-the-art methods.

引用

页数：26

共 17 条

[1] SELF-SUPERVISED LEARNING FOR ANOMALOUS SOUND DETECTION
Wilkinghoff, Kevin
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 276 - 280
[2] ASDNet: An Efficient Self-Supervised Convolutional Network for Anomalous Sound Detection
Kong, Dewei
Yuan, Guoshun
Yu, Hongjiang
Wang, Shuai
Zhang, Bo
APPLIED SCIENCES-BASEL, 2025, 15 (02):
[3] SSDPT: Self-supervised dual-path transformer for anomalous sound detection
Bai, Jisheng
Chen, Jianfeng
Wang, Mou
Ayub, Muhammad Saad
Yan, Qingli
DIGITAL SIGNAL PROCESSING, 2023, 135
[4] A self-supervised anomalous machine sound detection model based on spectrogram decomposition and parallel sub-network
Zhang, Tao
Kong, Lingguo
Zhao, Xin
Li, Donglei
Geng, Yanzhang
Ding, Biyun
Wang, Chao
APPLIED INTELLIGENCE, 2025, 55 (06)
[5] Targeted self-supervised attention network for coronavirus disease 2019 detection
Ding, Weixin
Yu, Wenbo
JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
[6] SELF-SUPERVISED REPRESENTATION LEARNING FOR UNSUPERVISED ANOMALOUS SOUND DETECTION UNDER DOMAIN SHIFT
Chen, Han
Song, Yan
Dai, Li-Rong
McLoughlin, Ian
Liu, Lin
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 471 - 475
[7] A deep neural network for the classification of epileptic seizures using hierarchical attention mechanism
Chirasani, Sateesh Kumar Reddy
Manikandan, Suchetha
SOFT COMPUTING, 2022, 26 (11) : 5389 - 5397
[8] Self-supervised 3D face reconstruction based on multi-scale feature fusion and dual attention mechanism
Zhou D.-K.
Zhang C.
Yang X.
Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (10): : 2428 - 2437
[9] Contrastive Self-Supervised Two-Domain Residual Attention Network with Random Augmentation Pool for Hyperspectral Change Detection
Huang, Yixiang
Zhang, Lifu
Qi, Wenchao
Huang, Changping
Song, Ruoxi
REMOTE SENSING, 2023, 15 (15)
[10] Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds
Zhang, Hejing
Guan, Jian
Zhu, Qiaoxi
Xiao, Feiyang
Liu, Youde
INTERSPEECH 2023, 2023, : 336 - 340

← 1 2 →