A lightweight framework for unsupervised anomalous sound detection based on selective learning of time-frequency domain features

Cited by: 1
Authors
Wang, Yawei [1]
Zhang, Qiaoling [1,2]
Zhang, Weiwei [3]
Zhang, Yi [4]
Affiliations
[1] Zhejiang Sci Tech Univ, Sch Comp Sci & Technol, Sch Artificial Intelligence, Hangzhou 310018, Peoples R China
[2] Zhejiang Sci Tech Univ, Key Lab Intelligent Text & Flexible Interconnect Z, Hangzhou 310018, Peoples R China
[3] Dalian Maritime Univ, Informat Sci & Technol Coll, Dalian 116026, Peoples R China
[4] Zhejiang Sci Tech Univ, Sch Informat Sci & Engn, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Anomalous sound detection; Spectrogram frames selection; Frequency-feature selection; Unsupervised deep learning;
DOI
10.1016/j.apacoust.2024.110308
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
For industrial anomalous sound detection (ASD), self-supervised methods have achieved strong detection performance in many cases. Nevertheless, these methods typically rely on external auxiliary information, and they may not work when such information is unavailable. Unsupervised methods do not use auxiliary information, but they usually achieve lower detection performance than self-supervised ones. Although some unsupervised methods have shown potential performance improvements, these come at the cost of complex implementation or large model sizes. To address these issues, this paper presents an unsupervised ASD method based on spectrogram frames selection (SFS) and an AutoEncoder for Frequency-feature Selection (AEFS), called SFS-AEFS. First, SFS is developed based on the temporal characteristics of machine sounds; it aims to select the spectrogram frames (SFs) that contain the primary sound information while discarding the portions that are affected by noise or interference or that do not contain the target sound. Next, AEFS is developed by introducing a Scaling Gate (SG) after the AE. For the selected SF features, AEFS aims to selectively enhance the pattern learning of some frequency dimensions and weaken that of the rest. Comparative experiments with current ASD methods were conducted on the DCASE 2020 Challenge Task 2 dataset. The results demonstrate that our method achieves the best performance among all relevant unsupervised methods and is comparable to the current state-of-the-art (SOTA) self-supervised methods. Moreover, our method is lightweight, with model parameters occupying only 0.08 MB.
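The two stages named in the abstract can be illustrated with a minimal sketch. The abstract does not specify the selection rule or the gate design, so everything here is an assumption made for illustration: frames are ranked by summed energy (a stand-in for the paper's SFS criterion), the gate is a fixed per-frequency weight vector rather than a learned Scaling Gate, and the reconstruction is passed in by the caller instead of coming from a trained autoencoder. The function names `select_frames` and `gated_anomaly_score` are hypothetical.

```python
import numpy as np


def select_frames(spec, keep_ratio=0.5):
    """Keep the highest-energy frames of a (freq, time) spectrogram.

    Energy ranking is an ASSUMED stand-in criterion, not the paper's SFS rule.
    Returns the selected frames (in time order) and their column indices.
    """
    n_keep = max(1, int(round(keep_ratio * spec.shape[1])))
    energy = spec.sum(axis=0)                    # one energy value per frame
    idx = np.sort(np.argsort(energy)[-n_keep:])  # top frames, restored to time order
    return spec[:, idx], idx


def gated_anomaly_score(frames, recon, gate):
    """Mean squared reconstruction error with each frequency bin weighted by a gate.

    `gate` has shape (freq,). A fixed weight vector is an ASSUMPTION; in the
    paper the Scaling Gate is part of the model and follows the autoencoder.
    """
    err = (gate[:, None] * (frames - recon)) ** 2
    return float(err.mean())
```

In this sketch a gate value near 1 lets a frequency bin contribute fully to the score, and a value near 0 suppresses it, which mirrors the abstract's idea of strengthening some frequency dimensions and weakening the rest. A caller would pass a log-mel spectrogram to `select_frames`, reconstruct the selected frames with any autoencoder, and then score them with `gated_anomaly_score`.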
Pages: 12
Related papers
50 items in total
  • [41] Time-frequency Resolution Optimization Features on Spoof Detection
    Li, Zetian
    Wei, Jianguo
    Sun, Qilong
    2021 ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE (ACCTCS 2021), 2021, : 10 - 16
  • [42] A Probabilistic Framework for Time-Frequency Detection of Burst Suppression
    Prerau, Michael J.
    Purdon, Patrick L.
    2013 6TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING (NER), 2013, : 609 - 612
  • [43] Auditory detection of sound signals with complex time-frequency characteristics
    ZHANG Shuying
    SUN Yaoqiu
    SUN Yong(Shanghai Acoustics Laboratory
    Chinese Journal of Acoustics, 1998, (03) : 199 - 205
  • [44] A TIME-FREQUENCY DOMAIN REPRESENTATION OF SOUND ENERGY BY USE OF WIGNER DISTRIBUTION
    SUZUKI, H
    KAWAURA, J
    ONO, T
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S96 - S96
  • [45] SVD-based newborn EEG seizure detection in the time-frequency domain
    Hassanpour, H
    Mesbah, M
    Boashash, B
    MODELLING AND CONTROL IN BIOMEDICAL SYSTEMS 2003 (INCLUDING BIOLOGICAL SYSTEMS), 2003, : 329 - 333
  • [46] Epilepsy Detection using Time-Frequency Domain and Entropy Based EEG Analysis
    Ficici, Cansel
    Telatar, Ziya
    Kocak, Onur
    2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
  • [47] Automatic detection of epileptic seizure events using the time-frequency features and machine learning
    Zeng, Jiale
    Tan, Xiao-dan
    Zhan, Chang'an A.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 69
  • [48] Selective Time-Frequency Reassignment Based on Synchrosqueezing
    Ahrabian, Alireza
    Mandic, Danilo P.
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (11) : 2039 - 2043
  • [49] Time-frequency analysis of heart sound based on HHT
    Zhao, ZD
    Zhao, ZJ
    Chen, YQ
    2005 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, VOLS 1 AND 2, PROCEEDINGS: VOL 1: COMMUNICATION THEORY AND SYSTEMS, 2005, : 926 - 929
  • [50] Environmental Sound Classification based on Time-frequency Representation
    Thwe, Khine Zar
    War, Nu
    2017 18TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNDP 2017), 2017, : 251 - 255