A lightweight framework for unsupervised anomalous sound detection based on selective learning of time-frequency domain features

被引:1
|
作者
Wang, Yawei [1 ]
Zhang, Qiaoling [1 ,2 ]
Zhang, Weiwei [3 ]
Zhang, Yi [4 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Comp Sci & Technol, Sch Artificial Intelligence, Hangzhou 310018, Peoples R China
[2] Zhejiang Sci Tech Univ, Key Lab Intelligent Text & Flexible Interconnect Z, Hangzhou 310018, Peoples R China
[3] Dalian Maritime Univ, Informat Sci & Technol Coll, Dalian 116026, Peoples R China
[4] Zhejiang Sci Tech Univ, Sch Informat Sci & Engn, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Anomalous sound detection; Spectrogram frames selection; Frequency-feature selection; Unsupervised deep learning;
D O I
10.1016/j.apacoust.2024.110308
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
For industrial anomalous sound detection (ASD), self-supervised methods have achieved significant detection performance in many cases. Nevertheless, these methods typically rely on the availability of external auxiliary information, and they may not work when such information are not feasible. Unsupervised methods do not leverage auxiliary information, whereas they usually obtained lower detection performance compared to self- supervised ones. Though some unsupervised methods have shown potential performance improvements, they are at the cost of complex implementation or large model sizes. As to the issues, this paper presents an unsupervised ASD method based on spectrogram frames selection (SFS) and AutoEncoder for Frequency-feature Selection (AEFS), called SFS-AEFS. First, SFS is developed based upon the temporal characteristics of machine sounds, which aims to select spectrogram frames (SFs) that contains the primary sound information while discarding the portions that are affected by noises or interferences or do not contain the target sound. Next, AEFS is developed by introducing a Scaling Gate (SG) after AE. For the selected SF features, AEFS aims to selectively enhance the mode learning of partial frequency dimensions and weaken the rest ones. Comparative experiments with the current ASD methods were made on the DCASE 2020 Challenge Task2 dataset. The related results demonstrate that our method achieved the best performance among all relevant unsupervised methods and is comparable to the current SOTA self-supervised methods. Moreover, our method is lightweight with model parameters being only 0.08MB.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] A multi-source unsupervised domain adaptive bearing fault diagnosis method integrating time-frequency features
    Jin, Huaiping
    Liu, Zhiyong
    Wang, Bin
    Qian, Bin
    Liu, Haipeng
    Zhendong yu Chongji/Journal of Vibration and Shock, 2024, 43 (13): : 12 - 24
  • [32] Extreme Learning Machine for Heartbeat Classification with Hybrid Time-Domain and Wavelet Time-Frequency Features
    Xu, Yuefan
    Zhang, Sen
    Cao, Zhengtao
    Chen, Qinqin
    Xiao, Wendong
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [33] A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION
    Zhang, Yucong
    Liu, Juan
    Tian, Yao
    Liu, Haifeng
    Li, Ming
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1266 - 1270
  • [34] Electroencephalogram Based Biometric Framework Using Time and Frequency Domain Features
    Mishra, Puneet
    Singla, Sunil Kumar
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2014, 4 (04) : 593 - 599
  • [35] Machine Learning-Based Detection of Sleep-Disordered Breathing Type Using Time and Time-Frequency Features
    Balci, Mehmet
    Tasdemir, Sakir
    Ozmen, Guzin
    Golcuk, Adem
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 73
  • [36] Detection of daily physical activities in the time-frequency domain
    Nyan, M. N.
    Tay, Francis E. H.
    Seah, K. H. W.
    Ismail, N. H.
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2006, 6 (04) : 429 - 446
  • [37] A study on pitch detection in time-frequency hybrid domain
    Jo, W
    Kim, J
    Bae, MJ
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 437 - 440
  • [38] Anomalous Sound Detection Framework Based on Masking Strategy
    Li, Xiang
    Zhao, Caidan
    Gao, Chenxing
    Hu, Wenxin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14880 : 287 - 298
  • [39] Multitask Learning of Time-Frequency CNN for Sound Source Localization
    Pang, Cheng
    Liu, Hong
    Li, Xiaofei
    IEEE ACCESS, 2019, 7 : 40725 - 40737
  • [40] Underdetermined blind separation of delayed sound source in the time-frequency domain
    Xia, X. (xiaxxy@163.com), 1600, Sichuan University (46):