A lightweight framework for unsupervised anomalous sound detection based on selective learning of time-frequency domain features

被引:1
|
作者
Wang, Yawei [1 ]
Zhang, Qiaoling [1 ,2 ]
Zhang, Weiwei [3 ]
Zhang, Yi [4 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Comp Sci & Technol, Sch Artificial Intelligence, Hangzhou 310018, Peoples R China
[2] Zhejiang Sci Tech Univ, Key Lab Intelligent Text & Flexible Interconnect Z, Hangzhou 310018, Peoples R China
[3] Dalian Maritime Univ, Informat Sci & Technol Coll, Dalian 116026, Peoples R China
[4] Zhejiang Sci Tech Univ, Sch Informat Sci & Engn, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Anomalous sound detection; Spectrogram frames selection; Frequency-feature selection; Unsupervised deep learning;
D O I
10.1016/j.apacoust.2024.110308
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
For industrial anomalous sound detection (ASD), self-supervised methods have achieved significant detection performance in many cases. Nevertheless, these methods typically rely on the availability of external auxiliary information, and they may not work when such information are not feasible. Unsupervised methods do not leverage auxiliary information, whereas they usually obtained lower detection performance compared to self- supervised ones. Though some unsupervised methods have shown potential performance improvements, they are at the cost of complex implementation or large model sizes. As to the issues, this paper presents an unsupervised ASD method based on spectrogram frames selection (SFS) and AutoEncoder for Frequency-feature Selection (AEFS), called SFS-AEFS. First, SFS is developed based upon the temporal characteristics of machine sounds, which aims to select spectrogram frames (SFs) that contains the primary sound information while discarding the portions that are affected by noises or interferences or do not contain the target sound. Next, AEFS is developed by introducing a Scaling Gate (SG) after AE. For the selected SF features, AEFS aims to selectively enhance the mode learning of partial frequency dimensions and weaken the rest ones. Comparative experiments with the current ASD methods were made on the DCASE 2020 Challenge Task2 dataset. The related results demonstrate that our method achieved the best performance among all relevant unsupervised methods and is comparable to the current SOTA self-supervised methods. Moreover, our method is lightweight with model parameters being only 0.08MB.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Detection of pulmonary hypertension associated with congenital heart disease based on time-frequency domain and deep learning features
    Ge, Bingbing
    Yang, Hongbo
    Ma, Pengyue
    Guo, Tao
    Pan, Jiahua
    Wang, Weilian
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81
  • [2] Ecological Sound Events Classification Based on Time-Frequency Features
    Ming, Li
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2016, : 345 - 348
  • [3] Heart Sound Classification Based on Nonlinear Time-frequency Features
    See, Aaron Raymond
    Cabili, Inah Salvador
    Chen, Yeou-Jiunn
    SENSORS AND MATERIALS, 2022, 34 (01) : 217 - 223
  • [4] Transfer learning based bridge damage detection: Leveraging time-frequency features
    Talaei, Saeid
    Zhu, Xinqun
    Li, Jianchun
    Yu, Yang
    Chan, Tommy H. T.
    STRUCTURES, 2023, 57
  • [5] Unsupervised Blue Whale Call Detection Using Multiple Time-Frequency Features
    Cuevas, Alejandro
    Veragua, Alejandro
    Espanol-Jimenez, Sonia
    Chiang, Gustavo
    Tobar, Felipe
    2017 CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (CHILECON), 2017,
  • [6] Detection of pulmonary arterial hypertension associated with congenital heart disease based on time-frequency domain and deep learning features
    Ge, Bingbing
    Yang, Hongbo
    Ma, Pengyue
    Guo, Tao
    Pan, Jiahua
    Wang, Weilian
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81
  • [7] SELF-SUPERVISED REPRESENTATION LEARNING FOR UNSUPERVISED ANOMALOUS SOUND DETECTION UNDER DOMAIN SHIFT
    Chen, Han
    Song, Yan
    Dai, Li-Rong
    McLoughlin, Ian
    Liu, Lin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 471 - 475
  • [8] Investigation of the effectiveness of time-frequency domain images and acoustic features in urban sound classification
    Ozseven, Turgut
    APPLIED ACOUSTICS, 2023, 211
  • [9] A parallel neural network based structural anomaly detection: Leveraging time-frequency domain features
    He, Yingying
    Yang, Bo
    Jin, Weihong
    Zhang, Likai
    Chen, Hongyang
    NEUROCOMPUTING, 2025, 634
  • [10] A study of time-frequency features for CNN-based automatic heart sound classification for pathology detection
    Bozkurt, Baris
    Germanakis, Ioannis
    Stylianou, Yannis
    COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 100 : 132 - 143