Mutual information-based feature disentangled network for anomaly detection under variable working conditions

被引:17
作者
Hu, Chenye [1 ,2 ]
Wu, Jingyao [1 ,2 ]
Sun, Chuang [1 ,2 ]
Chen, Xuefeng [1 ,2 ]
Yan, Ruqiang [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Mech Engn, Xian 710049, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, State Key Lab Mfg Syst Engn, Xian 710049, Shaanxi, Peoples R China
关键词
Anomaly detection; Deep learning; Mutual information; Feature disentanglement; Contrastive learning; Variable working conditions;
D O I
10.1016/j.ymssp.2023.110804
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Anomaly detection is of great significance to ensure operational safety of advanced equipment. However, existing anomaly detection algorithms presume that the working conditions are invariable, which cannot adapt to actual industrial scenarios and may cause false or missing alarms when the working conditions change. To address this issue, we expect to learn a condition-independent feature space, from which anomaly indicators can be derived to enable conditionagnostic anomaly detection. The objective to find such space is then deduced intuitively and justified by information theory. Correspondingly, a Mutual Information-based Feature Disentangled Network (MIFD-Net) is proposed for anomaly detection under variable working conditions. Specifically, to ensure feature independence from working conditions, the mutual information between latent representations and conditions is minimized by reducing its variational upper bound. Meanwhile, a self-supervised contrastive loss is deduced to maximize the mutual information between latent features and observed signals, which improves the fault discriminability of latent features. Finally, both feature constraints are integrated via dynamic weights in model training. Reconstruction errors derived from latent features are defined as anomaly scores for condition-agnostic anomaly detection. Experiments on part-level and component-level fault datasets demonstrate the superiority of the proposed method under both multiple and varying working condition scenarios.
引用
收藏
页数:19
相关论文
共 37 条
  • [1] Barber D, 2004, ADV NEUR IN, V16, P201
  • [2] Belghazi MI, 2018, PR MACH LEARN RES, V80
  • [3] Anomaly detection and critical SCADA parameters identification for wind turbines based on LSTM-AE neural network
    Chen, Hansi
    Liu, Hang
    Chu, Xuening
    Liu, Qingxiu
    Xue, Deyi
    [J]. RENEWABLE ENERGY, 2021, 172 : 829 - 840
  • [4] Chen T, 2020, PR MACH LEARN RES, V119
  • [5] Cheng PY, 2020, PR MACH LEARN RES, V119
  • [6] High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning
    Erfani, Sarah M.
    Rajasegarar, Sutharshan
    Karunasekera, Shanika
    Leckie, Christopher
    [J]. PATTERN RECOGNITION, 2016, 58 : 121 - 134
  • [7] On the application of kernelised Bayesian transfer learning to population-based structural health monitoring
    Gardner, P.
    Bull, L. A.
    Dervilis, N.
    Worden, K.
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 167
  • [8] Geiger D, 1990, Machine intelligence and pattern recognition, V10, P139, DOI DOI 10.1016/B978-0-444-88738-2.50018-X
  • [9] Interinstance and Intratemporal Self-Supervised Learning With Few Labeled Data for Fault Diagnosis
    Hu, Chenye
    Wu, Jingyao
    Sun, Chuang
    Yan, Ruqiang
    Chen, Xuefeng
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (05) : 6502 - 6512
  • [10] Beaudry NJ, 2012, Arxiv, DOI arXiv:1107.0740