A lightweight and rapidly converging transformer based on separable linear self-attention for fault diagnosis

Cited by: 0
Authors
Yin, Kexin [1]
Chen, Chunjun [1]
Shen, Qi [1]
Deng, Ji [1]
Affiliations
[1] Southwest Jiaotong Univ, Sch Mech Engn, Chengdu 610031, Sichuan, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
fault diagnosis; transformer; lightweight; inductive bias; FUSION;
DOI
10.1088/1361-6501/ad9f89
Chinese Library Classification number
T [Industrial Technology]
Subject classification code
08
Abstract
Intelligent fault diagnosis techniques for rotating machinery help in reaching reliable decisions on equipment maintenance. Recently, the Transformer model has demonstrated exceptional capabilities in global feature modeling for fault diagnosis tasks, garnering significant attention from the academic community. However, it lacks sufficient prior knowledge regarding rotation invariance, scale, and shift, necessitating pre-training on extensive datasets. In comparison, contemporary convolutional neural networks (CNNs) are easier to optimize. This limitation becomes particularly evident when applying the Transformer model in fault diagnosis scenarios with limited data availability. Moreover, the increasing number of parameters and FLOPs poses a challenge to its suitability for mobile services due to the limited computational resources available on edge devices. To mitigate these issues, this paper introduces a novel lightweight Transformer (SepFormer) based on separable linear self-attention (LSA) for fault diagnosis tasks. The SepFormer performs a novel sequence-level feature embedding to better leverage the inductive bias inherent in the convolutional layers. Furthermore, it integrates a novel separable LSA mechanism into the Transformer architecture, effectively mitigating the computational burden and significantly enhancing the training convergence speed. Extensive experiments are conducted on a bearing fault dataset and a gear fault dataset. The experimental results demonstrate that the SepFormer achieves a top-1 accuracy exceeding state-of-the-art approaches by more than 5%, while utilizing the fewest FLOPs. Moreover, the optimizability of SepFormer surpasses that of CNNs, ensuring its superior preservation of inductive bias.
Pages: 13
Related papers
31 in total
  • [1] Fault Diagnosis of DAB Converters Based on ResNet With Adaptive Threshold Denoising
    Cai, Fenghuang
    Zhan, Mingsong
    Chai, Qinqin
    Jiang, Jiahui
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [2] A Mask Self-Supervised Learning-Based Transformer for Bearing Fault Diagnosis With Limited Labeled Samples
    Cen, Jian
    Yang, Zhuohong
    Wu, Yinbo
    Hu, Xueliang
    Jiang, Liwei
    Chen, Honghua
    Si, Weiwei
    [J]. IEEE SENSORS JOURNAL, 2023, 23 (10) : 10359 - 10369
  • [3] Hyneter: Hybrid Network Transformer for Multiple Computer Vision Tasks
    Chen, Dong
    Miao, Duoqian
    Zhao, Xuerong
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (06) : 8773 - 8785
  • [4] Intelligent Fault Diagnosis for Rotary Machinery Using Transferable Convolutional Neural Network
    Chen, Zhuyun
    Gryllias, Konstantinos
    Li, Weihua
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (01) : 339 - 349
  • [5] CLFormer: A Lightweight Transformer Based on Convolutional Embedding and Linear Self-Attention With Strong Robustness for Bearing Fault Diagnosis Under Limited Sample Conditions
    Fang, Hairui
    Deng, Jin
    Bai, Yaoxu
    Feng, Bo
    Li, Sheng
    Shao, Siyu
    Chen, Dongsheng
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [6] A survey on Deep Learning based bearing fault diagnosis
    Hoang, Duy-Tang
    Kang, Hee-Jun
    [J]. NEUROCOMPUTING, 2019, 335 : 327 - 335
  • [7] Hu J, 2019, Arxiv, DOI [arXiv:1709.01507, DOI 10.48550/ARXIV.1709.01507]
  • [8] A Time Series Transformer based method for the rotating machinery fault diagnosis
    Jin, Yuhong
    Hou, Lei
    Chen, Yushu
    [J]. NEUROCOMPUTING, 2022, 494 : 379 - 395
  • [9] Lessmeier C., 2016, PHM SOC EUR C, DOI 10.36001/phme.2016.v3i1.1577
  • [10] Energy-Propagation Graph Neural Networks for Enhanced Out-of-Distribution Fault Analysis in Intelligent Construction Machinery Systems
    Li, Xinming
    Li, Meng
    Gu, Jiawei
    Wang, Yanxue
    Yao, Jiachi
    Feng, Jianbo
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (01) : 531 - 543