Adaptive Knowledge Transfer by Continual Weighted Updating of Filter Kernels for Few-Shot Fault Diagnosis of Machines

被引:60
作者
Xing, Saibo [1 ]
Lei, Yaguo [1 ]
Yang, Bin [1 ]
Lu, Na [2 ]
机构
[1] Xi An Jiao Tong Univ, Educ Minist Modern Design & Rotor Bearing Syst, Key Lab, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Syst Engn Inst, Xian 710049, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Task analysis; Data models; Knowledge transfer; Fault diagnosis; Kernel; Adaptation models; Training; Continual machine learning (CML); few-shot learning; mechanical fault diagnosis; restricted Boltzmann machine; transfer learning; NEURAL-NETWORK; MODEL;
D O I
10.1109/TIE.2021.3063975
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning (DL) based diagnosis models have to be trained by large quantities of monitoring data of machines. However, in real-case scenarios, machines operate under the normal condition in most of their life time while faults seldom happen. Therefore, though massive data are accessible, most are data of the normal condition while fault data are still extremely limited. In other words, fault diagnosis of real machines is actually a few-shot diagnosis problem. To deal with few-shot diagnosis, this article proposes adaptive knowledge transfer with multiclassifier ensemble (AKTME) under the paradigm of continual machine learning. In AKTME, knowledge learned by DL models is considered to be represented by the learnable filter kernels (FKs). The key of AKTME is a proposed continual weighted updating (CWU) technique of FKs. By CWU, shared FKs are distilled from multiple auxiliary tasks and adaptively transferred to the target task. Then by multiclassifier ensemble, AKTME is able to recognize faults with few fault data accessible. AKTME is applied on two few-shot diagnosis cases. Results verify that AKTME achieves higher diagnosis accuracies than recently proposed methods. Moreover, AKTME tends to improve the diagnosis accuracy as it prelearns on more auxiliary tasks continually.
引用
收藏
页码:1968 / 1976
页数:9
相关论文
共 30 条
  • [1] [Anonymous], 2018, ARXIV180102808
  • [2] [Anonymous], 2012, PROCEEDINGSOFTHE 25T
  • [3] Cho K.H., 2013, P IJCNN, P1
  • [4] Fink M., 2004, P NIPS, P449, DOI DOI 10.5555/2976040.2976097
  • [5] Training restricted Boltzmann machines: An introduction
    Fischer, Asja
    Igel, Christian
    [J]. PATTERN RECOGNITION, 2014, 47 (01) : 25 - 39
  • [6] Data augmentation in fault diagnosis based on the Wasserstein generative adversarial network with gradient penalty
    Gao, Xin
    Deng, Fang
    Yue, Xianghu
    [J]. NEUROCOMPUTING, 2020, 396 (396) : 487 - 494
  • [7] Multitask Convolutional Neural Network With Information Fusion for Bearing Fault Diagnosis and Localization
    Guo, Sheng
    Zhang, Bin
    Yang, Tao
    Lyu, Dongzhen
    Gao, Wei
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (09) : 8005 - 8015
  • [8] An Imbalance Modified Deep Neural Network With Dynamical Incremental Learning for Chemical Fault Diagnosis
    Hu, Zhixin
    Jiang, Peng
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2019, 66 (01) : 540 - 550
  • [9] Deep normalized convolutional neural network for imbalanced fault classification of machinery and its understanding via visualization
    Jia, Feng
    Lei, Yaguo
    Lu, Na
    Xing, Saibo
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2018, 110 : 349 - 367
  • [10] Overcoming catastrophic forgetting in neural networks
    Kirkpatricka, James
    Pascanu, Razvan
    Rabinowitz, Neil
    Veness, Joel
    Desjardins, Guillaume
    Rusu, Andrei A.
    Milan, Kieran
    Quan, John
    Ramalho, Tiago
    Grabska-Barwinska, Agnieszka
    Hassabis, Demis
    Clopath, Claudia
    Kumaran, Dharshan
    Hadsell, Raia
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (13) : 3521 - 3526