Adaptive Knowledge Transfer by Continual Weighted Updating of Filter Kernels for Few-Shot Fault Diagnosis of Machines

被引：60

作者：

Xing, Saibo ^{[1
]}

Lei, Yaguo ^{[1
]}

Yang, Bin ^{[1
]}

Lu, Na ^{[2
]}

机构：

[1] Xi An Jiao Tong Univ, Educ Minist Modern Design & Rotor Bearing Syst, Key Lab, Xian 710049, Peoples R China

[2] Xi An Jiao Tong Univ, Syst Engn Inst, Xian 710049, Peoples R China

来源：

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS | 2022年 / 69卷 / 02期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Task analysis; Data models; Knowledge transfer; Fault diagnosis; Kernel; Adaptation models; Training; Continual machine learning (CML); few-shot learning; mechanical fault diagnosis; restricted Boltzmann machine; transfer learning; NEURAL-NETWORK; MODEL;

D O I：

10.1109/TIE.2021.3063975

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep learning (DL) based diagnosis models have to be trained by large quantities of monitoring data of machines. However, in real-case scenarios, machines operate under the normal condition in most of their life time while faults seldom happen. Therefore, though massive data are accessible, most are data of the normal condition while fault data are still extremely limited. In other words, fault diagnosis of real machines is actually a few-shot diagnosis problem. To deal with few-shot diagnosis, this article proposes adaptive knowledge transfer with multiclassifier ensemble (AKTME) under the paradigm of continual machine learning. In AKTME, knowledge learned by DL models is considered to be represented by the learnable filter kernels (FKs). The key of AKTME is a proposed continual weighted updating (CWU) technique of FKs. By CWU, shared FKs are distilled from multiple auxiliary tasks and adaptively transferred to the target task. Then by multiclassifier ensemble, AKTME is able to recognize faults with few fault data accessible. AKTME is applied on two few-shot diagnosis cases. Results verify that AKTME achieves higher diagnosis accuracies than recently proposed methods. Moreover, AKTME tends to improve the diagnosis accuracy as it prelearns on more auxiliary tasks continually.

引用

页码：1968 / 1976

页数：9

共 30 条

[1] [Anonymous], 2018, ARXIV180102808
[2] [Anonymous], 2012, PROCEEDINGSOFTHE 25T
[3] Cho K.H., 2013, P IJCNN, P1
[4] Fink M., 2004, P NIPS, P449, DOI DOI 10.5555/2976040.2976097
[5] Training restricted Boltzmann machines: An introduction
Fischer, Asja
Igel, Christian
[J]. PATTERN RECOGNITION, 2014, 47 (01) : 25 - 39
[6] Data augmentation in fault diagnosis based on the Wasserstein generative adversarial network with gradient penalty
Gao, Xin
Deng, Fang
Yue, Xianghu
[J]. NEUROCOMPUTING, 2020, 396 (396) : 487 - 494
[7] Multitask Convolutional Neural Network With Information Fusion for Bearing Fault Diagnosis and Localization
Guo, Sheng
Zhang, Bin
Yang, Tao
Lyu, Dongzhen
Gao, Wei
[J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (09) : 8005 - 8015
[8] An Imbalance Modified Deep Neural Network With Dynamical Incremental Learning for Chemical Fault Diagnosis
Hu, Zhixin
Jiang, Peng
[J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2019, 66 (01) : 540 - 550
[9] Deep normalized convolutional neural network for imbalanced fault classification of machinery and its understanding via visualization
Jia, Feng
Lei, Yaguo
Lu, Na
Xing, Saibo
[J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2018, 110 : 349 - 367
[10] Overcoming catastrophic forgetting in neural networks
Kirkpatricka, James
Pascanu, Razvan
Rabinowitz, Neil
Veness, Joel
Desjardins, Guillaume
Rusu, Andrei A.
Milan, Kieran
Quan, John
Ramalho, Tiago
Grabska-Barwinska, Agnieszka
Hassabis, Demis
Clopath, Claudia
Kumaran, Dharshan
Hadsell, Raia
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (13) : 3521 - 3526

← 1 2 3 →