A Novel Small-Sample Dense Teacher Assistant Knowledge Distillation Method for Bearing Fault Diagnosis

Cited by: 17
Authors
Zhong, Hongyu [1, 2, 3]
Yu, Samson [3]
Trinh, Hieu [3]
Lv, Yong [1, 2]
Yuan, Rui [1, 2]
Wang, Yanan [3]
Affiliations
[1] Wuhan Univ Sci & Technol, Key Lab Met Equipment & Control Technol, Minist Educ, Wuhan 430081, Peoples R China
[2] Wuhan Univ Sci & Technol, Hubei Key Lab Mech Transmiss & Mfg Engn, Wuhan 430081, Peoples R China
[3] Deakin Univ, Sch Engn, Waurn Ponds, Vic 3216, Australia
Funding
National Natural Science Foundation of China
Keywords
Dense connection; generative adversarial network (GAN); intelligent fault diagnosis; knowledge distillation
DOI
10.1109/JSEN.2023.3307425
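Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract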
Recently, deep learning models have been widely studied and applied in fault diagnosis. However, they have two common drawbacks: 1) they usually require a large amount of storage, making them difficult to run on embedded devices, and 2) sufficient reliable training data are usually not available to train a comprehensive diagnosis model. In this study, a fusion approach based on knowledge distillation and a generative adversarial network (GAN) is proposed. The approach, named small-sample dense teacher assistant knowledge distillation (SS-DTAKD), aims to enable bearing fault diagnosis with small samples and limited on-board storage resources. First, the proposed self-attention GAN (SGAN) is used to expand the training data for the diagnostic model. Its advantage is that embedding the self-attention module in both the generator and the discriminator helps improve the quality of the generated data. Then, the DTAKD method is proposed to compress the model parameters: the dense distillation of multiple teacher-assistant networks helps the student network learn correct knowledge without requiring additional data or storage resources. Additionally, the dual-type data hierarchical training (DDHT) method is applied to train the student network; it is designed to use actual data to improve the student network's performance. Extensive experiments on two bearing fault datasets demonstrate that the data generated by the SGAN have high similarity and robustness. Furthermore, compared with other existing knowledge distillation methods, the proposed SS-DTAKD method obtains higher fault diagnosis accuracy with small samples and limited on-board storage resources.
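The abstract does not give the DTAKD loss, so the following is only a minimal sketch of the general idea it describes: a student that learns from the softened outputs of a teacher and of every teacher-assistant network at once, in addition to the true label. It uses the standard temperature-based distillation loss and averages it over all guiding networks. The function names, the `temperature` and `alpha` parameters, the equal weighting of guides, and the example logits are all assumptions for illustration and are not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of class logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(
        pi * (math.log(max(pi, eps)) - math.log(max(qi, eps)))
        for pi, qi in zip(p, q)
    )


def dense_distillation_loss(student_logits, guide_logits_list, label,
                            temperature=4.0, alpha=0.5):
    """Per-sample loss for a student guided by several networks.

    guide_logits_list holds the logits of the teacher and of each
    teacher assistant (hypothetical setup; equal weights are assumed).
    """
    student_soft = softmax(student_logits, temperature)
    # One soft-target term per guiding network, averaged.
    soft_terms = [
        kl_divergence(softmax(guide, temperature), student_soft)
        for guide in guide_logits_list
    ]
    soft_loss = (temperature ** 2) * sum(soft_terms) / len(soft_terms)
    # Ordinary cross-entropy against the true fault class.
    hard_loss = -math.log(max(softmax(student_logits)[label], 1e-12))
    return alpha * soft_loss + (1 - alpha) * hard_loss


# Illustrative logits for a three-class fault problem (made-up values).
teacher = [6.0, 1.0, 0.5]
assistant = [4.0, 1.5, 0.5]
student = [2.0, 1.0, 0.5]
loss = dense_distillation_loss(student, [teacher, assistant], label=0)
```

In this sketch the soft term vanishes when the student reproduces every guide exactly, and `alpha` trades off imitation of the guides against fitting the labeled data.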
Pages: 24279-24291
Number of pages: 13