A New Deep Transfer Learning Based on Sparse Auto-Encoder for Fault Diagnosis

被引:875
作者
Wen, Long [1 ]
Gao, Liang [1 ]
Li, Xinyu [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Hubei, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2019年 / 49卷 / 01期
关键词
Deep learning (DL); fault diagnosis; sparse auto-encoder (SAE); transfer learning; NEURAL-NETWORKS; SIGNAL; MODEL;
D O I
10.1109/TSMC.2017.2754287
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fault diagnosis plays an important role in modern industry. With the development of smart manufacturing, the data-driven fault diagnosis becomes hot. However, traditional methods have two shortcomings: 1) their performances depend on the good design of handcrafted features of data, but it is difficult to predesign these features and 2) they work well under a general assumption: the training data and testing data should be drawn from the same distribution, but this assumption fails in many engineering applications. Since deep learning (DL) can extract the hierarchical representation features of raw data, and transfer learning provides a good way to perform a learning task on the different but related distribution datasets, deep transfer learning (DTL) has been developed for fault diagnosis. In this paper, a new DTL method is proposed. It uses a three-layer sparse auto-encoder to extract the features of raw data, and applies the maximum mean discrepancy term to minimizing the discrepancy penalty between the features from training data and testing data. The proposed DTL is tested on the famous motor bearing dataset from the Case Western Reserve University. The results show a good improvement, and DTL achieves higher prediction accuracies on most experiments than DL. The prediction accuracy of DTL, which is as high as 99.82%, is better than the results of other algorithms, including deep belief network, sparse filter, artificial neural network, support vector machine and some other traditional methods. What is more, two additional analytical experiments are conducted. The results show that a good unlabeled third dataset may be helpful to DTL, and a good linear relationship between the final prediction accuracies and their standard deviations have been observed.
引用
收藏
页码:136 / 144
页数:9
相关论文
共 41 条
[1]  
Bengio Y, 2012, JMLR WORKSHOP C P, P17
[2]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[3]   Deep transfer learning for classification of time-delayed Gaussian networks [J].
Chaturvedi, Iti ;
Ong, Yew-Soon ;
Arumugam, Rajesh Vellore .
SIGNAL PROCESSING, 2015, 110 :250-262
[4]   Fault Detection and Isolation of Induction Motors Using Recurrent Neural Networks and Dynamic Bayesian Modeling [J].
Cho, Hyun Cheol ;
Knowles, Jeremy ;
Fadali, M. Sami ;
Lee, Kwon Soon .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2010, 18 (02) :430-437
[5]   From Model, Signal to Knowledge: A Data-Driven Perspective of Fault Detection and Diagnosis [J].
Dai, Xuewu ;
Gao, Zhiwei .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2013, 9 (04) :2226-2238
[6]   Construction of hierarchical diagnosis network based on deep learning and its application in the fault pattern recognition of rolling element bearings [J].
Gan, Meng ;
Wang, Cong ;
Zhu, Chang'an .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2016, 72-73 :92-104
[7]  
Gao Z., 2015, IEEE T IND ELECTRON, V62, P3768, DOI DOI 10.1109/TIE.2015.2419013
[8]  
Gao ZW, 2015, IEEE T IND ELECTRON, V62, P3757, DOI [10.1109/TIE.2015.2417501, 10.1109/TIE.2015.2419013]
[9]  
Glorot X., 2011, Proceedings of the 28th international conference on machine learning, P513
[10]  
Gretton A., 2006, Advances in Neural Information Processing Systems, P513