A two-stage multi-scale domain adversarial transfer learning method and its application in fault diagnosis

被引:5
作者
Zhang, Mingyuan [1 ]
Huang, Chengxuan [1 ]
Wang, Hongsen [1 ]
He, Chen [1 ]
Yang, Debin [1 ]
Yang, Jianhong [1 ,2 ,3 ]
机构
[1] Univ Sci & Technol Beijing, Sch Mech Engn, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Tech Support Ctr Prevent & Control Disastrous Acci, Beijing 100083, Peoples R China
[3] Univ Sci & Technol Beijing, Res Inst Macrosafety Sci, Beijing 100083, Peoples R China
关键词
fault diagnosis; deep transfer learning; domain adversarial learning; multi-scale feature extraction; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1088/1361-6501/acf6d9
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Domain adaptation (DA) based intelligent fault diagnosis (IFD) methods have attracted great attention in recent years. The key motivation of DA methods is to extract the domain-invariant features. In most cases, the one-dimensional convolutional neural networks are adopted as the feature extractor, in which the kernels are usually single and fixed. However, the monitoring data for IFD usually involve various scale information, and the feature representation extracted by the above models may be incomprehensive. Moreover, the target domain data is only used to narrow the distribution discrepancy in an unsupervised way, which may lead to ignorance of class information of the target domain. To address these issues, in this paper, a two-stage multi-scale domain adversarial fault diagnosis method is proposed. A multi-scale feature extractor with different kernel sizes is designed to acquire more discriminative domain-invariant features. Meanwhile, pseudo label learning is adopted to provide the transfer learning process with the pseudo labels of the target domain, which are generated by a pre-trained network in the first stage and then are optimized through iterations in the second stage. The maximum mean discrepancy is also adopted to enhance the ability of the model for marginal distribution alignment, which can make the model more robust. Thirty-eight transfer tasks from two different datasets were conducted to evaluate the effectiveness of the proposed method. The experimental results demonstrated that the proposed method achieved higher average diagnosis accuracy compared with several popular methods. The superiority of our proposed method was further explained by visualization of learned features.
引用
收藏
页数:15
相关论文
共 36 条
[1]  
Ganin Y, 2016, J MACH LEARN RES, V17
[2]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[3]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[4]   A Hybrid Generalization Network for Intelligent Fault Diagnosis of Rotating Machinery Under Unseen Working Conditions [J].
Han, Te ;
Li, Yan-Fu ;
Qian, Min .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
[5]   Deep transfer learning with limited data for machinery fault diagnosis [J].
Han, Te ;
Liu, Chao ;
Wu, Rui ;
Jiang, Dongxiang .
APPLIED SOFT COMPUTING, 2021, 103
[6]   A novel adversarial learning framework in deep convolutional neural network for intelligent diagnosis of mechanical faults [J].
Han, Te ;
Liu, Chao ;
Yang, Wenguang ;
Jiang, Dongxiang .
KNOWLEDGE-BASED SYSTEMS, 2019, 165 :474-487
[7]   Training products of experts by minimizing contrastive divergence [J].
Hinton, GE .
NEURAL COMPUTATION, 2002, 14 (08) :1771-1800
[8]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
[9]   Double-level adversarial domain adaptation network for intelligent fault diagnosis [J].
Jiao, Jinyang ;
Lin, Jing ;
Zhao, Ming ;
Liang, Kaixuan .
KNOWLEDGE-BASED SYSTEMS, 2020, 205
[10]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324