Transfer reinforcement learning method with multi-label learning for compound fault recognition

被引:30
作者
Wang, Zisheng [1 ]
Zhang, Qing [1 ]
Tang, Lv [1 ]
Shi, Tielin [1 ]
Xuan, Jianping [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Compound fault recognition; Deep reinforcement learning; Multi-label learning; Transfer learning; Trust region policy optimization; ROTATING MACHINERY; NEURAL-NETWORKS; DEEP; DIAGNOSIS; GAME; GO;
D O I
10.1016/j.aei.2022.101818
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In complex working site, bearings used as the important part of machine, could simultaneously have faults on several positions. Consequently, multi-label learning approach considering fully the correlation between different faulted positions of bearings becomes the popular learning pattern. Deep reinforcement learning (DRL) combining the perception ability of deep learning and the decision-making ability of reinforcement learning, could be adapted to the compound fault diagnosis while having a strong ability extracting the fault feature from the raw data. However, DRL is difficult to converge and easily falls into the unstable training problem. Therefore, this paper integrates the feature extraction ability of DRL and the knowledge transfer ability of transfer learning (TL), and proposes the multi-label transfer reinforcement learning (ML-TRL). In detail, the proposed method utilizes the improved trust region policy optimization (TRPO) as the basic DRL framework and pre-trains the fixed convolutional networks of ML-TRL using the multi-label convolutional neural network method. In compound fault experiment, the final results demonstrate powerfully that the proposed method could have the higher accuracy than other multi-label learning methods. Hence, the proposed method is a remarkable alternative when recognizing the compound fault of bearings.
引用
收藏
页数:13
相关论文
共 47 条
[1]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[2]   Parametric Doppler correction analysis for wayside acoustic bearing fault diagnosis [J].
Ding, Xiaoxi ;
Li, Yulan ;
Xiao, Jiawei ;
He, Qingbo ;
Yang, Xiaoqing ;
Shao, Yimin .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 166
[3]   Intelligent fault diagnosis for rotating machinery using deep Q-network based health state classification: A deep reinforcement learning approach [J].
Ding, Yu ;
Ma, Liang ;
Ma, Jian ;
Suo, Mingliang ;
Tao, Laifa ;
Cheng, Yujie ;
Lu, Chen .
ADVANCED ENGINEERING INFORMATICS, 2019, 42
[4]   Electromechanical modeling of a motor-gearbox system for local gear tooth faults detection [J].
El Yousfi, Bilal ;
Soualhi, Abdenour ;
Medjaher, Kamal ;
Guillet, Francois .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 166
[5]  
Fan JQ, 2020, PR MACH LEARN RES, V120, P486
[6]   A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J].
Grondman, Ivo ;
Busoniu, Lucian ;
Lopes, Gabriel A. D. ;
Babuska, Robert .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) :1291-1307
[7]  
Haarnoja T., 2018, SOFT ACTOR CRITIC AL
[8]   Deep transfer network with joint distribution adaptation: A new intelligent fault diagnosis framework for industry application [J].
Han, Te ;
Liu, Chao ;
Yang, Wenguang ;
Jiang, Dongxiang .
ISA TRANSACTIONS, 2020, 97 :269-281
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Ensemble transfer CNNs driven by multi-channel signals for fault diagnosis of rotating machinery cross working conditions [J].
He Zhiyi ;
Shao Haidong ;
Zhong Xiang ;
Zhao Xianzhu .
KNOWLEDGE-BASED SYSTEMS, 2020, 207