Reinforcement learning-based cost-sensitive classifier for imbalanced fault classification

被引:8
作者
Zhang, Xinmin [1 ]
Fan, Saite [1 ]
Song, Zhihuan [1 ]
机构
[1] Zhejiang Univ, Coll Control Sci & Engn, State Key Lab Ind Control Technol, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
imbalanced fault classification; fault diagnosis; industrial process monitoring; deep reinforcement learning; cost-sensitive learning; policy gradient; sample weights; PROCESS INDUSTRY; NEURAL-NETWORKS;
D O I
10.1007/s11432-021-3775-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fault classification plays a crucial role in the industrial process monitoring domain. In the datasets collected from real-life industrial processes, the data distribution is usually imbalanced. The datasets contain a large amount of normal data (majority) and only a small amount of faulty data (minority); this phenomenon is also known as the imbalanced fault classification problem. To solve the imbalanced fault classification problem, a novel reinforcement learning (RL)-based cost-sensitive classifier (RLCC) based on policy gradient is proposed in this paper. In RLCC, a novel cost-sensitive learning strategy based on policy gradient and the actor-critic of RL is developed. The novel cost-sensitive learning strategy can adaptively learn the cost matrix and dynamically yield the sample weights. In addition, RLCC uses a newly designed reward to train the sample weight learner and classifier using an alternating iterative approach. The alternating iterative approach makes RLCC highly flexible and effective in solving the imbalanced fault classification problem. The effectiveness and practicability of the proposed RLCC method are verified through its application in a real-world dataset and an industrial process benchmark.
引用
收藏
页数:14
相关论文
共 46 条
[1]  
Alejo R, 2008, LECT NOTES COMPUT SC, V5197, P479, DOI 10.1007/978-3-540-85920-8_59
[2]  
Alejo R, 2007, LECT NOTES COMPUT SC, V4507, P162
[3]  
[Anonymous], 2000, ICML
[4]   Natural actor-critic algorithms [J].
Bhatnagar, Shalabh ;
Sutton, Richard S. ;
Ghavamzadeh, Mohammad ;
Lee, Mark .
AUTOMATICA, 2009, 45 (11) :2471-2482
[5]  
Brodersen Kay H., 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3121, DOI 10.1109/ICPR.2010.764
[6]   Novel Cost-Sensitive Approach to Improve the Multilayer Perceptron Performance on Imbalanced Data [J].
Castro, Cristiano L. ;
Braga, Antonio P. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (06) :888-899
[7]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[8]   K-means Bayes algorithm for imbalanced fault classification and big data application [J].
Chen, Gecheng ;
Liu, Yue ;
Ge, Zhiqiang .
JOURNAL OF PROCESS CONTROL, 2019, 81 :54-64
[9]   Decision threshold adjustment in class prediction [J].
Chen, J. J. ;
Tsai, C. -A. ;
Moon, H. ;
Ahn, H. ;
Young, J. J. ;
Chen, C. -H. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2006, 17 (03) :337-352
[10]   Imbalanced Sample Selection With Deep Reinforcement Learning for Fault Diagnosis [J].
Fan, Saite ;
Zhang, Xinmin ;
Song, Zhihuan .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (04) :2518-2527