Reinforcement learning-based cost-sensitive classifier for imbalanced fault classification

被引：8

作者：

Zhang, Xinmin ^{[1
]}

Fan, Saite ^{[1
]}

Song, Zhihuan ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Control Sci & Engn, State Key Lab Ind Control Technol, Hangzhou 310027, Peoples R China

来源：

SCIENCE CHINA-INFORMATION SCIENCES | 2023年 / 66卷 / 11期

基金：

中国国家自然科学基金;

关键词：

imbalanced fault classification; fault diagnosis; industrial process monitoring; deep reinforcement learning; cost-sensitive learning; policy gradient; sample weights; PROCESS INDUSTRY; NEURAL-NETWORKS;

D O I：

10.1007/s11432-021-3775-4

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Fault classification plays a crucial role in the industrial process monitoring domain. In the datasets collected from real-life industrial processes, the data distribution is usually imbalanced. The datasets contain a large amount of normal data (majority) and only a small amount of faulty data (minority); this phenomenon is also known as the imbalanced fault classification problem. To solve the imbalanced fault classification problem, a novel reinforcement learning (RL)-based cost-sensitive classifier (RLCC) based on policy gradient is proposed in this paper. In RLCC, a novel cost-sensitive learning strategy based on policy gradient and the actor-critic of RL is developed. The novel cost-sensitive learning strategy can adaptively learn the cost matrix and dynamically yield the sample weights. In addition, RLCC uses a newly designed reward to train the sample weight learner and classifier using an alternating iterative approach. The alternating iterative approach makes RLCC highly flexible and effective in solving the imbalanced fault classification problem. The effectiveness and practicability of the proposed RLCC method are verified through its application in a real-world dataset and an industrial process benchmark.

引用

页数：14

共 46 条

[1]

Alejo R, 2008, LECT NOTES COMPUT SC, V5197, P479, DOI 10.1007/978-3-540-85920-8_59

[2]

Alejo R, 2007, LECT NOTES COMPUT SC, V4507, P162

[3]

[Anonymous], 2000, ICML

[4] Natural actor-critic algorithms [J].