Diagnosis of Rolling Bearing Based on Classification for High Dimensional Unbalanced Data

被引:45
作者
Hang, Qi [1 ]
Yang, Jinghui [1 ]
Xing, Lining [2 ]
机构
[1] Shanghai Polytech Univ, Sch Intelligent Mfg & Control Engn, Shanghai 200120, Peoples R China
[2] Natl Univ Def Technol, Coll Informat Syst & Management, Changsha 410073, Hunan, Peoples R China
关键词
Fault diagnosis of rolling bearing; high dimensional unbalanced data; random forests; PRINCIPAL COMPONENT ANALYSIS; IMBALANCED DATA; FEATURE-SELECTION; FAULT-DIAGNOSIS; GENE SELECTION; ALGORITHM; ENSEMBLE; MODELS; SMOTE; PCA;
D O I
10.1109/ACCESS.2019.2919406
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Motor systems are becoming more and more vital in modern manufacturing and bearings play an important role in the performance of a motor system. Many problems that arise in motor operation are related to bearing faults. In many cases, the accuracy of the devices for monitoring or controlling a motor system highly depends on the dynamic properties of motor bearings. Thus, fault diagnosis of a motor system is inseparably related to the diagnosis of the bearing assembly. The fault diagnosis of rolling bearings is substantially a classification problem. The traditional application of random forest (RF) to fault diagnosis methods is based on balanced data. However, in a practical situation, it is difficult to collect the fault data that are usually unbalanced. In order to solve this problem, in the first step, we propose a two-step (TS) clustering algorithm to enhance the original synthetic minority oversampling technique (SMOTE) algorithm for the unbalanced data classification. Then, based on the improvement of the SMOTE algorithm, we propose the principal component analysis (PCA) and apply it in the field of high-dimensional unbalanced fault diagnosis data. In this paper, we apply this new method to the fault diagnosis of rolling bearings, and the experiments conducted in the end show that the improved algorithm has a better classification performance.
引用
收藏
页码:79159 / 79172
页数:14
相关论文
共 63 条
[1]  
Almas A., 2012, 2012 Seventh International Conference on Digital Information Management (ICDIM 2012), P7, DOI 10.1109/ICDIM.2012.6360115
[2]  
[Anonymous], 2010, BIOM ENG ICBME 2010
[3]  
[Anonymous], 2012, Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, DOI [DOI 10.1145/1401890.1401910, 10.1145/1401890.1401910]
[4]  
Babic V, 2012, ROM AGRIC RES, V29, P53
[5]   Unsupervised Learning [J].
Barlow, H. B. .
NEURAL COMPUTATION, 1989, 1 (03) :295-311
[6]   Semi-supervised learning on Riemannian manifolds [J].
Belkin, M ;
Niyogi, P .
MACHINE LEARNING, 2004, 56 (1-3) :209-239
[7]   A classifier based on the artificial neural network approach for cardiologic auscultation in pediatrics [J].
Bhatikar, SR ;
DeGroff, C ;
Mahajan, RL .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 33 (03) :251-260
[8]   SMOTE for high-dimensional class-imbalanced data [J].
Blagus, Rok ;
Lusa, Lara .
BMC BIOINFORMATICS, 2013, 14
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)