Imbalanced fault diagnosis of rotating machinery via multi-domain feature extraction and cost-sensitive learning

被引:75
作者
Xu, Qifa [1 ,2 ]
Lu, Shixiang [1 ]
Jia, Weiyin [3 ]
Jiang, Cuixia [1 ]
机构
[1] Hefei Univ Technol, Sch Management, Hefei 230009, Anhui, Peoples R China
[2] Minist Educ, Key Lab Proc Optimizat & Intelligent Decis Making, Hefei 230009, Anhui, Peoples R China
[3] Anhui Ronds Sci & Technol Inc Co, Hefei 230088, Peoples R China
基金
中国国家自然科学基金;
关键词
Rotating machinery; Fault diagnosis; Imbalanced classification; Feature extraction; Cost-sensitive learning; DATA-DRIVEN; NEURAL-NETWORK; CLASSIFICATION; BEARINGS; DESIGN; SMOTE;
D O I
10.1007/s10845-019-01522-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fault diagnosis plays an essential role in rotating machinery manufacturing systems to reduce their maintenance costs. How to improve diagnosis accuracy remains an open issue. To this end, we develop a novel framework through combined use of multi-domain vibration feature extraction, feature selection and cost-sensitive learning method. First, we extract time-domain, frequency-domain, and time-frequency-domain features to make full use of vibration signals. Second, a feature selection technique is employed to obtain a feature subset with good generalization properties, by simultaneously measuring the relevance and redundancy of features. Third, a cost-sensitive learning method is designed for a classifier to effectively learn the discriminating boundaries, with an extremely imbalanced distribution of fault instances. For illustration, a real-world dataset of rotating machinery collected from an oil refinery in China is utilized. The extensive experiments have demonstrated that our multi-domain feature extraction and feature selection can significantly improve the diagnosis accuracy. Meanwhile, our cost-sensitive learning method consistently outperforms the traditional classifiers such as support vector machine (SVM), gradient boosting decision tree (GBDT), etc., and even better than the classification method calibrated by six popular imbalanced data resampling algorithms, such as the Synthetic Minority Over-sampling Technique (SMOTE) and the Adaptive Synthetic sampling method (ADASYN), in terms of decreasing missed alarms and reducing the average cost. Owing to its high evaluation scores and low average misclassification cost, cost-sensitive GBDT (CS-GBDT) is preferred for imbalanced fault diagnosis in practice.
引用
收藏
页码:1467 / 1481
页数:15
相关论文
共 56 条
  • [1] Magnetic Levitation Systems for Cost-Sensitive Applications-Some Design Aspects
    Amrhein, Wolfgang
    Gruber, Wolfgang
    Bauer, Walter
    Reisinger, Martin
    [J]. IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2016, 52 (05) : 3739 - 3752
  • [2] Example-dependent cost-sensitive decision trees
    Bahnsen, Alejandro Correa
    Aouada, Djamila
    Ottersten, Bjoern
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (19) : 6609 - 6619
  • [3] Online automatic diagnosis of wind turbine bearings progressive degradations under real experimental conditions based on unsupervised machine learning
    Ben Ali, Jaouher
    Saidi, Lotfi
    Harrath, Salma
    Bechhoefer, Eric
    Benbouzid, Mohamed
    [J]. APPLIED ACOUSTICS, 2018, 132 : 167 - 181
  • [4] Beygelzimer Alina, 2005, INT C MACH LEARN, P49
  • [5] Novel Cost-Sensitive Approach to Improve the Multilayer Perceptron Performance on Imbalanced Data
    Castro, Cristiano L.
    Braga, Antonio P.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (06) : 888 - 899
  • [6] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [7] Statistical Spectral Analysis for Fault Diagnosis of Rotating Machines
    Ciabattoni, Lucio
    Ferracuti, Francesco
    Freddi, Alessandro
    Monteriu, Andrea
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2018, 65 (05) : 4301 - 4310
  • [8] COMPARING PREDICTIVE ACCURACY
    DIEBOLD, FX
    MARIANO, RS
    [J]. JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 1995, 13 (03) : 253 - 263
  • [9] Minimum redundancy feature selection from microarray gene expression data
    Ding, C
    Peng, HC
    [J]. PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 523 - 528
  • [10] Domingos P., 1999, P 5 ACM SIGKDD INT C, P155, DOI DOI 10.1145/312129.312220