Attribute clustering using rough set theory for feature selection in fault severity classification of rotating machinery

Cited by: 91
Authors
Pacheco, Fannia [1]
Cerrada, Mariela [1,2]
Sanchez, Rene-Vinicio [1]
Cabrera, Diego [1]
Li, Chuan [3]
de Oliveira, Jose Valente [4]
Affiliations
[1] Univ Politecn Salesiana, Dept Mech Engn, Calle Vieja, Cuenca, Ecuador
[2] Univ Los Andes, CEMISID, Merida, Venezuela
[3] Chongqing Technol & Business Univ, Natl Res Base Intelligent Mfg Serv, Chongqing, Peoples R China
[4] Univ Algarve, CEOT, Faro, Portugal
Keywords
Attribute clustering; Rough set; Feature selection; Fault severity classification; Rotating machinery; UNSUPERVISED FEATURE-SELECTION; FEATURE SUBSET-SELECTION; GENETIC ALGORITHMS; DIAGNOSIS; INFORMATION
DOI
10.1016/j.eswa.2016.11.024
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
The number of features extracted in real-world applications keeps growing, while the performance of machine learning methods degrades as dimensionality increases, so feature reduction is required. In fault diagnosis of rotating machinery in particular, the number of extracted features is sizable because all available information from several monitored signals is collected. Several approaches reduce the data using supervised or unsupervised strategies; the supervised ones are the most reliable, but their main disadvantage is that the fault condition must be known beforehand. This work proposes a new unsupervised algorithm for feature selection based on attribute clustering and rough set theory. Rough set theory is used to compute similarities between features through the relative dependency. The clustering approach combines distance-based classification with prototype-based clustering to group similar features, without requiring the number of clusters as an input. Additionally, the algorithm has an evolving property that allows the cluster structure to be adjusted dynamically during the clustering process, even when a new set of attributes is fed to the algorithm. This gives the algorithm an incremental learning property and avoids retraining. These properties define the main contribution and significance of the proposed algorithm. Two fault severity classification problems, in gears and in bearings, are studied to test the algorithm. Classification results show that the features selected by the proposed algorithm yield accuracy comparable to that of other feature selection and reduction approaches. (C) 2016 Elsevier Ltd. All rights reserved.
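The abstract does not give the paper's exact definition of relative dependency, so the following is only a minimal sketch of the classical rough set dependency degree, gamma_P(Q) = |POS_P(Q)| / |U|, which is one common way to measure how strongly one set of discretized features determines another. The helper names `partition` and `dependency` and the toy table `rows` are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    blocks = defaultdict(set)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].add(i)
    return list(blocks.values())


def dependency(rows, p, q):
    """Classical dependency degree of attribute set q on attribute set p.

    Counts the objects whose p-equivalence class lies entirely inside a
    single q-equivalence class (the positive region), divided by |U|.
    """
    q_blocks = partition(rows, q)
    positive = 0
    for block in partition(rows, p):
        if any(block <= q_block for q_block in q_blocks):
            positive += len(block)
    return positive / len(rows)


# Hypothetical table of already-discretized feature values (assumption).
rows = [
    {"f1": 0, "f2": 0, "f3": 1},
    {"f1": 0, "f2": 0, "f3": 0},
    {"f1": 1, "f2": 1, "f3": 1},
    {"f1": 1, "f2": 1, "f3": 0},
    {"f1": 2, "f2": 1, "f3": 1},
]

for p, q in [("f1", "f2"), ("f2", "f1"), ("f3", "f1")]:
    print(f"gamma_{p}({q}) = {dependency(rows, [p], [q])}")
```

In this toy table f1 fully determines f2, f2 only partly determines f1, and f3 says nothing about f1. The measure is not symmetric, so a clustering step that needs a symmetric similarity would have to combine both directions in some way; how the paper does this is not stated in the abstract.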
Pages: 69-86
Number of pages: 18