Hierarchical feature selection based on relative dependency for gear fault diagnosis

被引:1
作者
Mariela Cerrada
René-Vinicio Sánchez
Fannia Pacheco
Diego Cabrera
Grover Zurita
Chuan Li
机构
[1] Universidad de Los Andes,Control Systems Department
[2] Universidad Politécnica Salesiana,Mechanical Engineering Department
[3] Chongqing Technology and Business University,Research Center of System Health Maintenance
来源
Applied Intelligence | 2016年 / 44卷
关键词
Feature selection; Attribute clustering; Rough sets; Relative dependency; Gear fault diagnosis;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is an important aspect under study in machine learning based diagnosis, that aims to remove irrelevant features for reaching good performance in the diagnostic systems. The behaviour of diagnostic models could be sensitive with regard to the amount of features, and significant features can represent the problem better than the entire set. Consequently, algorithms to identify these features are valuable contributions. This work deals with the feature selection problem through attribute clustering. The proposed algorithm is inspired by existing approaches, where the relative dependency between attributes is used to calculate dissimilarity values. The centroids of the created clusters are selected as representative attributes. The selection algorithm uses a random process for proposing centroid candidates, in this way, the inherent exploration in random search is included. A hierarchical procedure is proposed for implementing this algorithm. In each level of the hierarchy, the entire set of available attributes is split in disjoint sets and the selection process is applied on each subset. Once the significant attributes are proposed for each subset, a new set of available attributes is created and the selection process runs again in the next level. The hierarchical implementation aims to refine the search space in each level on a reduced set of selected attributes, while the computational time-consumption is improved also. The approach is tested with real data collected from a test bed, results show that the diagnosis precision by using a Random Forest based classifier is over 98 % with only 12 % of the attributes from the available set.
引用
收藏
页码:687 / 703
页数:16
相关论文
共 99 条
  • [1] Bartkowiak A(2014)Dimensionality reduction via variables selection linear and nonlinear approaches with application to vibration-based condition monitoring of planetary gearbox Appl Acoust 77 169-177
  • [2] Zimroz R(2013)Feature selection for nonlinear models with extreme learning machines Neurocomputing 102 111-124
  • [3] Benot F(2014)Support vector machine based optimization of multi-fault classification of gears with evolutionary algorithms from time frequency vibration data Measurement 55 1-14
  • [4] van Heeswijk M(2001)Random forests Mach Learn 45 5-32
  • [5] Miche Y(2015)Multi-stage feature selection by using genetic algorithms for fault diagnosis in gearboxes based on vibration signal Sensors 15 23,903-23,926
  • [6] Verleysen M(2014)A survey on feature selection methods Comput Electr Eng 40 16-28
  • [7] Lendasse A(2013)Fuzzy rough sets, and a granular neural network for unsupervised feature selection Neural Netw 48 91-108
  • [8] Bordoloi D(2012)A support vector machine approach based on physical model training for rolling element bearing fault detection in industrial environments Eng Appl Artif Intell 25 326-344
  • [9] Tiwari R(2014)Feature selection and replacement by clustering attributes Vietnam Journal of Computer Science 1 47-55
  • [10] Breiman L(2015)Criterion fusion for spectral segmentation and its application to optimal demodulation of bearing vibration signals Mech Syst Signal Process 6465 132-148