Supervised feature selection by clustering using conditional mutual information-based distances

被引:163
作者
Martinez Sotoca, Jose [1 ]
Pla, Filiberto [1 ]
机构
[1] Univ Jaume 1, Inst New Imaging Technol, Dept Llenguatges & Sistemes Informat, Castellon de La Plana 12071, Spain
关键词
Supervised feature selection; Clustering; Conditional mutual information; INPUT FEATURE-SELECTION;
D O I
10.1016/j.patcog.2009.12.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a supervised feature selection approach is presented, which is based on metric applied on continuous and discrete data representations. This method builds a dissimilarity space using information theoretic measures, in particular conditional mutual information between features with respect to a relevant variable that represents the class labels. Applying a hierarchical clustering, the algorithm searches for a compression of the information contained in the original set of features. The proposed technique is compared with other state of art methods also based on information measures. Eventually, several experiments are presented to show the effectiveness of the features selected from the point of view of classification accuracy. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2068 / 2081
页数:14
相关论文
共 30 条