A group incremental approach for feature selection on hybrid data

被引:0
作者
Feng Wang
Wei Wei
Jiye Liang
机构
[1] Shanxi University,Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, School of Computer and Information Technology
来源
Soft Computing | 2022年 / 26卷
关键词
Incremental feature selection; Dynamic data sets; information entropy; Neighborhood relation;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection for dynamic data sets has been perceived as a very significant hot research problem in data mining. In practice, most real-world data usually are hybrid, which means both include categorical data and numerical data. For dynamic hybrid data, this paper first introduces a new neighborhood relation and information entropy based on neighborhood accordingly. Secondly, the single incremental mechanism and group incremental mechanism are analyzed and proofed to construct feature significance. On this basis, two incremental approaches to feature selection are developed for dealing with hybrid data. To better demonstrate the new algorithm, four common classifiers and twelve UCI data sets are introduced in the experiments. The experimental results further validate the feasibility of the incremental algorithms, and especially the efficiency of the group incremental algorithm.
引用
收藏
页码:3663 / 3677
页数:14
相关论文
共 124 条
[1]  
Almuallim H(1994)Learning boolean concepts in the presence of many irrelevant features Artif Intell 69 279-305
[2]  
Dietterich TG(2004)Efficient semi-supervised feature selection: constraint, relevance and redundancy IEEE Trans Knowl Data Eng 26 1131-1143
[3]  
Benabdeslem K(2019)Feature selection for imbalanced data based on neighborhood rough sets Inf Sci 483 1-20
[4]  
Hindawi M(1997)Feature selection for classification Intell Data Anal 1 131-156
[5]  
Chen HM(2003)Consistency-based search in feature selection Artif Intell 151 155-176
[6]  
Li TR(2012)A survey on learning from data streams: current and future trends Prog Artif Intell 1 45-55
[7]  
Fan X(2003)An introduction to variable feature selection Mach Learn Res 3 1157-1182
[8]  
Luo C(2007)Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation Pattern Recogn 40 3509-3521
[9]  
Dash M(2006)Information-preserving hybrid data reduction based on fuzzy-rough techniques Pattern Recogn Lett 27 414-423
[10]  
Liu H(2008)Neighborhood rough set based heterogeneous feature subset selection Inf Sci 178 3577-3594