Ensemble Fuzzy Feature Selection Based on Relevancy, Redundancy, and Dependency Criteria

被引:11
作者
Salem, Omar A. M. [1 ,2 ]
Liu, Feng [1 ]
Chen, Yi-Ping Phoebe [3 ]
Chen, Xi [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Suez Canal Univ, Fac Comp & Informat, Dept Informat Syst, Ismailia 41522, Egypt
[3] La Trobe Univ, Dept Comp Sci & Informat Technol, Melbourne, Vic 3086, Australia
关键词
feature selection; fuzzy sets; mutual information; rough set; STABLE FEATURE-SELECTION; INPUT FEATURE-SELECTION; MUTUAL INFORMATION; MAX-RELEVANCE; ROUGH SETS; REDUCTION; PERFORMANCE;
D O I
10.3390/e22070757
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The main challenge of classification systems is the processing of undesirable data. Filter-based feature selection is an effective solution to improve the performance of classification systems by selecting the significant features and discarding the undesirable ones. The success of this solution depends on the extracted information from data characteristics. For this reason, many research theories have been introduced to extract different feature relations. Unfortunately, traditional feature selection methods estimate the feature significance based on either individually or dependency discriminative ability. This paper introduces a new ensemble feature selection, called fuzzy feature selection based on relevancy, redundancy, and dependency (FFS-RRD). The proposed method considers both individually and dependency discriminative ability to extract all possible feature relations. To evaluate the proposed method, experimental comparisons are conducted with eight state-of-the-art and conventional feature selection methods. Based on 13 benchmark datasets, the experimental results over four well-known classifiers show the outperformance of our proposed method in terms of classification performance and stability.
引用
收藏
页数:17
相关论文
共 52 条
[11]  
Dua D., 2017, UCI MACHINE LEARNING
[12]  
Fleuret F, 2004, J MACH LEARN RES, V5, P1531
[13]   An evaluation of classifier-specific filter measure performance for feature selection [J].
Freeman, Cecille ;
Kulic, Dana ;
Basir, Otman .
PATTERN RECOGNITION, 2015, 48 (05) :1812-1826
[14]   A Survey of Discretization Techniques: Taxonomy and Empirical Analysis in Supervised Learning [J].
Garcia, Salvador ;
Luengo, Julian ;
Antonio Saez, Jose ;
Lopez, Victoria ;
Herrera, Francisco .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (04) :734-750
[15]  
Han JC, 2004, LECT NOTES ARTIF INT, V3066, P176
[16]  
Hassanien A.E, 2008, ROUGH COMPUTING O
[17]   Mutual information-based method for selecting informative feature sets [J].
Herman, Gunawan ;
Zhang, Bang ;
Wang, Yang ;
Ye, Getian ;
Chen, Fang .
PATTERN RECOGNITION, 2013, 46 (12) :3315-3327
[18]   Feature selection considering two types of feature relevancy and feature interdependency [J].
Hu, Liang ;
Gao, Wanfu ;
Zhao, Kuo ;
Zhang, Ping ;
Wang, Feng .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 93 :423-434
[19]   Information-preserving hybrid data reduction based on fuzzy-rough techniques [J].
Hu, QH ;
Yu, DR ;
Xie, ZX .
PATTERN RECOGNITION LETTERS, 2006, 27 (05) :414-423
[20]   Fuzzy sets in machine learning and data mining [J].
Huellermeier, Eyke .
APPLIED SOFT COMPUTING, 2011, 11 (02) :1493-1505