Ensemble Fuzzy Feature Selection Based on Relevancy, Redundancy, and Dependency Criteria

Cited by: 11
Authors
Salem, Omar A. M. [1, 2]
Liu, Feng [1]
Chen, Yi-Ping Phoebe [3]
Chen, Xi [1]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Suez Canal Univ, Fac Comp & Informat, Dept Informat Syst, Ismailia 41522, Egypt
[3] La Trobe Univ, Dept Comp Sci & Informat Technol, Melbourne, Vic 3086, Australia
Keywords
feature selection; fuzzy sets; mutual information; rough set; STABLE FEATURE-SELECTION; INPUT FEATURE-SELECTION; MUTUAL INFORMATION; MAX-RELEVANCE; ROUGH SETS; REDUCTION; PERFORMANCE
DOI
10.3390/e22070757
Chinese Library Classification
O4 [Physics]
Discipline classification code
0702
Abstract
The main challenge for classification systems is processing undesirable data. Filter-based feature selection is an effective way to improve classification performance by selecting the significant features and discarding the undesirable ones. The success of this approach depends on the information extracted from the data characteristics. For this reason, many theories have been introduced to extract different feature relations. Unfortunately, traditional feature selection methods estimate feature significance based on either individual or dependency-based discriminative ability, but not both. This paper introduces a new ensemble feature selection method, called fuzzy feature selection based on relevancy, redundancy, and dependency (FFS-RRD). The proposed method considers both individual and dependency-based discriminative ability to extract all possible feature relations. To evaluate the proposed method, experimental comparisons are conducted with eight state-of-the-art and conventional feature selection methods. On 13 benchmark datasets, the experimental results over four well-known classifiers show that the proposed method outperforms the compared methods in terms of classification performance and stability.
Pages: 17