Hybrid filter-wrapper attribute selection with alpha-level fuzzy rough sets

被引:16
作者
Nguyen Ngoc Thuy [1 ,2 ]
Wongthanavasu, Sartra [2 ]
机构
[1] Hue Univ, Univ Sci, Fac Informat Technol, Hue, Vietnam
[2] Khon Kaen Univ, Fac Sci, Dept Comp Sci, Khon Kaen 40002, Thailand
关键词
Attribute selection; Reducts; Decision information systems; alpha-level fuzzy rough sets; alpha-level fuzzy certainty factor; Numerical/real attributes; INCREMENTAL FEATURE-SELECTION; MIXED FEATURE-SELECTION; MAX-RELEVANCE; REDUCTION; COMBINATION; ALGORITHM;
D O I
10.1016/j.eswa.2021.116428
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Selection of important attributes/features from decision information systems plays a vital role in data mining and machine learning tasks. It is regarded as a very interesting, but challenge problem, especially when faced with continuous numerical/real attributes. Neighborhood rough sets and fuzzy rough sets based attribute selection methods are well-known for dealing effectively with numerical/real attributes. However, characteristics of data may be described incompletely by neighborhood classes in the neighborhood rough set model, while the fuzzy rough sets based approach is still quite time-consuming because of the complex calculations on fuzzy equivalence classes. To address these limitations, we apply the concept of sets of level alpha (alpha-cut sets) in the fuzzy set theory to construct alpha-level fuzzy equivalence classes which provide a foundation for developing basic concepts of a new alpha-level fuzzy rough set model. We will see that under the properties of the alpha-cut sets, the alpha-level fuzzy equivalence classes not only help to significantly reduce the computational cost, but also preserve most of the information about the relationships between the objects, and even can decrease some noise in the data. Based on the alpha-level fuzzy rough set model, we define new reducts and then propose the FSFCF algorithm for attribute subset selection from the decision information systems containing continuous data. It is important to emphasize some advantages of the proposed method. First, in order to evaluate and select optimal attributes, we use an alpha-level fuzzy certainty factor with the comprehensive consideration to all objects in the universe. Second, the FSFCF algorithm is designed in the hybrid filter-wrapper approach to reduce the size of selected attribute subset as well as enhance the classification accuracy. Therefore, the proposed method can significantly improve the performance of the attribute selection for continuous data. To verify the effectiveness of FSFCF, we implement experiments on a variety of real-world data sets. The results demonstrated that the proposed method outperforms the compared state-of-the-art methods in terms of the computational time, the size of reduct and the classification accuracy for almost all of data sets.
引用
收藏
页数:13
相关论文
共 58 条
[21]   Towards scalable fuzzy-rough feature selection [J].
Jensen, Richard ;
Mac Parthalain, Neil .
INFORMATION SCIENCES, 2015, 323 :1-15
[22]   New Approaches to Fuzzy-Rough Feature Selection [J].
Jensen, Richard ;
Shen, Qiang .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2009, 17 (04) :824-838
[23]   Attribute reduction for multi-label learning with fuzzy rough set [J].
Lin, Yaojin ;
Li, Yuwen ;
Wang, Chenxi ;
Chen, Jinkun .
KNOWLEDGE-BASED SYSTEMS, 2018, 152 :51-61
[24]   Adaptive intrusion detection via GA-GOGMM-based pattern learning with fuzzy rough set-based attribute selection [J].
Liu, Jinping ;
Zhang, Wuxia ;
Tang, Zhaohui ;
Xie, Yongfang ;
Ma, Tianyu ;
Zhang, Jingjing ;
Zhang, Guoyong ;
Niyoyita, Jean Paul .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 139
[25]   CoEvil: A Coevolutionary Model for Crime Inference Based on Fuzzy Rough Feature Selection [J].
Liu, Xiaoming ;
Shen, Chao ;
Wang, Wei ;
Guan, Xiaohong .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (05) :806-817
[26]   Quick attribute reduct algorithm for neighborhood rough set model [J].
Liu Yong ;
Huang Wenliang ;
Jiang Yunliang ;
Zeng Zhiyong .
INFORMATION SCIENCES, 2014, 271 :65-81
[27]   A combination of fuzzy similarity measures and fuzzy entropy measures for supervised feature selection [J].
Lohrmann, Christoph ;
Luukka, Pasi ;
Jablonska-Sabuka, Matylda ;
Kauranne, Tuomo .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 110 :216-236
[28]   IT2 Fuzzy-Rough Sets and Max Relevance-Max Significance Criterion for Attribute Selection [J].
Maji, Pradipta ;
Garai, Partha .
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (08) :1657-1668
[29]   On fuzzy-rough attribute selection: Criteria of Max-Dependency, Max-Relevance, Min-Redundancy, and Max-Significance [J].
Maji, Pradipta ;
Garai, Partha .
APPLIED SOFT COMPUTING, 2013, 13 (09) :3968-3980
[30]   Fuzzy-Rough Simultaneous Attribute Selection and Feature Extraction Algorithm [J].
Maji, Pradipta ;
Garai, Partha .
IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (04) :1166-1177