Hybrid filter-wrapper attribute selection with alpha-level fuzzy rough sets

被引:12
作者
Nguyen Ngoc Thuy [1 ,2 ]
Wongthanavasu, Sartra [2 ]
机构
[1] Hue Univ, Univ Sci, Fac Informat Technol, Hue, Vietnam
[2] Khon Kaen Univ, Fac Sci, Dept Comp Sci, Khon Kaen 40002, Thailand
关键词
Attribute selection; Reducts; Decision information systems; alpha-level fuzzy rough sets; alpha-level fuzzy certainty factor; Numerical/real attributes; INCREMENTAL FEATURE-SELECTION; MIXED FEATURE-SELECTION; MAX-RELEVANCE; REDUCTION; COMBINATION; ALGORITHM;
D O I
10.1016/j.eswa.2021.116428
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Selection of important attributes/features from decision information systems plays a vital role in data mining and machine learning tasks. It is regarded as a very interesting, but challenge problem, especially when faced with continuous numerical/real attributes. Neighborhood rough sets and fuzzy rough sets based attribute selection methods are well-known for dealing effectively with numerical/real attributes. However, characteristics of data may be described incompletely by neighborhood classes in the neighborhood rough set model, while the fuzzy rough sets based approach is still quite time-consuming because of the complex calculations on fuzzy equivalence classes. To address these limitations, we apply the concept of sets of level alpha (alpha-cut sets) in the fuzzy set theory to construct alpha-level fuzzy equivalence classes which provide a foundation for developing basic concepts of a new alpha-level fuzzy rough set model. We will see that under the properties of the alpha-cut sets, the alpha-level fuzzy equivalence classes not only help to significantly reduce the computational cost, but also preserve most of the information about the relationships between the objects, and even can decrease some noise in the data. Based on the alpha-level fuzzy rough set model, we define new reducts and then propose the FSFCF algorithm for attribute subset selection from the decision information systems containing continuous data. It is important to emphasize some advantages of the proposed method. First, in order to evaluate and select optimal attributes, we use an alpha-level fuzzy certainty factor with the comprehensive consideration to all objects in the universe. Second, the FSFCF algorithm is designed in the hybrid filter-wrapper approach to reduce the size of selected attribute subset as well as enhance the classification accuracy. Therefore, the proposed method can significantly improve the performance of the attribute selection for continuous data. To verify the effectiveness of FSFCF, we implement experiments on a variety of real-world data sets. The results demonstrated that the proposed method outperforms the compared state-of-the-art methods in terms of the computational time, the size of reduct and the classification accuracy for almost all of data sets.
引用
收藏
页数:13
相关论文
共 58 条
[1]   A hybrid fuzzy feature selection algorithm for high-dimensional regression problems: An mRMR-based framework [J].
Aghaeipoor, Fatemeh ;
Javidi, Mohammad Masoud .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162
[2]  
[Anonymous], 2011, Acm T. Intel. Syst. Tec., DOI DOI 10.1145/1961189.1961199
[3]   Attribute Reduction for Heterogeneous Data Based on the Combination of Classical and Fuzzy Rough Set Models [J].
Chen, Degang ;
Yang, Yanyan .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (05) :1325-1334
[4]   A Novel Algorithm for Finding Reducts With Fuzzy Rough Sets [J].
Chen, Degang ;
Zhang, Lei ;
Zhao, Suyun ;
Hu, Qinghua ;
Zhu, Pengfei .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (02) :385-389
[5]   A graph approach for fuzzy -rough feature selection [J].
Chen, Jinkun ;
Mi, Jusheng ;
Lin, Yaojin .
FUZZY SETS AND SYSTEMS, 2020, 391 :96-116
[6]   Neighborhood rough set reduction with fish swarm algorithm [J].
Chen, Yumin ;
Zeng, Zhiqiang ;
Lu, Junwen .
SOFT COMPUTING, 2017, 21 (23) :6907-6918
[7]   Multi-adjoint fuzzy rough sets: Definition, properties and attribute selection [J].
Cornelis, Chris ;
Medina, Jesus ;
Verbiest, Nele .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2014, 55 (01) :412-426
[8]   Maximal-Discernibility-Pair-Based Approach to Attribute Reduction in Fuzzy Rough Sets [J].
Dai, Jianhua ;
Hu, Hu ;
Wu, Wei-Zhi ;
Qian, Yuhua ;
Huang, Debiao .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (04) :2174-2187
[9]   Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification [J].
Dai, Jianhua ;
Xu, Qing .
APPLIED SOFT COMPUTING, 2013, 13 (01) :211-221
[10]   Multigranulation consensus fuzzy-rough based attribute reduction [J].
Ding, Weiping ;
Wang, Jiandong ;
Wang, Jiehua .
KNOWLEDGE-BASED SYSTEMS, 2020, 198