共 58 条
Hybrid filter-wrapper attribute selection with alpha-level fuzzy rough sets
被引:16
作者:
Nguyen Ngoc Thuy
[1
,2
]
Wongthanavasu, Sartra
[2
]
机构:
[1] Hue Univ, Univ Sci, Fac Informat Technol, Hue, Vietnam
[2] Khon Kaen Univ, Fac Sci, Dept Comp Sci, Khon Kaen 40002, Thailand
关键词:
Attribute selection;
Reducts;
Decision information systems;
alpha-level fuzzy rough sets;
alpha-level fuzzy certainty factor;
Numerical/real attributes;
INCREMENTAL FEATURE-SELECTION;
MIXED FEATURE-SELECTION;
MAX-RELEVANCE;
REDUCTION;
COMBINATION;
ALGORITHM;
D O I:
10.1016/j.eswa.2021.116428
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Selection of important attributes/features from decision information systems plays a vital role in data mining and machine learning tasks. It is regarded as a very interesting, but challenge problem, especially when faced with continuous numerical/real attributes. Neighborhood rough sets and fuzzy rough sets based attribute selection methods are well-known for dealing effectively with numerical/real attributes. However, characteristics of data may be described incompletely by neighborhood classes in the neighborhood rough set model, while the fuzzy rough sets based approach is still quite time-consuming because of the complex calculations on fuzzy equivalence classes. To address these limitations, we apply the concept of sets of level alpha (alpha-cut sets) in the fuzzy set theory to construct alpha-level fuzzy equivalence classes which provide a foundation for developing basic concepts of a new alpha-level fuzzy rough set model. We will see that under the properties of the alpha-cut sets, the alpha-level fuzzy equivalence classes not only help to significantly reduce the computational cost, but also preserve most of the information about the relationships between the objects, and even can decrease some noise in the data. Based on the alpha-level fuzzy rough set model, we define new reducts and then propose the FSFCF algorithm for attribute subset selection from the decision information systems containing continuous data. It is important to emphasize some advantages of the proposed method. First, in order to evaluate and select optimal attributes, we use an alpha-level fuzzy certainty factor with the comprehensive consideration to all objects in the universe. Second, the FSFCF algorithm is designed in the hybrid filter-wrapper approach to reduce the size of selected attribute subset as well as enhance the classification accuracy. Therefore, the proposed method can significantly improve the performance of the attribute selection for continuous data. To verify the effectiveness of FSFCF, we implement experiments on a variety of real-world data sets. The results demonstrated that the proposed method outperforms the compared state-of-the-art methods in terms of the computational time, the size of reduct and the classification accuracy for almost all of data sets.
引用
收藏
页数:13
相关论文