An efficient accelerator for attribute reduction from incomplete data in rough set framework

Cited by: 191
Authors
Qian, Yuhua [1 ,2 ]
Liang, Jiye [1 ]
Pedrycz, Witold [3 ]
Dang, Chuangyin [2 ]
Affiliations
[1] Shanxi Univ, Key Lab Computat Intelligence & Chinese Informat, Minist Educ, Taiyuan 030006, Shanxi, Peoples R China
[2] City Univ Hong Kong, Dept Mfg Engn & Engn Management, Hong Kong, Hong Kong, Peoples R China
[3] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB, Canada
Keywords
Feature selection; Rough set theory; Incomplete information systems; Positive approximation; Granular computing; FEATURE-SELECTION; KNOWLEDGE REDUCTION; INFORMATION; GRANULATION; ENTROPY; DIMENSIONALITY; UNCERTAINTY; SYSTEMS;
DOI
10.1016/j.patcog.2011.02.020
CLC Classification Number
TP18 [Theory of Artificial Intelligence]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
Feature selection (attribute reduction) from large-scale incomplete data is a challenging problem in areas such as pattern recognition, machine learning and data mining. In rough set theory, feature selection from incomplete data aims to retain the discriminatory power of original features. To address this issue, many feature selection algorithms have been proposed; however, these algorithms are often computationally time-consuming. To overcome this shortcoming, we introduce in this paper a theoretical framework based on rough set theory, called positive approximation, which can be used to accelerate a heuristic process of feature selection from incomplete data. As an application of the proposed accelerator, a general feature selection algorithm is designed. By integrating the accelerator into a heuristic algorithm, we obtain several modified representative heuristic feature selection algorithms in rough set theory. Experiments show that these modified algorithms outperform their original counterparts. It is worth noting that the performance gain of the modified algorithms becomes more pronounced on larger data sets. (C) 2011 Elsevier Ltd. All rights reserved.
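The acceleration idea described in the abstract can be illustrated with a small sketch: a rough-set positive region is computed under an indiscernibility relation, and a greedy forward-selection reduct algorithm shrinks the universe after each round by discarding objects already placed in the positive region, so later rounds work on fewer objects. This is only a minimal illustration of the general scheme, not the authors' exact algorithm (which handles incomplete data via tolerance classes); all function names and the toy data layout are assumptions for this sketch.

```python
from collections import defaultdict

def positive_region(universe, data, attrs, decision):
    """Objects whose equivalence class w.r.t. attrs is pure in the decision.

    data: list of dicts (attribute -> value); decision: list of labels.
    """
    blocks = defaultdict(list)
    for i in universe:
        blocks[tuple(data[i][a] for a in attrs)].append(i)
    pos = set()
    for members in blocks.values():
        if len({decision[i] for i in members}) == 1:  # consistent block
            pos.update(members)
    return pos

def accelerated_reduct(data, decision, all_attrs):
    """Greedy forward selection with a shrinking universe.

    After each round, objects already in the positive region are removed,
    so subsequent positive-region computations touch fewer objects -- the
    accelerator idea sketched here.
    """
    full = set(range(len(data)))
    target = len(positive_region(full, data, all_attrs, decision))
    universe, reduct, covered = set(full), [], 0
    while covered < target and len(reduct) < len(all_attrs):
        best, best_pos = None, set()
        for a in all_attrs:
            if a in reduct:
                continue
            pos = positive_region(universe, data, reduct + [a], decision)
            if len(pos) > len(best_pos):
                best, best_pos = a, pos
        if best is None or not best_pos:
            break  # greedy step made no progress
        reduct.append(best)
        covered += len(best_pos)
        universe -= best_pos  # shrink: already-discerned objects drop out
    return reduct
```

Because the positive region is monotone in the attribute set, objects removed in earlier rounds would stay positive under any superset of the current reduct, so dropping them does not change the result, only the cost per round.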
Pages: 1658-1670
Page count: 13