Fuzzy-rough data reduction with ant colony optimization

Cited by: 134
Authors
Jensen, R [1]
Shen, Q [1]
Affiliations
[1] Univ Wales, Dept Comp Sci, Aberystwyth, Dyfed, Wales
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
data reduction; fuzzy-rough sets; ant colony optimization; feature selection;
DOI
10.1016/j.fss.2004.07.014
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
Feature selection refers to the problem of selecting those input features that are most predictive of a given outcome, a problem encountered in many areas such as machine learning, pattern recognition and signal processing. In particular, solutions to this problem have found successful application in tasks that involve datasets containing huge numbers of features (in the order of tens of thousands), which would otherwise be impossible to process further. Recent examples include text processing and web content classification. Rough set theory has been used as such a dataset preprocessor with much success, but current methods are inadequate at finding minimal reductions, that is, the smallest sets of features possible. To alleviate this difficulty, a feature selection technique that employs a hybrid variant of rough sets, fuzzy-rough sets, has been developed recently and has been shown to be effective. However, this method is still not able to find the optimal subsets regularly. This paper proposes a new feature selection mechanism based on ant colony optimization in an attempt to combat this. The method is then applied to the problem of finding optimal feature subsets in the fuzzy-rough data reduction process. The present work is applied to complex systems monitoring and experimentally compared with the original fuzzy-rough method, an entropy-based feature selector, and a transformation-based reduction method, PCA. Comparisons with the use of a support vector classifier are also included. © 2004 Elsevier B.V. All rights reserved.
Pages: 5-20
Number of pages: 16
Related papers
22 in total
[1]  
Blake C.L., 1998, UCI repository of machine learning databases
[2]  
Bonabeau E., 1999, Swarm Intelligence: From Natural to Artificial Systems, DOI 10.1093/oso/9780195131581.001.0001
[3]   A new method for generating fuzzy rules from numerical data for handling classification problems [J].
Chen, SM ;
Lee, SH ;
Lee, CH .
APPLIED ARTIFICIAL INTELLIGENCE, 2001, 15 (07) :645-664
[4]   Rough set-aided keyword reduction for text categorization [J].
Chouchoulas, A ;
Shen, Q .
APPLIED ARTIFICIAL INTELLIGENCE, 2001, 15 (09) :843-873
[5]  
Dash M., 1997, Intelligent Data Analysis, V1
[6]  
Devijver P., 1982, PATTERN RECOGN
[7]   Ant system: Optimization by a colony of cooperating agents [J].
Dorigo, M ;
Maniezzo, V ;
Colorni, A .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1996, 26 (01) :29-41
[8]   On the representation of fuzzy rules in terms of crisp rules [J].
Dubois, D ;
Hüllermeier, E ;
Prade, H .
INFORMATION SCIENCES, 2003, 151 :301-326
[9]  
Dubois D., 1992, Putting Rough Sets and Fuzzy Sets Together, P203, DOI 10.1007/978-94-015-7975-9_14
[10]   QUOTIENTS WITH RESPECT TO SIMILARITY RELATIONS [J].
HOHLE, U .
FUZZY SETS AND SYSTEMS, 1988, 27 (01) :31-44