A novel approach for discretization of continuous attributes in rough set theory

被引:52
作者
Jiang, Feng [1 ]
Sui, Yuefei [2 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Informat Sci & Technol, Qingdao 266061, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Rough sets; Discretization; Supervised; Multivariate; Cuts; KNOWLEDGE REDUCTION; FEATURE-SELECTION; GRANULATION; ALGORITHM;
D O I
10.1016/j.knosys.2014.10.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discretization of continuous attributes is an important task in rough sets and many discretization algorithms have been proposed. However, most of the current discretization algorithms are univariate, which may reduce the classification ability of a given decision table. To solve this problem, we propose a supervised and multivariate discretization algorithm-SMDNS in rough sets, which is derived from the traditional algorithm naive scaler (called Naive). Given a decision table DT = (U,C,D,V,f), since SMDNS uses both class information and the interdependence among various condition attributes in C to determine the discretization scheme, the cuts obtained by SMDNS are much less than those obtained by Naive, while the classification ability of DT remains unchanged after discretization. Experimental results show that SMDNS is efficient in terms of the classification accuracy and the number of generated cuts. In particular, our algorithm can obtain a satisfactory compromise between the number of cuts and the classification accuracy. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:324 / 334
页数:11
相关论文
共 61 条
[1]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[2]  
[Anonymous], 1980, Computer Security Threat Monitoring and Surveillance
[3]  
[Anonymous], 1995, P 2 JOINT C INFORM S
[4]  
[Anonymous], 1999, KDD CUP 99 DATASET
[5]  
[Anonymous], 2001, Rough Set Theory and Knowledge Acquisition
[6]   Multivariate Discretization for Set Mining [J].
Stephen D. Bay .
Knowledge and Information Systems, 2001, 3 (4) :491-512
[7]  
Bay S.D., 1999, UCI KDD REPOSITORY
[8]  
Bazan JG, 2000, STUD FUZZ SOFT COMP, V56, P49
[9]  
Blajdo P., 2008, LECT NOTES ARTIF INT, V5009, p31C38
[10]  
CATLETT J, 1991, LECT NOTES ARTIF INT, V482, P164, DOI 10.1007/BFb0017012