DB-HReduction: A data preprocessing algorithm for data mining applications

被引:27
作者
Hu, XH [1 ]
机构
[1] Drexel Univ, Coll Informat Sci & Techol, Philadelphia, PA 19104 USA
关键词
data mining; data preprocessing; data reduction; horizontal reduction;
D O I
10.1016/S0893-9659(03)90013-9
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Data preprocessing is an important and critical step in the data mining process and it has a huge impact on the success of. a data mining project. In this paper, we present an algorithm DBHReduction, which discretizes or eliminates numeric attributes and generalizes or eliminates symbolic attributes very efficiently and effectively. This algorithm greatly decreases the number of attributes and tuples of the data set and improves the accuracy and decreases the running time of the data mining algorithms in the later stage. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:889 / 895
页数:7
相关论文
共 9 条
[1]  
ALLMUALLIM H, 1994, ARTIF INTELL, V69, P279
[2]  
[Anonymous], 1992, The Tenth National Conference on Artificial Intelligence
[3]  
[Anonymous], KNOWLEDGE INFORM SYS
[4]  
HAN JW, 1992, PROC INT CONF VERY L, P547
[5]  
HU X, UNPUB 2002 IEEE INT
[6]  
Kohavi R., 1996, P 2 INT C KNOWL DISC, V96, P1
[7]  
LIU H, 1996, P 8 IEEE TOOLS AI WA, P388
[8]  
Liu H., 1998, FEATURE EXTRACTION C, DOI 10.1007/978-1-4615-5725-8
[9]  
Pyle D., 1999, Data Preparation for Data Mining