Robust rule-based prediction

被引:9
作者
Li, Jiuyong [1 ]
机构
[1] Univ So Queensland, Dept Math & Comp, Toowoomba, Qld 4350, Australia
基金
澳大利亚研究理事会;
关键词
data mining; rule; classification; robustness;
D O I
10.1109/TKDE.2006.129
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies a problem of robust rule-based classification, i.e., making predictions in the presence of missing values in data. This study differs from other missing value handling research in that it does not handle missing values but builds a rule-based classification model to tolerate missing values. Based on a commonly used rule-based classification model, we characterize the robustness of a hierarchy of rule sets as k-optimal rule sets with the decreasing size corresponding to the decreasing robustness. We build classifiers based on k-optimal rule sets and show experimentally that they are more robust than some benchmark rule-based classifiers, such as C4.5rules and CBA. We also show that the proposed approach is better than two well-known missing value handling methods for missing values in test data.
引用
收藏
页码:1043 / 1054
页数:12
相关论文
共 24 条
[1]  
Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
[2]  
[Anonymous], UCI REPOSITORY MACHI
[3]   Essential classification rule sets [J].
Baralis, E ;
Chiusano, S .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2004, 29 (04) :635-674
[4]  
Batista GEAPA, 2003, APPL ARTIF INTELL, V17, P519, DOI [10.1080/713827181, 10.1080/08839510390219309]
[5]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[6]  
Clark P., 1989, Machine Learning, V3, P261, DOI 10.1023/A:1022641700528
[7]  
Clark P., 1991, P 5 EUR WORK SESS LE, P151, DOI DOI 10.1007/BFB0017011
[8]  
Cohen W. W., 1995, P 12 INT C MACH LEAR, P115, DOI DOI 10.1016/B978-1-55860-377-6.50023-2
[9]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[10]  
Freund Y, 1996, ICML