A kernel-based semi-naive Bayesian classifier using P-Trees

被引:0
作者
Denton, A [1 ]
Perrizo, W [1 ]
机构
[1] N Dakota State Univ, Dept Comp Sci, Fargo, ND 58105 USA
来源
PROCEEDINGS OF THE FOURTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING | 2004年
关键词
Bayesian classifiers; semi-naive Bayes; scalable algorithms; correlations; kernel methods; P-Trees;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel semi-naive Bayesian classifier is introduced that is particularly suitable to data with many attributes. The naive Bayesian classifier is taken as a starting point and correlations are reduced through joining of highly correlated attributes. Our technique differs from related work in its use of kernel-functions that systematically include continuous attributes rather than relying on discretization as a preprocessing step. This retains distance information within the attribute domains and ensures that attributes are joined based on their correlation for the particular values of the test sample. We implement a kernel-based semi-naive Bayesian classifier using P-Trees and demonstrate that it generally outperforms the naive Bayesian classifier as well as a discrete semi-naive Bayesian classifier.
引用
收藏
页码:427 / 431
页数:5
相关论文
empty
未找到相关数据