A Preliminary Study on Class Probability Estimation for Random Forest Using Kernel Density estimators

被引:0
作者
Yang, Fan [1 ]
Peng, Piao [1 ]
Zhou, Qifeng [1 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen, Peoples R China
来源
2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE) | 2016年
关键词
random forest; class probability estimation; kernel density estimation;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Random forest cannot give accurate and calibrated posterior class probability estimates for its predictions. In this paper, we propose novel probabilities estimators combining random forests with kernel density estimation. Kernel density estimator can manage to obtain smooth non-parametric estimations of class probabilities, but fail to scale up to the high dimensional data. In order to apply kernel density estimator to high dimensional data, we proposed to utilize random forest for dimension reduction. First, a random forest model is built with the training data. Secondly, for the test instance, we perform local kernel density estimators in the reduced subspaces corresponding to trees of random forest, and then average the estimated probabilities over the trees. Preliminary experiments on synthetic high-dimensional data showed that the new method provided accurate probability estimates for the output of the random forest.
引用
收藏
页码:118 / 122
页数:5
相关论文
共 28 条
[1]  
[Anonymous], KDD 02 P 8 ACM SIGKD
[2]  
[Anonymous], 6 INT C MACH LEARN A
[3]  
[Anonymous], 0004IS CDER STERN SC
[4]  
[Anonymous], P 12 INT C MACH LEAR
[5]  
[Anonymous], DENSITY ESTIMATION S
[6]  
[Anonymous], INT C MACH LEARN
[7]  
[Anonymous], MACHINE LEARNING CHA
[8]  
[Anonymous], STAT MED
[9]   Confidence in predictions from random tree ensembles [J].
Bhattacharyya, Siddhartha .
KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 35 (02) :391-410
[10]   Calibrating Random Forests [J].
Bostrom, Henrik .
SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, :121-126