Probability estimation with machine learning methods for dichotomous and multicategory outcome: Applications

被引:47
作者
Kruppa, Jochen [1 ]
Liu, Yufeng [2 ]
Diener, Hans-Christian [3 ]
Holste, Theresa [1 ]
Weimar, Christian [3 ]
Koenig, Inke R. [1 ]
Ziegler, Andreas [1 ,4 ]
机构
[1] Med Univ Lubeck, Inst Med Biometrie & Stat, Univ Klinikum Schleswig Holstein, D-23562 Lubeck, Germany
[2] Univ N Carolina, Carolina Ctr Genome Sci, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
[3] Univ Klinikum Essen, Neurol Klin, D-45147 Essen, Germany
[4] Zentrum Klin Studien Lubeck, D-23562 Lubeck, Germany
关键词
Brier score; German Stroke Study Collaboration; Probability estimation; Random forest; Random Jungle; Support vector machine; SUPPORT VECTOR MACHINES; 2 BINOMIAL PROPORTIONS; CONFIDENCE-INTERVALS; RANDOM FOREST; DIFFERENCE; BRIER; PREDICTION; VALIDATION; DIAGNOSIS; SCORE;
D O I
10.1002/bimj.201300077
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Machine learning methods are applied to three different large datasets, all dealing with probability estimation problems for dichotomous or multicategory data. Specifically, we investigate k-nearest neighbors, bagged nearest neighbors, random forests for probability estimation trees, and support vector machines with the kernels of Bessel, linear, Laplacian, and radial basis type. Comparisons are made with logistic regression. The dataset from the German Stroke Study Collaboration with dichotomous and three-category outcome variables allows, in particular, for temporal and external validation. The other two datasets are freely available from the UCI learning repository and provide dichotomous outcome variables. One of them, the Cleveland Clinic Foundation Heart Disease dataset, uses data from one clinic for training and from three clinics for external validation, while the other, the thyroid disease dataset, allows for temporal validation by separating data into training and test data by date of recruitment into study. For dichotomous outcome variables, we use receiver operating characteristics, areas under the curve values with bootstrapped 95% confidence intervals, and Hosmer-Lemeshow-type figures as comparison criteria. For dichotomous and multicategory outcomes, we calculated bootstrap Brier scores with 95% confidence intervals and also compared them through bootstrapping. In a supplement, we provide R code for performing the analyses and for random forest analyses in Random Jungle, version 2.1.0. The learning machines show promising performance over all constructed models. They are simple to apply and serve as an alternative approach to logistic or multinomial logistic regression analysis.
引用
收藏
页码:564 / 583
页数:20
相关论文
共 62 条
[1]   Sampling Uncertainty and Confidence Intervals for the Brier Score and Brier Skill Score [J].
Bradley, A. Allen ;
Schwartz, Stuart S. ;
Hashino, Tempei .
WEATHER AND FORECASTING, 2008, 23 (05) :992-1006
[2]  
Brier G. W., 1950, Monthly weather review, V78, P1, DOI [DOI 10.1175/1520-0493(1950)078, DOI 10.1175/1520-0493(1950)078ANDLT
[3]  
0001:VOFEITANDGT
[4]  
2.0.CO
[5]  
2, 10.1175/1520-0493(1950)078()0001:VOFEIT()2.0.CO
[6]  
2, DOI 10.1175/1520-0493(1950)0782.0.CO
[7]  
2]
[8]  
Cortes Corinna., 2005, ADV NEURAL INFORM PR, V17, P305
[9]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[10]   INTERNATIONAL APPLICATION OF A NEW PROBABILITY ALGORITHM FOR THE DIAGNOSIS OF CORONARY-ARTERY DISEASE [J].
DETRANO, R ;
JANOSI, A ;
STEINBRUNN, W ;
PFISTERER, M ;
SCHMID, JJ ;
SANDHU, S ;
GUPPY, KH ;
LEE, S ;
FROELICHER, V .
AMERICAN JOURNAL OF CARDIOLOGY, 1989, 64 (05) :304-310