The Lack of Cross-Validation Can Lead to Inflated Results and Spurious Conclusions: A Re-Analysis of the MacArthur Violence Risk Assessment Study

被引:12
作者
Bokhari, Ehsan [1 ,2 ]
Hubert, Lawrence [1 ]
机构
[1] Univ Illinois, Champaign, IL USA
[2] Los Angeles Dodgers, Los Angeles, CA 90009 USA
关键词
Classification trees; Cross-validation; Replicability; Misclassification costs; Random forests; Violence prediction; CLASSIFICATION; SCALE; COVR;
D O I
10.1007/s00357-018-9252-3
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Cross-validation is an important evaluation strategy in behavioral predictive modeling; without it, a predictive model is likely to be overly optimistic. Statistical methods have been developed that allow researchers to straightforwardly cross-validate predictive models by using the same data employed to construct the model. In the present study, cross-validation techniques were used to construct several decision-tree models with data from the MacArthur Violence Risk Assessment Study (Monahan et al., 2001). The models were then compared with the original (non-cross-validated) Classification of Violence Risk assessment tool. The results show that the measures of predictive model accuracy (AUC, misclassification error, sensitivity, specificity, positive and negative predictive values) degrade considerably when applied to a testing sample, compared with the training sample used to fit the model initially. In addition, unless false negatives (that is, incorrectly predicting individuals to be nonviolent) are considered more costly than false positives (that is, incorrectly predicting individuals to be violent), the models generally make few predictions of violence. The results suggest that employing cross-validation when constructing models can make an important contribution to increasing the reliability and replicability of psychological research.
引用
收藏
页码:147 / 171
页数:25
相关论文
共 35 条
[1]   A multiple-models approach to violence risk assessment among people with mental disorder [J].
Banks, S ;
Robbins, PC ;
Silver, E ;
Vesselinov, R ;
Steadman, HJ ;
Monahan, J ;
Mulvey, EP ;
Appelbaum, PS ;
Grisso, T ;
Roth, LH .
CRIMINAL JUSTICE AND BEHAVIOR, 2004, 31 (03) :324-340
[2]  
BERK R, 2012, CRIMINAL JUSTICE FOR
[3]   Asymmetric Loss Functions for Forecasting in Criminal Justice Settings [J].
Berk, Richard .
JOURNAL OF QUANTITATIVE CRIMINOLOGY, 2011, 27 (01) :107-123
[4]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[5]   SUBMODEL SELECTION AND EVALUATION IN REGRESSION - THE X-RANDOM CASE [J].
BREIMAN, L ;
SPECTOR, P .
INTERNATIONAL STATISTICAL REVIEW, 1992, 60 (03) :291-319
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
Doyle M., 2010, International Journal of Forensic Mental Health, V9, P316, DOI DOI 10.1080/14999013.2010.527428
[9]  
Fernández-Delgado M, 2014, J MACH LEARN RES, V15, P3133
[10]   A comparison of actuarial methods for identifying repetitively violent patients with mental illnesses [J].
Gardner, W ;
Lidz, CW ;
Mulvey, EP ;
Shaw, EC .
LAW AND HUMAN BEHAVIOR, 1996, 20 (01) :35-48