When to consult precision-recall curves

被引:67
作者
Cook, Jonathan [1 ]
Ramadas, Vikram [1 ]
机构
[1] Publ Co Accounting Oversight Board, Washington, DC 20006 USA
关键词
st0591; prcurve; precision-recall curves; classifier evaluation; ROC curves; PREDICT CLASSIFICATION PERFORMANCE; ROC ANALYSIS;
D O I
10.1177/1536867X20909693
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Receiver operating characteristic (ROC) curves are commonly used to evaluate predictions of binary outcomes. When there is a small percentage of items of interest (as would be the case with fraud detection, for example), ROC curves can provide an inflated view of performance. This can cause challenges in determining which set of predictions is better. In this article, we discuss the conditions under which precision-recall curves may be preferable to ROC curves. As an illustrative example, we compare two commonly used fraud predictors (Beneish's [1999, Financial Analysts Journal 55: 24-36] M score and Dechow et al.'s [2011, Contemporary Accounting Research 28: 17-82] F score) using both ROC and precision-recall curves. To aid the reader with using precision-recall curves, we also introduce the command prcurve to plot them.
引用
收藏
页码:131 / 148
页数:18
相关论文
共 17 条
[1]  
[Anonymous], P 29 INT C INT C MAC
[2]  
Beneish M. D., 1999, Financial Analysts Journal, V55, P24, DOI [10.2469/faj.v55.n5.2296, DOI 10.2469/FAJ.V55.N5.2296]
[3]   Estimating receiver operative characteristic curves for time-dependent outcomes: The stroccurve package [J].
Cattaneo, Mattia ;
Malighetti, Paolo ;
Spinelli, Daniele .
STATA JOURNAL, 2017, 17 (04) :1015-1023
[4]   Detecting Management Fraud in Public Companies [J].
Cecchini, Mark ;
Aytug, Haldun ;
Koehler, Gary J. ;
Pathak, Praveen .
MANAGEMENT SCIENCE, 2010, 56 (07) :1146-1160
[5]   heckroccurve: ROC curves for selected samples [J].
Cook, Jonathan A. ;
Rajbhandari, Ashish .
STATA JOURNAL, 2018, 18 (01) :174-183
[6]   ROC curves and nonrandom data [J].
Cook, Jonathan Aaron .
PATTERN RECOGNITION LETTERS, 2017, 85 :35-41
[7]  
Davis JE, 2006, DEAF WAY II READER: PERSPECTIVES FROM THE SECOND INTERNATIONAL CONFERENCE ON DEAF CULTURE, P233
[8]   Predicting Material Accounting Misstatements [J].
Dechow, Patricia M. ;
Ge, Weili ;
Larson, Chad R. ;
Sloan, Richard G. .
CONTEMPORARY ACCOUNTING RESEARCH, 2011, 28 (01) :17-82
[9]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[10]   A response to Webb and Ting's On the application of ROC analysis to predict classification performance under varying class distributions [J].
Fawcett, T ;
Flach, PA .
MACHINE LEARNING, 2005, 58 (01) :33-38