Influence Analysis for the Area Under the Receiver Operating Characteristic Curve

被引:5
作者
Ke, Bo-Shiang [1 ]
Chiang, An Jen [2 ]
Chang, Yuan-chin Ivan [3 ]
机构
[1] Natl Chiao Tung Univ, Inst Stat, Hsinchu, Taiwan
[2] Kaohsiung Vet Gen Hosp, Dept Obstet & Gynecol, Kaohsiung, Taiwan
[3] Acad Sinica, Inst Stat Sci, 128 Acad Rd,Sect 2, Taipei 11529, Taiwan
关键词
AUC; cumulative lift chart; influence function; local influence; partial AUC; LOGISTIC-REGRESSION; AUC; CLASSIFICATION; COMBINATION; SELECTION;
D O I
10.1080/10543406.2017.1377728
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Classification measures play essential roles in the assessment and construction of classifiers. Hence, determining how to prevent these measures from being affected by individual observations has become an important problem. In this paper, we propose several indexes based on the influence function and the concept of local influence to identify influential observations that affect the estimate of the area under the receiver operating characteristic curve (AUC), an important and commonly used measure. Cumulative lift charts are also used to equipoise the disagreements among the proposed indexes. Both the AUC indexes and the graphical tools only rely on the classification scores, and both are applicable to classifiers that can produce real-valued classification scores. A real data set is used for illustration.
引用
收藏
页码:722 / 734
页数:13
相关论文
共 28 条
[1]  
[Anonymous], 2011, Statistical Pattern Recognition
[2]  
ATKINSON AC, 1986, BIOMETRIKA, V73, P533
[3]   Handling class imbalance in customer churn prediction [J].
Burez, J. ;
Van den Poel, D. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :4626-4636
[4]  
Calders T, 2007, LECT NOTES ARTIF INT, V4702, P42
[5]   Maximizing an ROC-type measure via linear combination of markers when the gold reference is continuous [J].
Chang, Yuan-chin Ivan .
STATISTICS IN MEDICINE, 2013, 32 (11) :1893-1903
[6]   Differentiating between borderline and invasive malignancies in ovarian tumors using a multivariate logistic regression model [J].
Chen, Jiabin ;
Chang, Chung ;
Huang, Hung-Chi ;
Chung, Yu-Che ;
Huang, Huan-Jung ;
Liou, Wen Shiung ;
Chiang, An Jen ;
Teng, Nelson N. H. .
TAIWANESE JOURNAL OF OBSTETRICS & GYNECOLOGY, 2015, 54 (04) :398-402
[7]  
Cook R., 1982, Residuals and Influence in Regression
[8]  
COOK RD, 1986, J ROY STAT SOC B MET, V48, P133
[9]   ROBUST ESTIMATION AND OUTLIER DETECTION WITH CORRELATION-COEFFICIENTS [J].
DEVLIN, SJ ;
GNANADESIKAN, R ;
KETTENRING, JR .
BIOMETRIKA, 1975, 62 (03) :531-545
[10]   A class of logistic-type discriminant functions [J].
Eguchi, S ;
Copas, J .
BIOMETRIKA, 2002, 89 (01) :1-22