Limitations of receiver operating characteristic curve on imbalanced data: Assist device mortality risk scores

被引:37
作者
Movahedi, Faezeh [1 ]
Padman, Rema [2 ]
Antaki, James F. [3 ,4 ]
机构
[1] Univ Pittsburgh, Swanson Sch Engn, Pittsburgh, PA USA
[2] Carnegie Mellon Univ, Heinz Coll, Pittsburgh, PA USA
[3] Cornell Univ, Meinig Sch Biomed Engn, Ithaca, NY USA
[4] Cornell Univ, Biomed Engn, 109 Weill Hall, Ithaca, NY 14853 USA
基金
美国国家卫生研究院;
关键词
CLASSIFICATION;
D O I
10.1016/j.jtcvs.2021.07.041
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective: In the left ventricular assist device domain, the receiver operating char-acteristic is a commonly applied metric of performance of classifiers. However, the receiver operating characteristic can provide a distorted view of classifiers' ability to predict short-term mortality due to the overwhelmingly greater proportion of patients who survive, that is, imbalanced data. This study illustrates the ambiguity of the receiver operating characteristic in evaluating 2 classifiers of 90-day left ven-tricular assist device mortality and introduces the precision recall curve as a supple-mental metric that is more representative of left ventricular assist device classifiers in predicting the minority class.Methods: This study compared the receiver operating characteristic and precision recall curve for 2 classifiers for 90-day left ventricular assist device mortality, HeartMate Risk Score and Random Forest for 800 patients (test group) recorded in the Interagency Registry for Mechanically Assisted Circulatory Support who received a continuous -flow left ventricular assist device between 2006 and 2016 (mean age, 59 years; 146 female vs 654 male patients), in whom 90-day mortality rate is only 8%.Results: The receiver operating characteristic indicates similar performance of Random Forest and HeartMate Risk Score classifiers with respect to area under the curve of 0.77 and Random Forest 0.63, respectively. This is in contrast to their precision recall curve with area under the curve of 0.43 versus 0.16 for Random For-est and HeartMate Risk Score, respectively. The precision recall curve for Heart -Mate Risk Score showed the precision rapidly decreased to only 10% with slightly increasing sensitivity.Conclusions: The receiver operating characteristic can portray an overly optimistic performance of a classifier or risk score when applied to imbalanced data. The pre-cision recall curve provides better insight about the performance of a classifier by focusing on the minority class. (J Thorac Cardiovasc Surg 2023;165:1433-42)
引用
收藏
页码:1433 / +
页数:12
相关论文
共 23 条
[1]   Fraud detection system: A survey [J].
Abdallah, Aisha ;
Maarof, Mohd Aizaini ;
Zainal, Anazida .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 68 :90-113
[2]  
[Anonymous], 2004, ACM SIGKDD Explorations Newsletter, DOI DOI 10.1145/1007730.1007736
[3]   Caveats and pitfalls of ROC analysis in clinical microarray research (and how to avoid them) [J].
Berrar, Daniel ;
Flach, Peter .
BRIEFINGS IN BIOINFORMATICS, 2012, 13 (01) :83-97
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   When to consult precision-recall curves [J].
Cook, Jonathan ;
Ramadas, Vikram .
STATA JOURNAL, 2020, 20 (01) :131-148
[6]   Predicting Survival in Patients Receiving Continuous Flow Left Ventricular Assist Devices The HeartMate II Risk Score [J].
Cowger, Jennifer ;
Sundareswaran, Kartik ;
Rogers, Joseph G. ;
Park, Soon J. ;
Pagani, Francis D. ;
Bhat, Geetha ;
Jaski, Brian ;
Farrar, David J. ;
Slaughter, Mark S. .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2013, 61 (03) :313-321
[7]  
Davis J., 2006, Proceedings of the 23rd International Conference on Machine Learning, P233, DOI DOI 10.1145/1143844.1143874
[8]  
Fernandez A, 2018, Learning from Imbalanced Data Sets, P63, DOI [DOI 10.1007/978-3-319-98074-4, 10.1007/978-3-319-98074-44]
[9]   A comprehensive data level analysis for cancer diagnosis on imbalanced data [J].
Fotouhi, Sara ;
Asadi, Shahrokh ;
Kattan, Michael W. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 90
[10]   Predicting pathological response to neoadjuvant chemotherapy in breast cancer patients based on imbalanced clinical data [J].
Gao, Ting ;
Hao, Yaguang ;
Zhang, Haipeng ;
Hu, Lina ;
Li, Hongzhi ;
Li, Hui ;
Hu, LiHong ;
Han, Bing .
PERSONAL AND UBIQUITOUS COMPUTING, 2018, 22 (5-6) :1039-1047