Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by Example

被引:6
|
作者
Ciocan, Andra [1 ,2 ]
Hajjar, Nadim Al [2 ,3 ]
Graur, Florin [2 ,3 ]
Oprea, Valentin C. [2 ,4 ]
Ciocan, Razvan A. [5 ]
Bolboaca, Sorana D. [1 ]
机构
[1] Iuliu Hatieganu Univ Med & Pharm Cluj Napoca, Dept Med Informat & Biostat, Louis Pasteur St 6, Cluj Napoca 400349, Romania
[2] Prof Dr Octavian Fodor Reg Inst Gastroenterol & H, Croitorilor St 19-21, Cluj Napoca 400162, Romania
[3] Iuliu Hatieganu Univ Med & Pharm Cluj Napoca, Dept Surg, Croitorilor St 19-21, Cluj Napoca 400162, Romania
[4] Dr Constantin Papilian Mil Emergency Hosp Cluj Na, Gen Traian Mosoiu St 22, Cluj Napoca 400132, Romania
[5] Iuliu Hatieganu Univ Med & Pharm, Dept Med Skills Human Sci, Marinescu St 23, Cluj Napoca 400337, Romania
关键词
receiver operating characteristic; area under the curve; models performances; biomarkers; metastasis; NEUTROPHIL-TO-LYMPHOCYTE; EVALUATING DIAGNOSTIC-TESTS; COLORECTAL-CANCER; ROC CURVE; MODEL SELECTION; GOLD STANDARD; RATIO; ACCURACY; SURVIVAL; AREA;
D O I
10.3390/math8101741
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The stability of receiver operating characteristic in context of random split used in development and validation sets, as compared to the full models for three inflammatory ratios (neutrophil-to-lymphocyte (NLR), derived neutrophil-to-lymphocyte (dNLR) and platelet-to-lymphocyte (PLR) ratio) evaluated as predictors for metastasis in patients with colorectal cancer, was investigated. Data belonging to patients admitted with the diagnosis of colorectal cancer from January 2014 until September 2019 in a single hospital were used. There were 1688 patients eligible for the study, 418 in the metastatic stage. All investigated inflammatory ratios proved to be significant classification models on both the full models and on cross-validations (AUCs > 0.05). High variability of the cut-off values was observed in the unrestricted and restricted split (full models: 4.255 for NLR, 2.745 for dNLR and 255.56 for PLR; random splits: cut-off from 3.215 to 5.905 for NLR, from 2.625 to 3.575 for dNLR and from 134.67 to 335.9 for PLR), but with no effect on the models characteristics or performances. The investigated biomarkes proved limited value as predictors for metastasis (AUCs < 0.8), with largely sensitivity and specificity (from 33.3% to 79.2% for the full model and 29.1% to 82.7% in the restricted splits). Our results showed that a simple random split of observations, weighting or not the patients with and whithout metastasis, in a ROC analysis assures the performances similar to the full model, if at least 70% of the available population is included in the study.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 35 条
  • [1] Tournament leave-pair-out cross-validation for receiver operating characteristic analysis
    Perez, Ileana Montoya
    Airola, Antti
    Bostrom, Peter J.
    Jambor, Ivan
    Pahikkala, Tapio
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2019, 28 (10-11) : 2975 - 2991
  • [2] Receiver Operating Characteristic Analysis for the Evaluation of Diagnosis and Prediction
    Gatsonis, Constantine A.
    RADIOLOGY, 2009, 253 (03) : 593 - 596
  • [3] Matrix variate receiver operating characteristic curve for binary classification
    Siva, G.
    STATISTICS, 2024, 58 (01) : 1 - 8
  • [4] Evaluating Classification Performance: Receiver Operating Characteristic and Expected Utility
    Yang, Yueran
    PSYCHOLOGICAL METHODS, 2024, 29 (05) : 827 - 843
  • [5] On the Accuracy of Cross-Validation in the Classification Problem
    Nedel'ko, V. M.
    BULLETIN OF IRKUTSK STATE UNIVERSITY-SERIES MATHEMATICS, 2021, 38 : 84 - 95
  • [6] Prediction-Based Structured Variable Selection through the Receiver Operating Characteristic Curves
    Wang, Yuanjia
    Chen, Huaihou
    Li, Runze
    Duan, Naihua
    Lewis-Fernandez, Roberto
    BIOMETRICS, 2011, 67 (03) : 896 - 905
  • [7] Assessing cure status prediction from survival data using receiver operating characteristic curves
    Amico, M.
    Van Keilegom, I
    Han, B.
    BIOMETRIKA, 2021, 108 (03) : 727 - 740
  • [8] Prediction and cross-validation of neural networks versus logistic regression: Using hepatic disorders as an example
    Duh, MS
    Walker, AM
    Pagano, M
    Kronlund, K
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 1998, 147 (04) : 407 - 413
  • [9] Cross-Validation of Resting Metabolic Rate Prediction Equations
    Flack, Kyle D.
    Siders, William A.
    Johnson, LuAnn
    Roemmich, James N.
    JOURNAL OF THE ACADEMY OF NUTRITION AND DIETETICS, 2016, 116 (09) : 1413 - 1422
  • [10] An Extension of the Receiver Operating Characteristic Curve and AUC-Optimal Classification
    Takenouchi, Takashi
    Komori, Osamu
    Eguchi, Shinto
    NEURAL COMPUTATION, 2012, 24 (10) : 2789 - 2824