Application of artificial intelligence for overall survival risk stratification in oropharyngeal carcinoma: A validation of ProgTOOL

被引:8
作者
Alabi, Rasheed Omobolaji [1 ,2 ]
Sjoblom, Anni [3 ]
Carpen, Timo [1 ,3 ,4 ,5 ]
Elmusrati, Mohammed [2 ]
Leivo, Ilmo [6 ]
Almangush, Alhadi [1 ,3 ,6 ,7 ]
Makitie, Antti A. [1 ,4 ,5 ,8 ,9 ]
机构
[1] Univ Helsinki, Fac Med, Res Program Syst Oncol, Helsinki, Finland
[2] Univ Vaasa, Sch Technol & Innovat, Dept Ind Digitalizat, Vaasa, Finland
[3] Univ Helsinki, Dept Pathol, Helsinki, Finland
[4] Univ Helsinki, Dept Otorhinolaryngol Head & Neck Surg, Helsinki, Finland
[5] Helsinki Univ Hosp, Helsinki, Finland
[6] Univ Turku, Inst Biomed, Pathol, Turku, Finland
[7] Misurata Univ, Fac Dent, Misurata, Libya
[8] Karolinska Inst, Div Ear Nose & Throat Dis, Dept Clin Sci Intervent & Technol, Stockholm, Sweden
[9] Karolinska Univ Hosp, Stockholm, Sweden
关键词
Machine learning; External validation; Internal validation; Overall survival; Prognostication; Web -based tool; Oropharyngeal; EXTERNAL VALIDATION; MARITAL-STATUS; NASOPHARYNGEAL CARCINOMA; PREDICTION MODELS; CANCER; PROGNOSIS; DIAGNOSIS; OUTCOMES; HEAD;
D O I
10.1016/j.ijmedinf.2023.105064
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: In recent years, there has been a surge in machine learning-based models for diagnosis and prog-nostication of outcomes in oncology. However, there are concerns relating to the model's reproducibility and generalizability to a separate patient cohort (i.e., external validation).Objectives: This study primarily provides a validation study for a recently introduced and publicly available machine learning (ML) web-based prognostic tool (ProgTOOL) for overall survival risk stratification of oropharyngeal squamous cell carcinoma (OPSCC). Additionally, we reviewed the published studies that have utilized ML for outcome prognostication in OPSCC to examine how many of these models were externally validated, type of external validation, characteristics of the external dataset, and diagnostic performance char-acteristics on the internal validation (IV) and external validation (EV) datasets were extracted and compared.Methods: We used a total of 163 OPSCC patients obtained from the Helsinki University Hospital to externally validate the ProgTOOL for generalizability. In addition, PubMed, OvidMedline, Scopus, and Web of Science databases were systematically searched according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.Results: The ProgTOOL produced a predictive performance of 86.5% balanced accuracy, Mathew's correlation coefficient of 0.78, Net Benefit (0.7) and Brier score (0.06) for overall survival stratification of OPSCC patients as either low-chance or high-chance. In addition, out of a total of 31 studies found to have used ML for the prognostication of outcomes in OPSCC, only seven (22.6%) reported a form of EV. Three studies (42.9%) each used either temporal EV or geographical EV while only one study (14.2%) used expert as a form of EV. Most of the studies reported a reduction in performance when externally validated.Conclusion: The performance of the model in this validation study indicates that it may be generalized, therefore, bringing recommendations of the model for clinical evaluation closer to reality. However, the number of externally validated ML-based models for OPSCC is still relatively small. This significantly limits the transfer of these models for clinical evaluation and subsequently reduces the likelihood of the use of these models in daily clinical practice. As a gold standard, we recommend the use of geographical EV and validation studies to reveal biases and overfitting of these models. These recommendations are poised to facilitate the implementation of these models in clinical practice.
引用
收藏
页数:13
相关论文
共 79 条
[1]   Marital Status and Survival in Patients With Cancer [J].
Aizer, Ayal A. ;
Chen, Ming-Hui ;
McCarthy, Ellen P. ;
Mendu, Mallika L. ;
Koo, Sophia ;
Wilhite, Tyler J. ;
Graham, Powell L. ;
Choueiri, Toni K. ;
Hoffman, Karen E. ;
Martin, Neil E. ;
Hu, Jim C. ;
Nguyen, Paul L. .
JOURNAL OF CLINICAL ONCOLOGY, 2013, 31 (31) :3869-3876
[2]  
Alabi R.O., 2020, CEUR WORKSHOP P
[3]   An interpretable machine learning prognostic system for risk stratification in oropharyngeal cancer [J].
Alabi, Rasheed Omobolaji ;
Almangush, Alhadi ;
Elmusrati, Mohammed ;
Leivo, Ilmo ;
Makitie, Antti A. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 168
[4]   Measuring the Usability and Quality of Explanations of a Machine Learning Web-Based Tool for Oral Tongue Cancer Prognostication [J].
Alabi, Rasheed Omobolaji ;
Almangush, Alhadi ;
Elmusrati, Mohammed ;
Leivo, Ilmo ;
Makitie, Antti .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (14)
[5]   Machine learning in oral squamous cell carcinoma: Current status, clinical concerns and prospects for future-A systematic review [J].
Alabi, Rasheed Omobolaji ;
Youssef, Omar ;
Pirinen, Matti ;
Elmusrati, Mohammed ;
Makitie, Antti A. ;
Leivo, Ilmo ;
Almangush, Alhadi .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 115
[6]   Machine learning application for prediction of locoregional recurrences in early oral tongue cancer: a Web-based prognostic tool [J].
Alabi, Rasheed Omobolaji ;
Elmusrati, Mohammed ;
Sawazaki-Calone, Iris ;
Kowalski, Luiz Paulo ;
Haglund, Caj ;
Coletta, Ricardo D. ;
Makitie, Antti A. ;
Salo, Tuula ;
Leivo, Ilmo ;
Almangush, Alhadi .
VIRCHOWS ARCHIV, 2019, 475 (04) :489-497
[7]   Comparison of supervised machine learning classification techniques in prediction of locoregional recurrences in early oral tongue cancer [J].
Alabi, Rasheed Omobolaji ;
Elmusrati, Mohammed ;
Sawazaki-Calone, Iris ;
Kowalski, Luiz Paulo ;
Haglund, Caj ;
Coletta, Ricardo D. ;
Makitie, Antti A. ;
Salo, Tuula ;
Almangush, Alhadi ;
Leivo, Ilmo .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2020, 136
[8]   Machine learning in head and neck cancer: Importance of a web-based prognostic tool for improved decision making [J].
Almangush, Alhadi ;
Alabi, Rasheed Omobolaji ;
Makitie, Antti A. ;
Leivo, Ilmo .
ORAL ONCOLOGY, 2022, 124
[9]   Prognosis and prognostic research: validating a prognostic model [J].
Altman, Douglas G. ;
Vergouwe, Yvonne ;
Royston, Patrick ;
Moons, Karel G. M. .
BMJ-BRITISH MEDICAL JOURNAL, 2009, 338 :1432-1435
[10]   The Importance of Real-World Validation of Machine Learning Systems in Wearable Exercise Biofeedback Platforms: A Case Study [J].
Argent, Rob ;
Bevilacqua, Antonio ;
Keogh, Alison ;
Daly, Ailish ;
Caulfield, Brian .
SENSORS, 2021, 21 (07)