Application of artificial intelligence for overall survival risk stratification in oropharyngeal carcinoma: A validation of ProgTOOL

被引：8

作者：

Alabi, Rasheed Omobolaji ^{[1
,2
]}

Sjoblom, Anni ^{[3
]}

Carpen, Timo ^{[1
,3
,4
,5
]}

Elmusrati, Mohammed ^{[2
]}

Leivo, Ilmo ^{[6
]}

Almangush, Alhadi ^{[1
,3
,6
,7
]}

Makitie, Antti A. ^{[1
,4
,5
,8
,9
]}

机构：

[1] Univ Helsinki, Fac Med, Res Program Syst Oncol, Helsinki, Finland

[2] Univ Vaasa, Sch Technol & Innovat, Dept Ind Digitalizat, Vaasa, Finland

[3] Univ Helsinki, Dept Pathol, Helsinki, Finland

[4] Univ Helsinki, Dept Otorhinolaryngol Head & Neck Surg, Helsinki, Finland

[5] Helsinki Univ Hosp, Helsinki, Finland

[6] Univ Turku, Inst Biomed, Pathol, Turku, Finland

[7] Misurata Univ, Fac Dent, Misurata, Libya

[8] Karolinska Inst, Div Ear Nose & Throat Dis, Dept Clin Sci Intervent & Technol, Stockholm, Sweden

[9] Karolinska Univ Hosp, Stockholm, Sweden

来源：

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS | 2023年 / 175卷

关键词：

Machine learning; External validation; Internal validation; Overall survival; Prognostication; Web -based tool; Oropharyngeal; EXTERNAL VALIDATION; MARITAL-STATUS; NASOPHARYNGEAL CARCINOMA; PREDICTION MODELS; CANCER; PROGNOSIS; DIAGNOSIS; OUTCOMES; HEAD;

D O I：

10.1016/j.ijmedinf.2023.105064

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Background: In recent years, there has been a surge in machine learning-based models for diagnosis and prog-nostication of outcomes in oncology. However, there are concerns relating to the model's reproducibility and generalizability to a separate patient cohort (i.e., external validation).Objectives: This study primarily provides a validation study for a recently introduced and publicly available machine learning (ML) web-based prognostic tool (ProgTOOL) for overall survival risk stratification of oropharyngeal squamous cell carcinoma (OPSCC). Additionally, we reviewed the published studies that have utilized ML for outcome prognostication in OPSCC to examine how many of these models were externally validated, type of external validation, characteristics of the external dataset, and diagnostic performance char-acteristics on the internal validation (IV) and external validation (EV) datasets were extracted and compared.Methods: We used a total of 163 OPSCC patients obtained from the Helsinki University Hospital to externally validate the ProgTOOL for generalizability. In addition, PubMed, OvidMedline, Scopus, and Web of Science databases were systematically searched according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.Results: The ProgTOOL produced a predictive performance of 86.5% balanced accuracy, Mathew's correlation coefficient of 0.78, Net Benefit (0.7) and Brier score (0.06) for overall survival stratification of OPSCC patients as either low-chance or high-chance. In addition, out of a total of 31 studies found to have used ML for the prognostication of outcomes in OPSCC, only seven (22.6%) reported a form of EV. Three studies (42.9%) each used either temporal EV or geographical EV while only one study (14.2%) used expert as a form of EV. Most of the studies reported a reduction in performance when externally validated.Conclusion: The performance of the model in this validation study indicates that it may be generalized, therefore, bringing recommendations of the model for clinical evaluation closer to reality. However, the number of externally validated ML-based models for OPSCC is still relatively small. This significantly limits the transfer of these models for clinical evaluation and subsequently reduces the likelihood of the use of these models in daily clinical practice. As a gold standard, we recommend the use of geographical EV and validation studies to reveal biases and overfitting of these models. These recommendations are poised to facilitate the implementation of these models in clinical practice.

引用

页数：13

共 79 条

[1] Marital Status and Survival in Patients With Cancer [J].