Comparison of nomogram with machine learning techniques for prediction of overall survival in patients with tongue cancer

被引:62
作者
Alabi, Rasheed Omobolaji [1 ]
Makitie, Antti A. [2 ,3 ,4 ,5 ,6 ]
Pirinen, Matti [7 ,8 ,9 ]
Elmusrati, Mohammed [1 ]
Leivo, Ilmo [10 ]
Almangush, Alhadi [2 ,10 ,11 ,12 ]
机构
[1] Univ Vaasa, Sch Technol & Innovat, Dept Ind Digitalizat, Vaasa, Finland
[2] Univ Helsinki, Fac Med, Res Program Syst Oncol, Helsinki, Finland
[3] Univ Helsinki, Dept Otorhinolaryngol Head & Neck Surg, Helsinki, Finland
[4] Helsinki Univ Hosp, Helsinki, Finland
[5] Karolinska Inst, Dept Clin Sci Intervent & Technol, Div Ear Nose & Throat Dis, Stockholm, Sweden
[6] Karolinska Univ Hosp, Stockholm, Sweden
[7] Univ Helsinki, Inst Mol Med Finland FIMM, Helsinki, Finland
[8] Univ Helsinki, Dept Publ Hlth, Helsinki, Finland
[9] Univ Helsinki, Dept Math & Stat, Helsinki, Finland
[10] Univ Turku, Inst Biomed Pathol, Turku, Finland
[11] Univ Helsinki, Dept Pathol, Helsinki, Finland
[12] Univ Misurata, Fac Dent, Misurata, Libya
关键词
Machine learning; Nomogram; tongue cancer; Predict; overall survival; SQUAMOUS-CELL CARCINOMA; ESTIMATE LONG-TERM; EXTERNAL VALIDATION; ORAL-CANCER; METASTASIS; EPIDEMIOLOGY; RECURRENCE; FEATURES; TRENDS; HEAD;
D O I
10.1016/j.ijmedinf.2020.104313
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: The prediction of overall survival in tongue cancer is important for planning of personalized care and patient counselling. Objectives: This study compares the performance of a nomogram with a machine learning model to predict overall survival in tongue cancer. The nomogram and machine learning model were built using a large data set from the Surveillance, Epidemiology, and End Results (SEER) program database. The comparison is necessary to provide the clinicians with a comprehensive, practical, and most accurate assistive system to predict overall survival of this patient population. Methods: The data set used included the records of 7596 tongue cancer patients. The considered machine learning algorithms were logistic regression, support vector machine, Bayes point machine, boosted decision tree, decision forest, and decision jungle. These algorithms were mainly evaluated in terms of the areas under the receiver operating characteristic (ROC) curve (AUC) and accuracy values. The performance of the algorithm that produced the best result was compared with a nomogram to predict overall survival in tongue cancer patients. Results: The boosted decision-tree algorithm outperformed other algorithms. When compared with a nomogram using external validation data, the boosted decision tree produced an accuracy of 88.7% while the nomogram showed an accuracy of 60.4%. In addition, it was found that age of patient, T stage, radiotherapy, and the surgical resection were the most prominent features with significant influence on the machine learning model's performance to predict overall survival. Conclusion: The machine learning model provides more personalized and reliable prognostic information of tongue cancer than the nomogram. However, the level of transparency offered by the nomogram in estimating patients' outcomes seems more confident and strengthened the principle of shared decision making between the patient and clinician. Therefore, a combination of a nomogram - machine learning (NomoML) predictive model may help to improve care, provides information to patients, and facilitates the clinicians in making tongue cancer management-related decisions.
引用
收藏
页数:9
相关论文
共 56 条
[1]   Machine learning application for prediction of locoregional recurrences in early oral tongue cancer: a Web-based prognostic tool [J].
Alabi, Rasheed Omobolaji ;
Elmusrati, Mohammed ;
Sawazaki-Calone, Iris ;
Kowalski, Luiz Paulo ;
Haglund, Caj ;
Coletta, Ricardo D. ;
Makitie, Antti A. ;
Salo, Tuula ;
Leivo, Ilmo ;
Almangush, Alhadi .
VIRCHOWS ARCHIV, 2019, 475 (04) :489-497
[2]   Comparison of supervised machine learning classification techniques in prediction of locoregional recurrences in early oral tongue cancer [J].
Alabi, Rasheed Omobolaji ;
Elmusrati, Mohammed ;
Sawazaki-Calone, Iris ;
Kowalski, Luiz Paulo ;
Haglund, Caj ;
Coletta, Ricardo D. ;
Makitie, Antti A. ;
Salo, Tuula ;
Almangush, Alhadi ;
Leivo, Ilmo .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2020, 136
[3]   Postoperative Nomograms Predictive of Survival After Surgical Management of Malignant Tumors of the Major Salivary Glands [J].
Ali, Safina ;
Palmer, Frank L. ;
Yu, Changhong ;
DiLorenzo, Monica ;
Shah, Jatin P. ;
Kattan, Michael W. ;
Patel, Snehal G. ;
Ganly, Ian .
ANNALS OF SURGICAL ONCOLOGY, 2014, 21 (02) :637-642
[4]   For early-stage oral tongue cancer, depth of invasion and worst pattern of invasion are the strongest pathological predictors for locoregional recurrence and mortality [J].
Almangush, Alhadi ;
Bello, Ibrahim O. ;
Coletta, Ricardo D. ;
Makitie, Antti A. ;
Makinen, Laura K. ;
Kauppila, Joonas H. ;
Pukkila, Matti ;
Hagstrom, Jaana ;
Laranne, Jussi ;
Soini, Ylermi ;
Kosma, Veli-Matti ;
Koivunen, Petri ;
Kelner, Natalie ;
Kowalski, Luiz Paulo ;
Grenman, Reidar ;
Leivo, Ilmo ;
Laara, Esa ;
Salo, Tuula .
VIRCHOWS ARCHIV, 2015, 467 (01) :39-46
[5]   Contrast-enhanced computed tomography image assessment of cervical lymph node metastasis in patients with oral cancer by using a deep learning system of artificial intelligence [J].
Ariji, Yoshiko ;
Fukuda, Motoki ;
Kise, Yoshitaka ;
Nozawa, Michihito ;
Yanashita, Yudai ;
Fujita, Hiroshi ;
Katsumata, Akitoshi ;
Ariji, Eiichiro .
ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY, 2019, 127 (05) :458-463
[6]  
Arrangoiz R., 2018, Cancer Reports Rev, DOI [10.15761/crr.1000153, DOI 10.15761/CRR.1000153]
[7]   Nomograms in oncology: more than meets the eye [J].
Balachandran, Vinod P. ;
Gonen, Mithat ;
Smith, J. Joshua ;
DeMatteo, Ronald P. .
LANCET ONCOLOGY, 2015, 16 (04) :E173-E180
[8]  
Barga R., 2015, Predictive Analytics with Microsoft Azure Machine Learning
[9]   Prognostic evaluation of oral tongue cancer: Means, markers and perspectives (I) [J].
Bello, Ibrahim O. ;
Soini, Ylermi ;
Salo, Tuula .
ORAL ONCOLOGY, 2010, 46 (09) :630-635
[10]   SMOTE for high-dimensional class-imbalanced data [J].
Blagus, Rok ;
Lusa, Lara .
BMC BIOINFORMATICS, 2013, 14