Machine learning-based classifiers to predict metastasis in colorectal cancer patients

被引:3
|
作者
Talebi, Raheleh [1 ,2 ]
Celis-Morales, Carlos A. [3 ,4 ]
Akbari, Abolfazl [5 ]
Talebi, Atefeh [5 ,6 ]
Borumandnia, Nasrin [7 ]
Pourhoseingholi, Mohamad Amin [8 ]
机构
[1] Univ Appl Sci & Technol, Dept Pure Math, Unit 10, Tehran, Iran
[2] Univ Appl Sci & Technol, Math Architecture & Comp Engn Dept, Unit 10, Tehran, Iran
[3] Univ Glasgow, Sch Cardiovasc & Metab Hlth, Glasgow, Scotland
[4] Univ Catolica Maule, Human Performance Lab, Educ Phys Act & Hlth Res Unit, Talca, Chile
[5] Iran Univ Med Sci, Colorectal Res Ctr, Tehran, Iran
[6] Univ Glasgow, British Heart Fdn, Cardiovasc Res Ctr, Glasgow, Scotland
[7] Shahid Beheshti Univ Med Sci, Urol & Nephrol Res Ctr, Tehran, Iran
[8] Shahid Beheshti Univ Med Sci, Res Inst Gastroenterol & Liver Dis, Gastroenterol & Liver Dis Res Ctr, Tehran, Iran
来源
FRONTIERS IN ARTIFICIAL INTELLIGENCE | 2024年 / 7卷
关键词
colorectal cancer; machine learning; metastasis; model performance and validation; balance data; MODEL;
D O I
10.3389/frai.2024.1285037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background The increasing prevalence of colorectal cancer (CRC) in Iran over the past three decades has made it a key public health burden. This study aimed to predict metastasis in CRC patients using machine learning (ML) approaches in terms of demographic and clinical factors.Methods This study focuses on 1,127 CRC patients who underwent appropriate treatments at Taleghani Hospital, a tertiary care facility. The patients were divided into training and test datasets in an 80:20 ratio. Various ML methods, including Naive Bayes (NB), random rorest (RF), support vector machine (SVM), neural network (NN), decision tree (DT), and logistic regression (LR), were used for predicting metastasis in CRC patients. Model performance was evaluated using 5-fold cross-validation, reporting sensitivity, specificity, the area under the curve (AUC), and other indexes.Results Among the 1,127 patients, 183 (16%) had experienced metastasis. In the predictionof metastasis, both the NN and RF algorithms had the highest AUC, while SVM ranked third in both the original and balanced datasets. The NN and RF algorithms achieved the highest AUC (100%), sensitivity (100% and 100%, respectively), and accuracy (99.2% and 99.3%, respectively) on the balanced dataset, followed by the SVM with an AUC of 98.8%, a sensitivity of 97.5%, and an accuracy of 97%. Moreover, lower false negative rate (FNR), false positive rate (FPR), and higher negative predictive value (NPV) can be confirmed by these two methods. The results also showed that all methods exhibited good performance in the test datasets, and the balanced dataset improved the performance of most ML methods. The most important variables for predicting metastasis were the tumor stage, the number of involved lymph nodes, and the treatment type. In a separate analysis of patients with tumor stages I-III, it was identified that tumor grade, tumor size, and tumor stage are the most important features.Conclusion This study indicated that NN and RF were the best among ML-based approaches for predicting metastasis in CRC patients. Both the tumor stage and the number of involved lymph nodes were considered the most important features.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Machine learning based on SEER database to predict distant metastasis of thyroid cancer
    Qiao, Lixue
    Li, Hao
    Wang, Ziyang
    Sun, Hanlin
    Feng, Guicheng
    Yin, Detao
    ENDOCRINE, 2024, 84 (03) : 1040 - 1050
  • [32] Machine learning-based modeling to predict inhibitors of acetylcholinesterase
    Hardeep Sandhu
    Rajaram Naresh Kumar
    Prabha Garg
    Molecular Diversity, 2022, 26 : 331 - 340
  • [33] Machine learning-based modeling to predict inhibitors of acetylcholinesterase
    Sandhu, Hardeep
    Kumar, Rajaram Naresh
    Garg, Prabha
    MOLECULAR DIVERSITY, 2022, 26 (01) : 331 - 340
  • [34] Systematic review of machine learning-based radiomics approach for predicting microsatellite instability status in colorectal cancer
    Wang, Qiang
    Xu, Jianhua
    Wang, Anrong
    Chen, Yi
    Wang, Tian
    Chen, Danyu
    Zhang, Jiaxing
    Brismar, Torkel B. B.
    RADIOLOGIA MEDICA, 2023, 128 (02): : 136 - 148
  • [35] Machine learning for predicting liver and/or lung metastasis in colorectal cancer: A retrospective study based on the SEER database
    Guo, Zhentian
    Zhang, Zongming
    Liu, Limin
    Zhao, Yue
    Liu, Zhuo
    Zhang, Chong
    Qi, Hui
    Feng, Jinqiu
    Yang, Chunmin
    Tai, Weiping
    Banchini, Filippo
    Inchingolo, Riccardo
    EJSO, 2024, 50 (07):
  • [36] Machine learning-based algorithms to predict severe psychological distress among cancer patients with spinal metastatic disease
    Gao, Le
    Cao, Yuncen
    Cao, Xuyong
    Shi, Xiaolin
    Lei, Mingxing
    Su, Xiuyun
    Liu, Yaosheng
    SPINE JOURNAL, 2023, 23 (09) : 1255 - 1269
  • [37] Comprehensive machine learning-based preoperative blood features predict the prognosis for ovarian cancer
    Wu, Meixuan
    Gu, Sijia
    Yang, Jiani
    Zhao, Yaqian
    Sheng, Jindan
    Cheng, Shanshan
    Xu, Shilin
    Wu, Yongsong
    Ma, Mingjun
    Luo, Xiaomei
    Zhang, Hao
    Wang, Yu
    Zhao, Aimin
    BMC CANCER, 2024, 24 (01)
  • [38] Comprehensive machine learning-based preoperative blood features predict the prognosis for ovarian cancer
    Meixuan Wu
    Sijia Gu
    Jiani Yang
    Yaqian Zhao
    Jindan Sheng
    Shanshan Cheng
    Shilin Xu
    Yongsong Wu
    Mingjun Ma
    Xiaomei Luo
    Hao Zhang
    Yu Wang
    Aimin Zhao
    BMC Cancer, 24
  • [39] Analysis of Colorectal and Gastric Cancer Classification: A Mathematical Insight Utilizing Traditional Machine Learning Classifiers
    Rai, Hari Mohan
    Yoo, Joon
    MATHEMATICS, 2023, 11 (24)
  • [40] Explainable machine learning for predicting lung metastasis of colorectal cancer
    Zhentian Guo
    Zongming Zhang
    Limin Liu
    Yue Zhao
    Zhuo Liu
    Chong Zhang
    Hui Qi
    Jinqiu Feng
    Peijie Yao
    Scientific Reports, 15 (1)