Machine learning-based models for the prediction of breast cancer recurrence risk

被引:25
|
作者
Zuo, Duo [1 ,2 ,3 ,4 ,5 ]
Yang, Lexin [1 ,2 ,3 ,4 ,5 ]
Jin, Yu [1 ,6 ]
Qi, Huan [7 ]
Liu, Yahui [1 ,2 ,3 ,4 ,5 ]
Ren, Li [1 ,2 ,3 ,4 ,5 ]
机构
[1] Tianjin Med Univ, Dept Clin Lab, Canc Inst & Hosp, Tianjin 300060, Peoples R China
[2] Natl Clin Res Ctr Canc, Tianjin 300060, Peoples R China
[3] Tianjins Clin Res Ctr Canc, Tianjin 300060, Peoples R China
[4] Key Lab Canc Prevent & Therapy, Tianjin 300060, Peoples R China
[5] Tianjin Med Univ, Key Lab Breast Canc Prevent & Therapy, Minist Educ, Tianjin 300060, Peoples R China
[6] Tongji Univ, Canc Ctr, Shanghai Peoples Hosp 10, Sch Med, Shanghai 200072, Peoples R China
[7] China Mobile Grp Tianjin Co Ltd, Tianjin 300130, Peoples R China
关键词
Breast cancer; Machine learning; Artificial intelligence; Disease recurrence; Prediction model; PLASMA-FIBRINOGEN LEVEL; ARTIFICIAL-INTELLIGENCE; HEALTH-CARE; FOLLOW-UP; SURVIVAL; OVARIAN; CA125; CLASSIFICATION; PROGNOSIS; INDICATOR;
D O I
10.1186/s12911-023-02377-z
中图分类号
R-058 [];
学科分类号
摘要
Breast cancer is the most common malignancy diagnosed in women worldwide. The prevalence and incidence of breast cancer is increasing every year; therefore, early diagnosis along with suitable relapse detection is an important strategy for prognosis improvement. This study aimed to compare different machine algorithms to select the best model for predicting breast cancer recurrence. The prediction model was developed by using eleven different machine learning (ML) algorithms, including logistic regression (LR), random forest (RF), support vector classification (SVC), extreme gradient boosting (XGBoost), gradient boosting decision tree (GBDT), decision tree, multilayer perceptron (MLP), linear discriminant analysis (LDA), adaptive boosting (AdaBoost), Gaussian naive Bayes (GaussianNB), and light gradient boosting machine (LightGBM), to predict breast cancer recurrence. The area under the curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and F1 score were used to evaluate the performance of the prognostic model. Based on performance, the optimal ML was selected, and feature importance was ranked by Shapley Additive Explanation (SHAP) values. Compared to the other 10 algorithms, the results showed that the AdaBoost algorithm had the best prediction performance for successfully predicting breast cancer recurrence and was adopted in the establishment of the prediction model. Moreover, CA125, CEA, Fbg, and tumor diameter were found to be the most important features in our dataset to predict breast cancer recurrence. More importantly, our study is the first to use the SHAP method to improve the interpretability of clinicians to predict the recurrence model of breast cancer based on the AdaBoost algorithm. The AdaBoost algorithm offers a clinical decision support model and successfully identifies the recurrence of breast cancer.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Predicting the recurrence of breast cancer using machine learning algorithms
    Alzu'bi, Amal
    Najadat, Hassan
    Doulat, Wesam
    Al-Shari, Osama
    Zhou, Leming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) : 13787 - 13800
  • [22] Breast Cancer Prediction using Machine Learning Models
    Iparraguirre-Villanueva, Orlando
    Epifania-Huerta, Andres
    Torres-Ceclen, Carmen
    Ruiz-Alvarado, John
    Cabanillas-Carbonell, Michael
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 610 - 620
  • [23] Development and validation of machine learning-based models for prediction of adolescent idiopathic scoliosis: A retrospective study
    Lv, Zheng
    Lv, Wen
    Wang, Lei
    Ou, Jiayuan
    MEDICINE, 2023, 102 (14) : E33441
  • [24] Identification of Risk Factors and Machine Learning-Based Prediction Models for Knee Osteoarthritis Patients
    Kokkotis, Christos
    Moustakidis, Serafeim
    Giakas, Giannis
    Tsaopoulos, Dimitrios
    APPLIED SCIENCES-BASEL, 2020, 10 (19):
  • [25] A case-based ensemble learning system for explainable breast cancer recurrence prediction
    Gu, Dongxiao
    Su, Kaixiang
    Zhao, Huimin
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 107
  • [26] Analysis of breast cancer prediction and visualisation using machine learning models
    Magesh G.
    Swarnalatha P.
    International Journal of Cloud Computing, 2022, 11 (01) : 43 - 60
  • [27] Prediction models for chronic postsurgical pain in patients with breast cancer based on machine learning approaches
    Sun, Chen
    Li, Mohan
    Lan, Ling
    Pei, Lijian
    Zhang, Yuelun
    Tan, Gang
    Zhang, Zhiyong
    Huang, Yuguang
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [28] MRI Radiomics of Breast Cancer: Machine Learning-Based Prediction of Lymphovascular Invasion Status
    Kayadibi, Yasemin
    Kocak, Burak
    Ucar, Nese
    Akan, Yesim Namdar
    Yildirim, Emine
    Bektas, Sibel
    ACADEMIC RADIOLOGY, 2022, 29 : S126 - S134
  • [29] Breast Cancer Prediction Based on Multiple Machine Learning Algorithms
    Zhou, Sheng
    Hu, Chujiao
    Wei, Shanshan
    Yan, Xiaofan
    TECHNOLOGY IN CANCER RESEARCH & TREATMENT, 2024, 23
  • [30] Machine Learning-Based Prediction of Distant Recurrence in Invasive Breast Carcinoma Using Clinicopathological Data: A Cross-Institutional Study
    Sukhadia, Shrey S.
    Muller, Kristen E.
    Workman, Adrienne A.
    Nagaraj, Shivashankar H.
    CANCERS, 2023, 15 (15)