Applying machine learning techniques to predict the risk of lung metastases from rectal cancer: a real-world retrospective study

被引:2
|
作者
Qiu, Binxu [1 ]
Shen, Zixiong [2 ]
Yang, Dongliang [1 ]
Wang, Quan [1 ]
机构
[1] First Hosp Jilin Univ, Gen Surg Ctr, Dept Gastr & Colorectal Surg, Changchun, Peoples R China
[2] First Hosp Jilin Univ, Dept Thorac Surg, Changchun, Peoples R China
来源
FRONTIERS IN ONCOLOGY | 2023年 / 13卷
关键词
machine learning; rectal cancer; lung metastasis; real world; web calculator; COLORECTAL-CANCER; PULMONARY METASTASECTOMY; SURGICAL INDICATIONS; PROGNOSTIC-FACTORS; RESECTION; SURVIVAL; MODEL;
D O I
10.3389/fonc.2023.1183072
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: Metastasis in the lungs is common in patients with rectal cancer, and it can have severe consequences on their survival and quality of life. Therefore, it is essential to identify patients who may be at risk of developing lung metastasis from rectal cancer. Methods: In this study, we utilized eight machine-learning methods to create a model for predicting the risk of lung metastasis in patients with rectal cancer. Our cohort consisted of 27,180 rectal cancer patients selected from the Surveillance, Epidemiology and End Results (SEER) database between 2010 and 2017 for model development. Additionally, we validated our models using 1118 rectal cancer patients from a Chinese hospital to evaluate model performance and generalizability. We assessed our models' performance using various metrics, including the area under the curve (AUC), the area under the precision-recall curve ( AUPR), the Matthews Correlation Coefficient (MCC), decision curve analysis (DCA), and calibration curves. Finally, we applied the best model to develop a web-based calculator for predicting the risk of lung metastasis in patients with rectal cancer. Result: Our study employed tenfold cross-validation to assess the performance of eight machine-learning models for predicting the risk of lung metastasis in patients with rectal cancer. The AUC values ranged from 0.73 to 0.96 in the training set, with the extreme gradient boosting (XGB) model achieving the highest AUC value of 0.96. Moreover, the XGB model obtained the best AUPR and MCC in the training set, reaching 0.98 and 0.88, respectively. We found that the XGB model demonstrated the best predictive power, achieving an AUC of 0.87, an AUPR of 0.60, an accuracy of 0.92, and a sensitivity of 0.93 in the internal test set. Furthermore, the XGB model was evaluated in the external test set and achieved an AUC of 0.91, an AUPR of 0.63, an accuracy of 0.93, a sensitivity of 0.92, and a specificity of 0.93. The XGB model obtained the highest MCC in the internal test set and external validation set, with 0.61 and 0.68, respectively. Based on the DCA and calibration curve analysis, the XGB model had better clinical decision-making ability and predictive power than the other seven models. Lastly, we developed an online web calculator using the XGB model to assist doctors in making informed decisions and to facilitate the model's wider adoption (https://share.streamlit.io/woshiwz/rectal_cancer/main/lung.py). Conclusion: In this study, we developed an XGB model based on clinicopathological information to predict the risk of lung metastasis in patients with rectal cancer, which may help physicians make clinical decisions.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Application of machine learning techniques in real-world research to predict the risk of liver metastasis in rectal cancer
    Qiu, Binxu
    Su, Xiao Hu
    Qin, Xinxin
    Wang, Quan
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [2] Construction and validation of machine learning models for predicting distant metastases in newly diagnosed colorectal cancer patients: A large-scale and real-world cohort study
    Wei, Ran
    Yu, Guanhua
    Wang, Xishan
    Jiang, Zheng
    Guan, Xu
    CANCER MEDICINE, 2024, 13 (05):
  • [3] Applying machine learning to predict real-world individual treatment effects: insights from a virtual patient cohort
    Fang, Gang
    Annis, Izabela E.
    Elston-Lafata, Jennifer
    Cykert, Samuel
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (10) : 977 - 988
  • [4] Development and validation of a model to predict the risk of distant metastases from hepatocellular carcinoma: a real-world retrospective study
    Shao, Guangzhao
    Fan, Zhongqi
    Qiu, Wei
    Lv, Guoyue
    JOURNAL OF CANCER RESEARCH AND CLINICAL ONCOLOGY, 2023, 149 (18) : 16489 - 16499
  • [5] Real-world data on severe lung cancer: a multicenter retrospective study
    Wang, Fei
    Xie, Xiaohong
    Wang, Liqiang
    Deng, Haiyi
    Wang, Qian
    Qi, Min
    Guo, Min
    Chen, Juan
    Zhou, Maolin
    Sun, Ni
    Li, Ru
    Yang, Yilin
    He, Zuer
    Lin, Xinqing
    Liu, Ming
    Wu, Di
    Sun, Gengyun
    Zhou, Chengzhi
    TRANSLATIONAL LUNG CANCER RESEARCH, 2023, 12 (03) : 460 - +
  • [6] Using machine learning to identify risk factors for pancreatic cancer: a retrospective cohort study of real-world data
    Su, Na
    Tang, Rui
    Zhang, Yice
    Ni, Jiaqi
    Huang, Yimei
    Liu, Chunqi
    Xiao, Yuzhou
    Zhu, Baoting
    Zhao, Yinglan
    FRONTIERS IN PHARMACOLOGY, 2024, 15
  • [7] Prognostic score for synchronous metastatic rectal cancer: A real-world study
    Muzellec, Lea
    Campion, Loic
    Bachet, Jean-Baptiste
    Taieb, Julien
    Fremont, Elodie
    Senellart, Helene
    Moreau, Johanna
    Bouche, Olivier
    Garric, Marie
    Guimbaud, Rosine
    Greilsamer, Charlotte
    Bodere, Anais
    Lievre, Astrid
    Girot, Paul
    Edeline, Julien
    Tougeron, David
    Bennouna, Jaafar
    Touchefeu, Yann
    DIGESTIVE AND LIVER DISEASE, 2023, 55 (10) : 1411 - 1416
  • [8] Combined application of inflammation-related biomarkers to predict postoperative complications of rectal cancer patients: a retrospective study by machine learning analysis
    Wang, Kunyue
    Tang, Youyuan
    Zhang, Feng
    Guo, Xingpo
    Gao, Ling
    LANGENBECKS ARCHIVES OF SURGERY, 2023, 408 (01)
  • [9] Real-World Data and Machine Learning to Predict Cardiac Amyloidosis
    Garcia-Garcia, Elena
    Maria Gonzalez-Romero, Gracia
    Martin-Perez, Encarna M.
    Zapata Cornejo, Enrique de Dios
    Escobar-Aguilar, Gema
    Cardenas Bonnet, Marlon Felix
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (03) : 1 - 15
  • [10] A novel analytical framework for risk stratification of real-world data using machine learning: A small cell lung cancer study
    Marzano, Luca
    Darwich, Adam S.
    Tendler, Salomon
    Dan, Asaf
    Lewensohn, Rolf
    De Petris, Luigi
    Raghothama, Jayanth
    Meijer, Sebastiaan
    CTS-CLINICAL AND TRANSLATIONAL SCIENCE, 2022, 15 (10): : 2437 - 2447