Development and validation of machine learning models for predicting venous thromboembolism in colorectal cancer patients: A cohort study in China

被引:0
作者
Hu, Zuhai [1 ]
Li, Xiaosheng [2 ]
Yuan, Yuliang [1 ]
Xu, Qianjie [3 ]
Zhang, Wei [2 ]
Lei, Haike [1 ]
机构
[1] Chongqing Univ Canc Hosp, Chongqing Canc Multi Big Data Applicat Engn Res Ct, Chongqing 400030, Peoples R China
[2] Chongqing Univ Canc Hosp, Chongqing Key Lab Translat Res Canc Metastasis & I, Chongqing 400030, Peoples R China
[3] Chongqing Med Univ, Sch Publ Hlth, Dept Hlth Stat, Chongqing 400016, Peoples R China
关键词
Colorectal cancer; VTE; Machine learning; Bootstrap; RISK-ASSESSMENT MODELS; DIAGNOSIS;
D O I
10.1016/j.ijmedinf.2024.105770
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: With advancements in healthcare, traditional VTE risk assessment tools are increasingly insufficient to meet the demands of high-quality care, underscoring the need for innovative and specialized assessment methods. Objective: Owing to the remarkable success of machine learning in supervised learning and disease prediction, our objective is to develop a reliable and efficient model for assessing VTE risk by leveraging the fundamental data and clinical characteristics of colorectal cancer patients within our medical facility. Methods: Six commonly used machine learning algorithms were utilized in our study to predict the occurrence of VTE in patients with rectal cancer. In the modeling process, LASSO regression was employed to identify and exclude variables not associated with VTE. Additionally, hyperparameter tuning was conducted via 5-fold cross- validation to mitigate overfitting, and 200 bootstrap samples were used to adjust the apparent performance on the training set. The selection of the VTE assessment model was determined by a thorough evaluation of performance criteria, such as the AUC, ACC and F1 score. Results: The RF model exhibits consistent and efficient performance. Specifically, in the internally validation dataset, where generalizability was adjusted, the RF model achieved the highest scores across multiple metrics: AD-AUC (0.895), AD-ACC (0.871), AD-F1 (0.311), AD-MCC (0.316), AD-Precision (0.241), AD-Specificity (0.888). For external validation on unseen colon cancer data, the RF model also performed best in terms of ACC (0.728), F1 (0.292), MCC (0.225), Precision (0.192), and Specificity (0.740), with a suboptimal AUC of 0.745 and a Sensitivity (Recall) of 0.615. Additionally, the RF model demonstrates strong performance not only on the original dataset but also on datasets processed via alternative imbalance handling techniques. Conclusions: Our research successfully established and validated a risk assessment model for assessing the risk of VTE in colorectal cancer patients.
引用
收藏
页数:12
相关论文
共 35 条
  • [1] Khorana score and thromboembolic risk in stage II-III colorectal cancer patients: a post hoc analysis from the adjuvant TOSCA trial
    Barni, Sandro
    Rosati, Gerardo
    Lonardi, Sara
    Pella, Nicoletta
    Banzi, Maria
    Zampino, Maria G.
    Dotti, Katia F.
    Rimassa, Lorenza
    Marchetti, Paolo
    Maiello, Evaristo
    Artioli, Fabrizio
    Ferrari, Daris
    Labianca, Roberto
    Bidoli, Paolo
    Zaniboni, Alberto
    Sobrero, Alberto
    Iaffaioli, Vincenzo
    De Placido, Sabino
    Frassineti, Gian Luca
    Ciarlo, Andrea
    Buonadonna, Angela
    Silvestris, Nicola
    Piazza, Elena
    Pavesi, Lorenzo
    Moroni, Mauro
    Clerico, Mario
    Aglietta, Massimo
    Giordani, Paolo
    Galli, Francesca
    Galli, Fabio
    Petrelli, Fausto
    [J]. THERAPEUTIC ADVANCES IN MEDICAL ONCOLOGY, 2020, 12
  • [2] Perioperative Venous Thromboembolism Prophylaxis
    Bartlett, Matthew A.
    Mauck, Karen F.
    Stephenson, Christopher R.
    Ganesh, Ravindra
    Daniels, Paul R.
    [J]. MAYO CLINIC PROCEEDINGS, 2020, 95 (12) : 2775 - 2798
  • [3] Thrombosis risk assessment as a guide to quality patient care
    Caprini, JA
    [J]. DM DISEASE-A-MONTH, 2005, 51 (2-3): : 70 - 78
  • [4] Risk-assessment models for VTE and bleeding in hospitalized medical patients: an overview of systematic reviews
    Darzi, Andrea J.
    Repp, Allen B.
    Spencer, Frederick A.
    Morsi, Rami Z.
    Charide, Rana
    Etxeandia-Ikobaltzeta, Itziar
    Bauer, Kenneth A.
    Burnett, Allison E.
    Cushman, Mary
    Dentali, Francesco
    Kahn, Susan R.
    Rezende, Suely M.
    Zakai, Neil A.
    Agarwal, Arnav
    Karam, Samer G.
    Lotfi, Tamara
    Wiercioch, Wojtek
    Waziry, Reem
    Iorio, Alfonso
    Akl, Elie A.
    Schunemann, Holger J.
    [J]. BLOOD ADVANCES, 2020, 4 (19) : 4929 - 4944
  • [5] A common cancer at an uncommon age The etiology of early-onset colorectal cancer needs to be understood to tackle rising incidence
    Giannakis, Marios
    Ng, Kimmie
    [J]. SCIENCE, 2023, 379 (6637) : 1088 - 1090
  • [6] Developing a machine learning model for bleeding prediction in patients with cancer-associated thrombosis receiving anticoagulation therapy
    Grdinic, Aleksandra G.
    Radovanovic, Sandro
    Gleditsch, Jostein
    Jorgensen, Camilla Tovik
    Asady, Elia
    Pettersen, Heidi Hassel
    Delibasic, Boris
    Ghanima, Waleed
    [J]. JOURNAL OF THROMBOSIS AND HAEMOSTASIS, 2024, 22 (04) : 1094 - 1104
  • [7] Interpretable machine learning models for predicting venous thromboembolism in the intensive care unit: an analysis based on data from 207 centers
    Guan, Chengfu
    Ma, Fuxin
    Chang, Sijie
    Zhang, Jinhua
    [J]. CRITICAL CARE, 2023, 27 (01)
  • [8] Incidence, Timing, and Outcomes of Venous Thromboembolism in Patients Undergoing Surgery for Esophagogastric Cancer: A Population-Based Cohort Study
    Hanna, Nader M.
    Williams, Erin
    Kong, Weidong
    Fundytus, Adam
    Booth, Christopher M.
    Patel, Sunil V.
    Caycedo-Marulanda, Antonio
    Chung, Wiley
    Nanji, Sulaiman
    Merchant, Shaila J.
    [J]. ANNALS OF SURGICAL ONCOLOGY, 2022, 29 (07) : 4393 - 4404
  • [9] Cost of colorectal cancer care: sufficient to inform cancer policy? Comment
    IJzerman, Maarten J.
    [J]. LANCET GASTROENTEROLOGY & HEPATOLOGY, 2021, 6 (09): : 679 - 680
  • [10] Machine learning predicts cancer-associated deep vein thrombosis using clinically available variables
    Jin, Shuai
    Qin, Dan
    Liang, Bao-Sheng
    Zhang, Li-Chuan
    Wei, Xiao-Xia
    Wang, Yu-Jie
    Zhuang, Bing
    Zhang, Tong
    Yang, Zhen-Peng
    Cao, Yi-Wei
    Jin, San-Li
    Yang, Ping
    Jiang, Bo
    Rao, Ben-Qiang
    Shi, Han-Ping
    Lu, Qian
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 161