Development and validation of machine learning models for predicting venous thromboembolism in colorectal cancer patients: A cohort study in China

被引:0
作者
Hu, Zuhai [1 ]
Li, Xiaosheng [2 ]
Yuan, Yuliang [1 ]
Xu, Qianjie [3 ]
Zhang, Wei [2 ]
Lei, Haike [1 ]
机构
[1] Chongqing Univ Canc Hosp, Chongqing Canc Multi Big Data Applicat Engn Res Ct, Chongqing 400030, Peoples R China
[2] Chongqing Univ Canc Hosp, Chongqing Key Lab Translat Res Canc Metastasis & I, Chongqing 400030, Peoples R China
[3] Chongqing Med Univ, Sch Publ Hlth, Dept Hlth Stat, Chongqing 400016, Peoples R China
关键词
Colorectal cancer; VTE; Machine learning; Bootstrap; RISK-ASSESSMENT MODELS; DIAGNOSIS;
D O I
10.1016/j.ijmedinf.2024.105770
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: With advancements in healthcare, traditional VTE risk assessment tools are increasingly insufficient to meet the demands of high-quality care, underscoring the need for innovative and specialized assessment methods. Objective: Owing to the remarkable success of machine learning in supervised learning and disease prediction, our objective is to develop a reliable and efficient model for assessing VTE risk by leveraging the fundamental data and clinical characteristics of colorectal cancer patients within our medical facility. Methods: Six commonly used machine learning algorithms were utilized in our study to predict the occurrence of VTE in patients with rectal cancer. In the modeling process, LASSO regression was employed to identify and exclude variables not associated with VTE. Additionally, hyperparameter tuning was conducted via 5-fold cross- validation to mitigate overfitting, and 200 bootstrap samples were used to adjust the apparent performance on the training set. The selection of the VTE assessment model was determined by a thorough evaluation of performance criteria, such as the AUC, ACC and F1 score. Results: The RF model exhibits consistent and efficient performance. Specifically, in the internally validation dataset, where generalizability was adjusted, the RF model achieved the highest scores across multiple metrics: AD-AUC (0.895), AD-ACC (0.871), AD-F1 (0.311), AD-MCC (0.316), AD-Precision (0.241), AD-Specificity (0.888). For external validation on unseen colon cancer data, the RF model also performed best in terms of ACC (0.728), F1 (0.292), MCC (0.225), Precision (0.192), and Specificity (0.740), with a suboptimal AUC of 0.745 and a Sensitivity (Recall) of 0.615. Additionally, the RF model demonstrates strong performance not only on the original dataset but also on datasets processed via alternative imbalance handling techniques. Conclusions: Our research successfully established and validated a risk assessment model for assessing the risk of VTE in colorectal cancer patients.
引用
收藏
页数:12
相关论文
共 35 条
  • [11] Venous Thromboembolism Prophylaxis and Treatment in Patients With Cancer: ASCO Clinical Practice Guideline Update
    Key, Nigel S.
    Khorana, Alok A.
    Kuderer, Nicole M.
    Bohlke, Kari
    Lee, Agnes Y. Y.
    Arcelus, Juan, I
    Wong, Sandra L.
    Balaban, Edward P.
    Flowers, Christopher R.
    Francis, Charles W.
    Gates, Leigh E.
    Kakkar, Ajay K.
    Levine, Mark N.
    Liebman, Howard A.
    Tempero, Margaret A.
    Lyman, Gary H.
    Falanga, Anna
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (05) : 496 - 520
  • [12] Development and validation of a predictive model for chemotherapy-associated thrombosis
    Khorana, Alok A.
    Kuderer, Nicole M.
    Culakova, Eva
    Lyman, Gary H.
    Francis, Charles W.
    [J]. BLOOD, 2008, 111 (10) : 4902 - 4907
  • [13] Cancer-associated venous thromboembolism
    Khorana, Alok A.
    Mackman, Nigel
    Falanga, Anna
    Pabinger, Ingrid
    Noble, Simon
    Ageno, Walter
    Moik, Florian
    Lee, Agnes Y. Y.
    [J]. NATURE REVIEWS DISEASE PRIMERS, 2022, 8 (01)
  • [14] Incidence, mortality, survival, risk factor and screening of colorectal cancer: A comparison among China, Europe, and northern America
    Li, Na
    Lu, Bin
    Luo, Chenyu
    Cai, Jie
    Lu, Ming
    Zhang, Yuhan
    Chen, Hongda
    Dai, Min
    [J]. CANCER LETTERS, 2021, 522 : 255 - 268
  • [15] A Survey on Sparse Learning Models for Feature Selection
    Li, Xiaoping
    Wang, Yadi
    Ruiz, Ruben
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (03) : 1642 - 1660
  • [16] Venous Thromboembolic Prophylaxis After Total Hip and Knee Arthroplasty
    Lieberman, Jay R.
    Bell, Jennifer A.
    [J]. JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 2021, 103 (16) : 1556 - 1564
  • [17] Postoperative Venous Thromboembolism in Colon and Rectal Cancer: Do Tumor Location and Operation Matter?
    McKenna, Nicholas P.
    Bews, Katherine A.
    Behm, Kevin T.
    Habermann, Elizabeth B.
    Cima, Robert R.
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2023, 236 (04) : 658 - 665
  • [18] Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration
    Moons, Karel G. M.
    Altman, Douglas G.
    Reitsma, Johannes B.
    Ioannidis, John P. A.
    Macaskill, Petra
    Steyerberg, Ewout W.
    Vickers, Andrew J.
    Ransohoff, David F.
    Collins, Gary S.
    [J]. ANNALS OF INTERNAL MEDICINE, 2015, 162 (01) : W1 - W73
  • [19] Thromboembolic and bleeding complications in patients with oesophageal cancer
    Mulder, F., I
    Hovenkamp, A.
    van Laarhoven, H. W. M.
    Buller, H. R.
    Kamphuisen, P. W.
    Hulshof, M. C. C. M.
    Henegouwen, M. I. van Berge
    Middeldorp, S.
    van Es, N.
    [J]. BRITISH JOURNAL OF SURGERY, 2020, 107 (10) : 1324 - 1333
  • [20] Breast cancer detection using artificial intelligence techniques: A systematic literature review
    Nassif, Ali Bou
    Abu Talib, Manar
    Nasir, Qassim
    Afadar, Yaman
    Elgendy, Omar
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 127