Machine learning for epithelial ovarian cancer platinum resistance recurrence identification using routine clinical data

被引:0
|
作者
Yang, Li-Rong [1 ]
Yang, Mei [2 ]
Chen, Liu-Lin [1 ]
Shen, Yong-Lin [1 ]
He, Yuan [2 ]
Meng, Zong-Ting [2 ]
Wang, Wan-Qi [2 ]
Li, Feng [3 ]
Liu, Zhi-Jin [4 ]
Li, Lin-Hui [1 ]
Wang, Yu-Feng [2 ]
Luo, Xin-Lei [5 ]
机构
[1] Southern Cent Hosp Yunnan Prov, Hematol Oncol Dept, Honghe, Yunnan, Peoples R China
[2] Kunming Med Univ, Affiliated Hosp 3, Geriatr Oncol Dept, Kunming, Yunnan, Peoples R China
[3] Pingxiang Peoples Hosp, Dept Oncol, Pingxiang, Jiangxi, Peoples R China
[4] First Hosp Nanchang, Dept Oncol, Nanchang, Jiangxi, Peoples R China
[5] Southern Cent Hosp Yunnan Prov, Dept Spinal Surg, Honghe, Peoples R China
来源
FRONTIERS IN ONCOLOGY | 2024年 / 14卷
关键词
platinum resistance; recurrence; model; nomogram; early detection of cancer; SERUM LACTATE-DEHYDROGENASE; PROGNOSTIC RELEVANCE; PROGRESSION-FREE; PRIMARY SURGERY; STAGE; PREDICTION; SURVIVAL; SENSITIVITY; HISTOLOGY; NOMOGRAM;
D O I
10.3389/fonc.2024.1457294
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background Most epithelial ovarian cancer (EOC) eventually develops recurrence. Identification of high-risk patients can prompt earlier intervention and improve long-term outcomes. We used laboratory and clinical data to create models based on machine learning for EOC platinum resistance recurrence identification.Methods This study was designed as a retrospective cohort analysis. Initially, we identified 1,392 patients diagnosed with epithelial ovarian cancer who underwent platinum-based chemotherapy at Yunnan Cancer Hospital between January 1, 2012, and June 30, 2022. We collected data on the patients' clinicopathologic characteristics, routine laboratory results, surgical information, details of chemotherapy regimens, and survival outcomes. Subsequently, to identify relevant variables influencing the recurrence of platinum resistance, we screened thirty potential factors using two distinct variable selection methods: Lasso regression and multiple logistic regression analysis. Following this screening process, five machine learning algorithms were employed to develop predictive models based on the selected variables. These included decision tree analysis (DTA), K-Nearest neighbor (KNN), support vector machine (SVM), random forest (RF), and eXtreme gradient boosting (XGBoost). The performance of these models was compared against that of traditional logistic regression. To ensure robust internal validation and facilitate comparison among model performance metrics, a five-fold cross-validation method was implemented. Key performance indicators for the models included the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and average accuracy. Finally, we will visualize these models through nomograms, decision tree diagrams, variable importance plots, etc., to assist clinicians in their practice.Results Multiple logistic regression analysis identified eight variables associated with platinum resistance recurrence. In the lasso regression, seven variables were selected. Based on the findings from both Lasso regression and multiple logistic regression analysis, models were developed using these 7 and 8 factors. Among these, the XGBoost model derived from multiple logistic regression exhibited superior performance and demonstrated good discrimination during internal validation, achieving an AUC of 0.784, a sensitivity of 0.735, a specificity of 0.713, an average accuracy of 80.4%, with a cut-off value set at 0.240. Conversely, the LR model based on lasso regression yielded commendable results as well; it achieved an AUC of 0.738, a sensitivity of 0.541, a specificity of 0.836, with a cut-off value established at 0.154 and an accuracy rate of 79.6%. Finally, we visualized both models through nomograms to illustrate the significance of each variable involved in their development.Conclusions We have successfully developed predictive models for platinum-resistant recurrence of epithelial ovarian cancer, utilizing routine clinical and laboratory data. Among these models, the XGBoost model-derived from variables selected through multiple logistic regression-demonstrated the best performance. It exhibited high AUC values and average accuracy during internal validation, making it a recommended tool for clinical use. However, due to variations in time and context, influencing factors may change over time; thus, continuous evolution of the model is necessary. We propose a framework for this ongoing model adaptation.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Machine Learning for Early Lung Cancer Identification Using Routine Clinical and Laboratory Data
    Gould, Michael K.
    Huang, Brian Z.
    Tammemagi, Martin C.
    Kinar, Yaron
    Shiff, Ron
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2021, 204 (04) : 445 - 453
  • [2] FSP1 expression as a predictor of platinum resistance and recurrence in epithelial ovarian cancer
    Xing, Hang
    Bi, Hai-Ning
    Yin, Qi
    Zhang, Ji
    Zhang, Xue
    Li, Yao-Jiao
    Gong, Xue-Mei
    Shi, Ji-Fang
    ANTI-CANCER DRUGS, 2025, 36 (04) : 338 - 346
  • [3] Artificial intelligence-assisted classification of epithelial ovarian cancer patients using machine learning clustering to determine the rate of platinum sensitivity and disease recurrence
    Nakayama, John
    Pindzola, Grace
    Summerscales, Tiffany
    McGaughey, Michael
    GYNECOLOGIC ONCOLOGY, 2024, 190 : S198 - S199
  • [4] PREDICTION OF PLATINUM-BASED CHEMOTHERAPY RESISTANCE IN EPITHELIAL OVARIAN CANCER USING APPARENT DIFFUSION COEFFICIENT OF MAGNETIC RESONANCE IMAGE AND MACHINE LEARNING ALGORITHM
    Lee, Sung-Jong
    Woo, Jae Yeon
    Song, Heekyoung
    Cha, Jimin
    INTERNATIONAL JOURNAL OF GYNECOLOGICAL CANCER, 2022, 32 : A47 - A47
  • [5] Identification of spinal tuberculosis subphenotypes using routine clinical data: a study based on unsupervised machine learning
    Yao, Yuanlin
    Wu, Shaofeng
    Liu, Chong
    Zhou, Chenxing
    Zhu, Jichong
    Chen, Tianyou
    Huang, Chengqian
    Feng, Sitan
    Zhang, Bin
    Wu, Siling
    Ma, Fengzhi
    Liu, Lu
    Zhan, Xinli
    ANNALS OF MEDICINE, 2023, 55 (02)
  • [6] Predicting the Recurrence of Ovarian Cancer Based on Machine Learning
    Zhou, Lining
    Hong, Hong
    Chu, Fuying
    Chen, Xiang
    Wang, Chenlu
    CANCER MANAGEMENT AND RESEARCH, 2024, 16 : 1375 - 1387
  • [7] A Bioinformatics Analysis of Ovarian Cancer Data Using Machine Learning
    Schilling, Vincent
    Beyerlein, Peter
    Chien, Jeremy
    ALGORITHMS, 2023, 16 (07)
  • [8] Related Clinical Factors of Platinum-Based Chemotherapy Resistance in Patients with Epithelial Ovarian Cancer
    Xiong, Zhuo
    Ha, Chunfang
    Li, Ruyue
    Wu, Mingyong
    Wei, Meng
    GYNECOLOGIC AND OBSTETRIC INVESTIGATION, 2024, 89 (06) : 469 - 477
  • [9] NOVEL CLASSIFICATION OF PATIENTS WITH EPITHELIAL OVARIAN CANCER USING MACHINE LEARNING TECHNOLOGY
    Paik, E. S.
    Park, J. Y.
    Kim, J. H.
    Kim, T. J.
    Choi, C. H.
    Kim, B. G.
    Bae, D. S.
    Seo, S. W.
    Lee, J. W.
    Jeong, S. Y.
    INTERNATIONAL JOURNAL OF GYNECOLOGICAL CANCER, 2018, 28 : 718 - 718
  • [10] Clinical and molecular features of platinum resistance in ovarian cancer
    Miras, Isabel
    Estevez-Garcia, Purificacion
    Munoz-Galvan, Sandra
    CRITICAL REVIEWS IN ONCOLOGY HEMATOLOGY, 2024, 201