Random survival forest algorithm for risk stratification and survival prediction in gastric neuroendocrine neoplasms

被引:5
作者
Liao, Tianbao [1 ]
Su, Tingting [2 ]
Lu, Yang [5 ]
Huang, Lina [3 ]
Wei, Wei-Yuan [4 ]
Feng, Lu-Huai [3 ]
机构
[1] Youjiang Med Univ Nationalities, Dept Presidents Off, Baise, Peoples R China
[2] Peoples Hosp Guangxi Zhuang Autonomous Reg, Dept ECG Diagnost, Nanning, Peoples R China
[3] Guangxi Med Univ, Affiliated Tumor Hosp, Dept Endocrinol & Metab Nephrol, Nanning, Peoples R China
[4] Guangxi Med Univ, Affiliated Tumor Hosp, Dept Gastr & Abdominal Tumor Surg, Nanning, Peoples R China
[5] Guangxi Med Univ, Affiliated Tumor Hosp, Dept Int Med, Nanning, Peoples R China
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Machine learning; Leave-one-out cross-validation method; Prognostic model; Risk stratification; PROGNOSIS; NOMOGRAM; TUMOR;
D O I
10.1038/s41598-024-77988-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study aimed to construct and assess a machine-learning algorithm designed to forecast survival rates and risk stratification for patients with gastric neuroendocrine neoplasms (gNENs) after diagnosis. Data on patients with gNENs were extracted and randomly divided into training and validation sets using the Surveillance, Epidemiology, and End Results database. We developed a prediction model using 10 machine learning algorithms across 101 combinations to forecast cancer-related mortality in patients with gNENs, selecting the best model using the highest mean over a sequence of time-dependent area under the receiver operating characteristic (ROC) curve (AUC). The performance of the final model was assessed through time-dependent ROC curves for discrimination and calibration curves for calibration. The maximum selection rank method was used to determine the best prognostic risk score threshold for classifying patients into high- and low-risk groups. Afterward, Kaplan-Meier analysis and log-rank test were used to compare survival rates among these groups. Our study examined 775 patients with gNENs, dividing them into training and validation sets. A training set comprised 543 patients, with a median follow-up of 42 months and cumulative mortality rates of 40.0% at 1 year, 48.6% at 3 years, and 54.0% at 5 years. A validation set comprised 232 patients, with cumulative mortality rates of 29.1% at 1 year, 43.5% at 3 years, and 53.2% at 5 years. The optimal random survival forest (RSF) model (mtry = 4, node size = 5) achieved an AUC of 0.839 for survival prediction in the training set. Comprising 11 variables such as demographics, treatment details, tumor characteristics, T staging, N staging, and M staging, the RSF model revealed high predictive accuracy with AUCs of 0.92, 0.96, and 0.96 for 1-, 3-, and 5-year survival, respectively, which was consistently reflected in the validation set with AUCs of 0.88, 0.92, and 0.89, respectively. Moreover, patients were risk-stratified. Although our RSF model effectively stratified patients into different prognostic groups, it needs external validation to confirm its utility for noninvasive prognostic prediction and risk stratification in gNENs. Further research is required to verify its broader clinical applicability.
引用
收藏
页数:11
相关论文
共 50 条
[11]   Prediction of prognosis in elderly patients with sepsis based on machine learning (random survival forest) [J].
Luming Zhang ;
Tao Huang ;
Fengshuo Xu ;
Shaojin Li ;
Shuai Zheng ;
Jun Lyu ;
Haiyan Yin .
BMC Emergency Medicine, 22
[12]   Prediction of prognosis in elderly patients with sepsis based on machine learning (random survival forest) [J].
Zhang, Luming ;
Huang, Tao ;
Xu, Fengshuo ;
Li, Shaojin ;
Zheng, Shuai ;
Lyu, Jun ;
Yin, Haiyan .
BMC EMERGENCY MEDICINE, 2022, 22 (01)
[13]   Prognostic risk factor of major salivary gland carcinomas and survival prediction model based on random survival forests [J].
Chen, Yufan ;
Li, Guoli ;
Jiang, Wenmei ;
Nie, Rong Cheng ;
Deng, Honghao ;
Chen, Yingle ;
Li, Hao ;
Chen, Yanfeng .
CANCER MEDICINE, 2023, 12 (09) :10899-10907
[14]   Nomogram Individually Predicts the Overall Survival of Patients with Gastroenteropancreatic Neuroendocrine Neoplasms [J].
Wei, W. ;
Cheng, F. ;
Jie, C. ;
Zhiwei, Z. ;
Ye, C. ;
Yong, L. ;
Jian, S. .
NEUROENDOCRINOLOGY, 2017, 105 :100-100
[15]   Nomogram individually predicts the overall survival of patients with gastroenteropancreatic neuroendocrine neoplasms [J].
Fang, Cheng ;
Wang, Wei ;
Feng, Xingyu ;
Sun, Jian ;
Zhang, Yu ;
Zeng, Yujie ;
Wang, Junjiang ;
Chen, Huishan ;
Cai, Muyan ;
Lin, Junzhong ;
Chen, Minhu ;
Chen, Ye ;
Li, Yong ;
Li, Shengping ;
Chen, Jie ;
Zhou, Zhiwei .
BRITISH JOURNAL OF CANCER, 2017, 117 (10) :1544-1550
[16]   Identification of new biomarkers associated with prognosis of pancreatic neuroendocrine neoplasms and establishment of survival prediction model [J].
Yanling, X. ;
Pin, Y. ;
Mujie, Y. ;
Jianan, B. ;
Danyang, G. ;
Jinhao, C. ;
Chunhua, H. ;
Feiyu, L. ;
Qiyun, T. .
JOURNAL OF NEUROENDOCRINOLOGY, 2024, 36 :175-175
[17]   A novel nomogram and risk stratification system predicting the cancer-specific survival of patients with gastric neuroendocrine carcinoma: a study based on SEER database and external validation [J].
Xue Song ;
Yangyang Xie ;
Yafang Lou .
BMC Gastroenterology, 23
[18]   A novel nomogram and risk stratification system predicting the cancer-specific survival of patients with gastric neuroendocrine carcinoma: a study based on SEER database and external validation [J].
Song, Xue ;
Xie, Yangyang ;
Lou, Yafang .
BMC GASTROENTEROLOGY, 2023, 23 (01)
[19]   Development and validation of a nomogram for predicting the overall survival of patients with gastroenteropancreatic neuroendocrine neoplasms [J].
Xie, Si ;
Li, Lei ;
Wang, Xiaotong ;
Li, Lequn .
MEDICINE, 2021, 100 (02) :E24223
[20]   Development and validation of a novel nomogram for predicting survival rate in pancreatic neuroendocrine neoplasms [J].
Liao, Tianbao ;
Su, Tingting ;
Huang, Lina ;
Li, Bixun ;
Feng, Lu-Huai .
SCANDINAVIAN JOURNAL OF GASTROENTEROLOGY, 2022, 57 (01) :85-90