Ensemble of machine learning techniques to predict survival in kidney transplant recipients

被引:1
作者
Díez-Sanmartín, Covadonga [1 ]
Sarasa Cabezuelo, Antonio [1 ]
Andrés Belmonte, Amado [2 ]
机构
[1] Department of Computer Systems and Computing, School of Computer Science, Complutense University of Madrid, Madrid
[2] Nephrology Department, 12 de Octubre Hospital, Complutense University of Madrid, Madrid
关键词
Artificial intelligence; Feature selection; Kidney transplant; Machine learning; Survival analysis;
D O I
10.1016/j.compbiomed.2024.108982
中图分类号
学科分类号
摘要
Kidney transplant recipients face a high cardiovascular risk, which is a leading cause of death in this patient group. This article proposes the application of clustering techniques and feature selection to predict the survival outcomes of kidney transplant recipients based on machine learning techniques and mainstream statistical methods. First, feature selection techniques (Boruta, Random Survival Forest and Elastic Net) are used to detect the most relevant variables. Subsequently, each set of variables obtained by each feature selection technique is used as input for the clustering algorithms used (Consensus Clustering, Self-Organizing Map and Agglomerative Clustering) to determine which combination of feature selection, clustering algorithm and number of clusters maximizes intercluster variability. Next, the mechanism called False Clustering Discovery Reduction is applied to obtain the minimum number of statistically differentiable populations after applying a control metric. This metric is based on a variance test to confirm that reducing the number of clusters does not generate significant losses in the heterogeneity obtained. This approach was applied to the Organ Procurement and Transplantation Network medical dataset (n = 11,332). The combination of Random Survival Forest and consensus clustering yielded the optimal result of 4 clusters starting from 8 initial ones. Finally, for each population, Kaplan-Meier survival curves are generated to predict the survival of new patients based on the predictions of the XGBoost classifier, with an overall multi-class AUC of 98.11%. © 2024 The Authors
引用
收藏
相关论文
共 71 条
[1]  
Pannu N., Klarenbach S., Wiebe N., Manns B., Tonelli M., Renal replacement therapy in patients with acute renal failure: a systematic review, JAMA, 299, 7, pp. 793-805, (2008)
[2]  
Abbud-Filho M., Adams P.L., Alberu J., Cardella C., Chapman J., Cochat P., Cosio F., Danovitch G., Davis C., Gaston R.S., Humar A., Hunsicker L.G., Josephson M.A., Kasiske B., Kirste G., Leichtman A., Munn S., Obrador G.T., Tibell A., Delmonico F.L., A report of the Lisbon Conference on the care of the kidney transplant recipient, Transplantation, 83, 8, pp. S1-S22, (2007)
[3]  
Wolfe R.A., Ashby V.B., Milford E.L., Ojo A.O., Ettenger R.E., Agodoa L.Y., Held P.J., Port F.K., Comparison of mortality in all patients on dialysis, patients on dialysis awaiting transplantation, and recipients of a first cadaveric transplant, N. Engl. J. Med., 341, 23, pp. 1725-1730, (1999)
[4]  
(2023)
[5]  
Lentine K.L., Smith J.M., Miller J.M., Bradbrook K., Larkin L., Weiss S., Handarova D.K., Temple K., Israni A.K., Snyder J.J., OPTN/SRTR 2021 annual data report: kidney, Am. J. Transplant., 23, 2, pp. S21-S120, (2023)
[6]  
Kleinbaum D.G., Klein M., Kaplan-Meier survival curves and the log-rank test, Survival Analysis, pp. 55-96, (2012)
[7]  
Cox D.R., Regression models and life-tables, J. Roy. Stat. Soc. B, 34, 2, pp. 187-202, (1972)
[8]  
Diez-Sanmartin C., Sarasa-Cabezuelo A., Andres Belmonte A., The impact of artificial intelligence and big data on end-stage kidney disease treatments, Expert Syst. Appl., 180, (2021)
[9]  
Diez-Sanmartin C., Sarasa Cabezuelo A., Application of artificial intelligence techniques to predict survival in kidney transplantation: a review, J. Clin. Med., 9, 2, (2020)
[10]  
Moghadam P., Ahmadi A., A machine learning framework to predict kidney graft failure with class imbalance using Red Deer algorithm, Expert Syst. Appl., 210, (2022)