Novel statistically equivalent signature- based hybrid feature selection and ensemble deep learning LSTM and GRU for chronic kidney disease classification

被引:1
作者
Yogesh, N. [1 ,2 ]
Shrinivasacharya, Purohit [1 ,2 ]
Naik, Nagaraj [3 ]
机构
[1] Siddaganga Inst Technol, Tumkuru, Karanataka, India
[2] Visvesveraya Technol Univ, Belagavi, India
[3] Manipal Acad Higher Educ MAHE, Manipal Inst Technol, Comp Sci & Engn, Manipal, India
关键词
Hybrid feature selection; LSTM; GRU; Chronic kidney disease classification; DIAGNOSIS;
D O I
10.7717/peerj-cs.2467
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chronic kidney disease (CKD) involves numerous variables, but only a few significantly impact the classification task. The statistically equivalent signature (SES) method, inspired by constraint-based learning of Bayesian networks, is employed to identify essential features in CKD. Unlike conventional feature selection methods, which typically focus on a single set of features with the highest predictive potential, the SES method can identify multiple predictive feature subsets with similar performance. However, most feature selection (FS) classifiers perform suboptimally with strongly correlated data. The FS approach faces challenges in identifying crucial features and selecting the most effective classifier, particularly in high-dimensional data. This study proposes using the Least Absolute Shrinkage and Selection Operator (LASSO) in conjunction with the SES method for feature selection in CKD identification. Following this, an ensemble deep-learning model combining long short-term memory (LSTM) and gated recurrent unit (GRU) networks is proposed for CKD classification. The features selected by the hybrid feature selection method are fed into the ensemble deeplearning model. The model's performance is evaluated using accuracy, precision, recall, and F1 score metrics. The experimental results are compared with individual classifiers, including decision tree (DT), Random Forest (RF), logistic regression (LR), and support vector machine (SVM). The findings indicate a 2% improvement in classification accuracy when using the proposed hybrid feature selection method combined with the LSTM and GRU ensemble deep-learning model. Further analysis reveals that certain features, such as HEMO, POT, bacteria, and coronary artery disease, contribute minimally to the classification task. Future research could explore additional feature selection methods, including dynamic feature selection that adapts to evolving datasets and incorporates clinical knowledge to enhance CKD classification accuracy further.
引用
收藏
页数:24
相关论文
共 34 条
  • [1] Alassaf RA, 2018, IEEE INT CONF INNOV, P99, DOI 10.1109/INNOVATIONS.2018.8606040
  • [2] Early Prediction of Chronic Kidney Disease Using Machine Learning Supported by Predictive Analytics
    Aljaaf, Ahmed J.
    Al-Jumeily, Dhiya
    Haglan, Hussein M.
    Alloghani, Mohamed
    Baker, Thar
    Hussain, Abir J.
    Mustafina, Jamila
    [J]. 2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 251 - 259
  • [3] Enhancing the Early Detection of Chronic Kidney Disease: A Robust Machine Learning Model
    Arif, Muhammad Shoaib
    Mukheimer, Aiman
    Asif, Daniyal
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (03)
  • [4] Machine learning to predict end stage kidney disease in chronic kidney disease
    Bai, Qiong
    Su, Chunyan
    Tang, Wen
    Li, Yike
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] RNN-based longitudinal analysis for diagnosis of Alzheimer's disease
    Cui, Ruoxuan
    Liu, Manhua
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2019, 73 : 1 - 10
  • [7] A Machine Learning Method with Filter-Based Feature Selection for Improved Prediction of Chronic Kidney Disease
    Ebiaredoh-Mienye, Sarah A.
    Swart, Theo G.
    Esenogho, Ebenezer
    Mienye, Ibomoiye Domor
    [J]. BIOENGINEERING-BASEL, 2022, 9 (08):
  • [8] Fahimifar S., 2022, Qual Quant, DOI [10.1007/s11135-022-01480-z, DOI 10.1007/S11135-022-01480-Z]
  • [9] Development and External Validation of a Machine Learning Model for Progression of CKD
    Ferguson, Thomas
    Ravani, Pietro
    Sood, Manish M.
    Clarke, Alix
    Komenda, Paul
    Rigatto, Claudio
    Tangri, Navdeep
    [J]. KIDNEY INTERNATIONAL REPORTS, 2022, 7 (08): : 1772 - 1781
  • [10] A Comparison of Power Quality Disturbance Detection and Classification Methods Using CNN, LSTM and CNN-LSTM
    Garcia, Carlos Iturrino
    Grasso, Francesco
    Luchetta, Antonio
    Piccirilli, Maria Cristina
    Paolucci, Libero
    Talluri, Giacomo
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 22