A Robust Chronic Kidney Disease Classifier Using Machine Learning

被引:25
作者
Swain, Debabrata [1 ]
Mehta, Utsav [1 ]
Bhatt, Ayush [1 ]
Patel, Hardeep [1 ]
Patel, Kevin [1 ]
Mehta, Devanshu [1 ]
Acharya, Biswaranjan [2 ]
Gerogiannis, Vassilis C. [3 ]
Kanavos, Andreas [4 ]
Manika, Stella [5 ]
机构
[1] Pandit Deendayal Energy Univ, Comp Sci & Engn Dept, Gandhinagar 382007, India
[2] Marwadi Univ, Dept Comp Engn AI, Rajkot 360003, India
[3] Univ Thessaly, Dept Digital Syst, Larisa 41500, Greece
[4] Ionian Univ, Dept Informat, Corfu 49100, Greece
[5] Univ Thessaly, Dept Planning & Reg Dev, Volos 38334, Greece
关键词
chronic kidney disease; data balancing; hyperparameter tuning; machine learning; SMOTE; supervised learning;
D O I
10.3390/electronics12010212
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clinical support systems are affected by the issue of high variance in terms of chronic disorder prognosis. This uncertainty is one of the principal causes for the demise of large populations around the world suffering from some fatal diseases such as chronic kidney disease (CKD). Due to this reason, the diagnosis of this disease is of great concern for healthcare systems. In such a case, machine learning can be used as an effective tool to reduce the randomness in clinical decision making. Conventional methods for the detection of chronic kidney disease are not always accurate because of their high degree of dependency on several sets of biological attributes. Machine learning is the process of training a machine using a vast collection of historical data for the purpose of intelligent classification. This work aims at developing a machine-learning model that can use a publicly available data to forecast the occurrence of chronic kidney disease. A set of data preprocessing steps were performed on this dataset in order to construct a generic model. This set of steps includes the appropriate imputation of missing data points, along with the balancing of data using the SMOTE algorithm and the scaling of the features. A statistical technique, namely, the chi-squared test, is used for the extraction of the least-required set of adequate and highly correlated features to the output. For the model training, a stack of supervised-learning techniques is used for the development of a robust machine-learning model. Out of all the applied learning techniques, support vector machine (SVM) and random forest (RF) achieved the lowest false-negative rates and test accuracy, equal to 99.33% and 98.67%, respectively. However, SVM achieved better results than RF did when validated with 10-fold cross-validation.
引用
收藏
页数:13
相关论文
共 32 条
  • [21] Nishat MM., 2021, EAI Endorsed Transac Pervasive Health Technol, V7, pe1, DOI [10.4108/eai.13-8-2021.170671, DOI 10.4108/EAI.13-8-2021.170671]
  • [22] COVID-19 in CKD patients: Report from India
    Pawar, Nikita
    Tiwari, Vaibhav
    Gupta, Anurag
    Bhargava, Vinant
    Malik, Manish
    Gupta, Ashwani
    Bhalla, Anil Kumar
    Rana, D. S.
    [J]. INDIAN JOURNAL OF NEPHROLOGY, 2021, 31 (06) : 524 - 530
  • [23] Reshma S., 2020, Int J Eng Res, V9, P548, DOI [10.17577/IJERTV9IS070092, DOI 10.17577/IJERTV9IS070092]
  • [24] Revathy S., 2019, Int. J. Eng. Adv. Technol. (IJEAT), V9, P6364, DOI [10.35940/ijeat.E9444.119319, DOI 10.35940/IJEAT.E9444.119319]
  • [25] Diagnosis of Chronic Kidney Disease Using Effective Classification Algorithms and Recursive Feature Elimination Techniques
    Senan, Ebrahime Mohammed
    Al-Adhaileh, Mosleh Hmoud
    Alsaade, Fawaz Waselallah
    Aldhyani, Theyazn H. H.
    Alqarni, Ahmed Abdullah
    Alsharif, Nizar
    Uddin, M. Irfan
    Alahmadi, Ahmed H.
    Jadhav, Mukti E.
    Alzahrani, Mohammed Y.
    [J]. JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [26] Shankar S., 2020, INT RES J ENG TECHNO, V7, P4536
  • [27] Swain D., 2019, INT J RECENT TECHNOL, V8
  • [28] Swain D., 2019, International Journal of Innovative Technology and Exploring Engineering (IJITEE), V8, P689
  • [29] Swain D, 2021, INT J COMPUT SCI MAT, V14, P397
  • [30] Swain Debabrata., 2018, Prediction of H1B Visa Using Machine Learning Algorithms, P1, DOI DOI 10.1109/ICACAT.2018.8933603