An optimized XGBoost based diagnostic system for effective prediction of heart disease

被引:129
作者
Budholiya, Kartik [1 ]
Shrivastava, Shailendra Kumar [1 ]
Sharma, Vivek [1 ]
机构
[1] Samrat Ashok Technol Inst, Comp Sci & Engn, Vidisha, Madhya Pradesh, India
关键词
XGBoost; BayesianOptimization; Categoricalfeatureencoding; HeartDisease; Prediction; GAUSSIAN-PROCESSES;
D O I
10.1016/j.jksuci.2020.10.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Researchers have created several expert systems over the years to predict heart disease early and assist cardiologists to enhance the diagnosis process. We present a diagnostic system in this paper that utilizes an optimized XGBoost (Extreme Gradient Boosting) classifier to predict heart disease. Proper hyper-parameter tuning is essential for any classifier's successful application. To optimize the hyper-parameters of XGBoost, we used Bayesian optimization, which is a very efficient method for hyper-parameter optimization. We also used One-Hot (OH) encoding technique to encode categorical features in the dataset to improve prediction accuracy. The efficacy of the proposed model is evaluated on Cleveland heart disease dataset and compared it with Random Forest (RF) and Extra Tree (ET) classifiers. Five different evaluation metrics: accuracy, sensitivity, specificity, F1-score, and AUC (area under the curve) of ROC charts were used for performance evaluation. The experimental results showed its validity and efficacy in the prediction of heart disease. In addition, proposed model displays better performance compared to the previously suggested models. Moreover, our proposed method reaches the high prediction accuracy of 91.8%. Our results indicate that the proposed method could be used reliably to predict heart disease in the clinic. (C) 2020 The Authors. Published by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:4514 / 4523
页数:10
相关论文
共 32 条
  • [21] APPLICATION OF BAYESIAN-APPROACH TO NUMERICAL-METHODS OF GLOBAL AND STOCHASTIC OPTIMIZATION
    MOCKUS, J
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 1994, 4 (04) : 347 - 365
  • [22] Olaniyi Ebenezer Obaloluwa, 2015, International Journal of Intelligent Systems and Applications, V7, P75, DOI 10.5815/ijisa.2015.12.08
  • [23] Attribute weighting via genetic algorithms for attribute weighted artificial immune system (AWAIS) and its application to heart disease and liver disorders problems
    Ozsen, Seral
    Gunes, Salih
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (01) : 386 - 392
  • [24] Automatic detection of heart disease using an artificial immune recognition system (AIRS) with fuzzy resource allocation mechanism and k-nn (nearest neighbour) based weighting preprocessing
    Polat, Kemal
    Sahan, Seral
    Guenes, Salih
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2007, 32 (02) : 625 - 631
  • [25] Gaussian processes in machine learning
    Rasmussen, CE
    [J]. ADVANCED LECTURES ON MACHINE LEARNING, 2004, 3176 : 63 - 71
  • [26] Rasmussen CE, 2005, ADAPT COMPUT MACH LE, P1
  • [27] A web based decision support system driven by fuzzy logic for the diagnosis of typhoid fever
    Samuel, O. W.
    Omisore, M. O.
    Ojokoh, B. A.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (10) : 4164 - 4171
  • [28] An integrated decision support system based on ANN and Fuzzy_AHP for heart failure risk prediction
    Samuel, Oluwarotimi Williams
    Asogbon, Grace Mojisola
    Sangaiah, Arun Kumar
    Fang, Peng
    Li, Guanglin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 68 : 163 - 172
  • [29] Snoek J., 2012, Advances in neural information processing systems, V25
  • [30] Auto-WEKA: Combined Selection and Hyperparameter Optimization of Classification Algorithms
    Thornton, Chris
    Hutter, Frank
    Hoos, Holger H.
    Leyton-Brown, Kevin
    [J]. 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 847 - 855