Accurate Cardiovascular Disease Prediction: Leveraging Opt_hpLGBM With Dual-Tier Feature Selection

被引:0
|
作者
Gabriel, J. Jasmine [1 ]
Jani Anbarasi, L. [1 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai 600127, Tamil Nadu, India
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Solid modeling; Diseases; Heart; Predictive models; Accuracy; Support vector machines; Machine learning; Prediction algorithms; Data models; ANOVA; cardiovascular disease; chi-squared; feature selection; hyperparameter tuning; preprocessing; HEART-DISEASE; SYSTEM; MODEL; DIAGNOSIS;
D O I
10.1109/ACCESS.2024.3470537
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reliable forecasting of cardiovascular disease (CVD) outcomes is crucial for efficient patient management. While machine learning (ML) holds promise for disease prediction, challenges arise, particularly with smaller clinical datasets. Feature engineering is essential in this context, as it involves analyzing missing values, managing outliers, and addressing multicollinearity. This process is key to identifying and eliminating unnecessary features from the dataset. To tackle this, a scalable ML based Dual-Tier feature selection framework called ANOVA Chi-Squared (AnoX(2)) is proposed, utilizing a hybrid statistical method. The framework integrates validation using five different ML classifiers with the selected features from AnoX(2). The proposed model Opt_hpLGBM (Optuna hyperparameter tuned Light Gradient Boost Machine) along with AnoX(2) feature selection exhibits outstanding performance across four publicly available datasets, consistently achieving remarkable accuracy. For instance, it achieves 94.87% accuracy in the Cleveland dataset with 8 features, 95.12% in the Statlog dataset with the same number of features, 92.81% accuracy with 7 features in the heart disease dataset, and an impressive 98.85% accuracy in the z-Alizadeh Sani dataset with 12 features. These results exceed current benchmarks, establishing it as an industry leader in terms of the number of features utilized, accuracy, precision, recall, F1 score, and log loss metrics. With its potential for early diagnosis and treatment, this innovative framework can transform healthcare, significantly reducing mortality rates associated with cardiovascular disease.
引用
收藏
页码:142427 / 142448
页数:22
相关论文
共 10 条
  • [1] Efficient Prediction of Cardiovascular Disease Using Machine Learning Algorithms With Relief and LASSO Feature Selection Techniques
    Ghosh, Pronab
    Azam, Sami
    Jonkman, Mirjam
    Karim, Asif
    Shamrat, F. M. Javed Mehedi
    Ignatious, Eva
    Shultana, Shahana
    Beeravolu, Abhijith Reddy
    De Boer, Friso
    IEEE ACCESS, 2021, 9 : 19304 - 19326
  • [2] Enhanced Evolutionary Feature Selection and Ensemble Method for Cardiovascular Disease Prediction
    Jothi Prakash, V.
    Karthikeyan, N. K.
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (03) : 389 - 412
  • [3] Prediction of Cardiovascular Disease by Feature Selection and Machine Learning Techniques
    Ranade, Aditya
    Pise, Nitin
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 457 - 472
  • [4] Comparing different feature selection algorithms for cardiovascular disease prediction
    Najmul Hasan
    Yukun Bao
    Health and Technology, 2021, 11 : 49 - 62
  • [5] Comparing different feature selection algorithms for cardiovascular disease prediction
    Hasan, Najmul
    Bao, Yukun
    HEALTH AND TECHNOLOGY, 2021, 11 (01) : 49 - 62
  • [6] Enhanced Evolutionary Feature Selection and Ensemble Method for Cardiovascular Disease Prediction
    V. Jothi Prakash
    N. K. Karthikeyan
    Interdisciplinary Sciences: Computational Life Sciences, 2021, 13 : 389 - 412
  • [7] Prediction of cardiovascular disease based on multiple feature selection and improved PSO-XGBoost model
    Kerang Cao
    Chang Liu
    Siqi Yang
    Yuxin Zhang
    Lili Li
    Hoekyung Jung
    Shuo Zhang
    Scientific Reports, 15 (1)
  • [8] Multilayer Perceptron Neural Network with Arithmetic Optimization Algorithm-Based Feature Selection for Cardiovascular Disease Prediction
    Alghamdi, Fahad A.
    Almanaseer, Haitham
    Jaradat, Ghaith
    Jaradat, Ashraf
    Alsmadi, Mutasem K.
    Jawarneh, Sana
    Almurayh, Abdullah S.
    Alqurni, Jehad
    Alfagham, Hayat
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (02): : 987 - 1008
  • [9] Enhanced cardiovascular disease prediction through self-improved Aquila optimized feature selection in quantum neural network & LSTM model
    Darolia, Aman
    Chhillar, Rajender Singh
    Alhussein, Musaed
    Dalal, Surjeet
    Aurangzeb, Khursheed
    Lilhore, Umesh Kumar
    FRONTIERS IN MEDICINE, 2024, 11
  • [10] Enhanced feature selection and ensemble learning for cardiovascular disease prediction: hybrid GOL2-2 T and adaptive boosted decision fusion with babysitting refinement
    Praveen, S. Phani
    Hasan, Mohammad Kamrul
    Abdullah, Siti Norul Huda Sheikh
    Sirisha, Uddagiri
    Tirumanadham, N. S. Koti Mani Kumar
    Islam, Shayla
    Ahmed, Fatima Rayan Awad
    Ahmed, Thowiba E.
    Noboni, Ayman Afrin
    Sampedro, Gabriel Avelino
    Yeun, Chan Yeob
    Ghazal, Taher M.
    FRONTIERS IN MEDICINE, 2024, 11