Exploratory Data Analysis of Heart Disease Prediction using Machine Learning Techniques-RS Algorithm

Cited by: 0
Authors
Vibha, M. B. [1]
Sneha, S. R. [1]
Kiran, U. [1]
Kiran, Y. [1]
Affiliations
[1] Dayananda Sagar Coll Engn, Dept MCA, Bangalore, Karnataka, India
Source
2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024 | 2024
Keywords
Machine learning; heart disease; Logistic regression; Decision tree; Random Forest; Support Vector Machine; Data analysis; CARDIOVASCULAR RISK PROFILE;
DOI
10.1109/ICOICI62503.2024.10696414
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Heart disease is now very common. Machine learning-based heart disease prediction has significant potential in clinical applications, as it can enhance early diagnosis and treatment. Detecting the disease at an early stage can save lives. Accurate predictive models help identify the condition early and help patients obtain appropriate treatment. In this work, we conduct a thorough exploratory data analysis (EDA) on a dataset that includes a range of clinical characteristics associated with heart health. Features in the dataset include age, gender, type of chest discomfort, resting blood pressure, cholesterol level, and ECG readings. Our main goal is to determine how well logistic regression, a decision tree classifier, random forest, and a support vector machine can predict the occurrence of heart disease. Through EDA, this study analyses the distribution, correlation, and significance of features, gaining insight into potential risk factors associated with heart disease. Subsequently, we train and evaluate each machine learning model on the dataset, employing appropriate performance metrics to assess its predictions. In addition, this study develops a hybrid model that merges the strengths of the Random Forest and Support Vector Machine (SVM) algorithms, resulting in improved accuracy. This approach highlights the advantages of combining machine learning techniques to boost predictive reliability and consistency. Overall, the study underscores the potential of machine learning to improve early disease detection and treatment.
Pages: 209-216
Page count: 8