Feature Selection by mRMR Method for Heart Disease Diagnosis

Cited by: 10
Authors
Wang, Gaoshuai [1]
Lauri, Fabrice [2]
El Hassani, Amir Hajjam [1]
Affiliations
[1] Univ Bourgogne Franche Comte, UTBM, Lab Nanomed Imagerie Therapeut EA4662, F-90010 Belfort, France
[2] Univ Bourgogne Franche Comte, UTBM, Connaissance & Intelligence Artificielle Distribu, F-90010 Belfort, France
Source
IEEE ACCESS | 2022 / Vol. 10
Keywords
Feature extraction; Diseases; Heart; Principal component analysis; Information filters; Mutual information; Clinical diagnosis; Heart disease; feature selection; mutual information; mRMR; ALGORITHM; CLASSIFICATION; ASSOCIATION; STROKE; PCA;
DOI
10.1109/ACCESS.2022.3207492
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Heart disease has become a serious threat to human health in recent years. Without timely diagnosis and treatment, patients often suffer disability or even death. However, diagnostic accuracy depends directly on the experience of individual doctors, and the many factors associated with heart disease add to their workload, which makes the situation worse. Introducing computer-aided techniques to assist doctors in diagnosis is therefore a feasible way to improve heart disease treatment. At present, researchers usually take the processed dataset (13 features), which doctors selected from the unprocessed dataset (74 features) in the UCI Machine Learning Repository, and apply feature selection to it. This is inappropriate because the feature set is already so small. The value of the unprocessed dataset is neglected, although it could contain latent information. A comprehensive comparison is needed to demonstrate the advantages of the unprocessed dataset, and the incremental feature combination method should also be verified. Because minimum Redundancy - Maximum Relevance (mRMR) has been highly successful in feature selection, applying it as a feature filter can enhance classification accuracy. In this research, we therefore introduce mRMR as a filter for feature selection and compare it comprehensively, on several metrics, with methods such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Kendall, and Random Forest, as well as with other research works. The results show that, in most cases, the unprocessed dataset enhances the algorithms' performance. The incremental feature selection method is effective, and mRMR is superior to the other methods: it achieves the highest accuracies with the fewest supporting features, reaching 100% accuracy with 8 features on the Cleveland dataset, 98.3% accuracy with 14 features on Hungarian, and 99% accuracy with 9 features on Long-beach-VA. Furthermore, we find that some features that doctors regard as useless play a part in classification, which deserves doctors' attention.
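The abstract names mRMR as the feature filter but gives no implementation details. The following is a minimal sketch of one common greedy variant (the "difference" form: relevance minus mean redundancy) for discrete features, not the authors' implementation. The function names `mutual_information` and `mrmr_rank`, the plug-in mutual information estimator, and the toy data are all assumptions made for illustration; the paper's handling of continuous features and its exact scoring criterion are not stated here.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information, in nats, between two equal-length discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # Sum over observed pairs of p(x,y) * log(p(x,y) / (p(x) * p(y))).
    return sum(
        (c / n) * math.log(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def mrmr_rank(columns, labels, k):
    """Greedy mRMR, difference form: relevance minus mean redundancy."""
    relevance = {name: mutual_information(col, labels)
                 for name, col in columns.items()}
    selected = []
    remaining = sorted(columns)  # sorted so that ties resolve deterministically

    def score(name):
        if not selected:
            return relevance[name]
        redundancy = sum(mutual_information(columns[name], columns[s])
                         for s in selected) / len(selected)
        return relevance[name] - redundancy

    while remaining and len(selected) < k:
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical toy data (not taken from the heart disease datasets).
labels = [0, 0, 0, 0, 1, 1, 1, 1]
columns = {
    "a":      [0, 0, 0, 1, 1, 1, 1, 1],  # strongly relevant to the label
    "a_copy": [0, 0, 0, 1, 1, 1, 1, 1],  # duplicate of "a", fully redundant
    "b":      [0, 1, 0, 0, 1, 1, 0, 1],  # weakly relevant, little overlap with "a"
}
ranking = mrmr_rank(columns, labels, k=3)
print(ranking)
```

On this toy data the duplicate column is ranked last even though its relevance equals that of the top feature, which is the redundancy penalty at work. An incremental evaluation in the spirit of the abstract would then train a classifier on the first 1, 2, 3, ... ranked features and keep the smallest prefix with the best score.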
Pages: 100786 - 100796
Number of pages: 11
Related papers
50 in total
  • [31] A Joint Evolutionary Method Based on Neural Network For Feature Selection
    Zhang, Biying
    ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL I, PROCEEDINGS, 2009, : 7 - 10
  • [32] Feature extraction through parallel Probabilistic Principal Component Analysis for heart disease diagnosis
    Shah, Syed Muhammad Saqlain
    Batool, Safeera
    Khan, Imran
    Ashraf, Muhammad Usman
    Abbas, Syed Hussnain
    Hussain, Syed Adnan
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2017, 482 : 796 - 807
  • [33] A Novel Feature Selection Method for Fault Diagnosis
    Voulgaris, Zacharias
    Sconyers, Chris
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2010, 339 : 262 - 269
  • [34] PRESERVING COMMUNITY FEATURE EXTRACTION AND MRMR FEATURE SELECTION FOR LINK CLASSIFICATION IN COMPLEX NETWORKS
    Wu, Jie-Hua
    Zhou, Bei
    Shen, Jing
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2018, : 215 - 221
  • [35] MRMR-EHO-Based Feature Selection Algorithm for Regression Modelling
    Sathishkumar, V. E.
    Cho, Yongyun
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2023, 30 (02): : 574 - 583
  • [36] Stateful MapReduce Framework for mRMR Feature Selection Using Horizontal Partitioning
    Yelleti, Vivek
    Prasad, P. S. V. S. Sai
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 317 - 327
  • [37] Improved aquila optimizer with mRMR for feature selection of high-dimensional gene expression data
    Qin, Xiwen
    Zhang, Siqi
    Dong, Xiaogang
    Shi, Hongyu
    Yuan, Liping
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (09): : 13005 - 13027
  • [38] mRMR-PSO: A Hybrid Feature Selection Technique with a Multiobjective Approach for Sign Language Recognition
    Bansal, Sandhya Rani
    Wadhawan, Savita
    Goel, Rajeev
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 10365 - 10380
  • [39] Prediction of heart disease by classifying with feature selection and machine learning methods
    Gazeloglu, Cengiz
    PROGRESS IN NUTRITION, 2020, 22 (02): : 660 - 670
  • [40] Diagnosis of Breast Cancer and Diabetes using Hybrid Feature Selection Method
    Jain, Divya
    Singh, Vijendra
    2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 64 - 69