Feature Selection by mRMR Method for Heart Disease Diagnosis

被引:10
|
作者
Wang, Gaoshuai [1 ]
Lauri, Fabrice [2 ]
El Hassani, Amir Hajjam [1 ]
机构
[1] Univ Bourgogne Franche Comte, UTBM, Lab Nanomed Imagerie Therapeut EA4662, F-90010 Belfort, France
[2] Univ Bourgogne Franche Comte, UTBM, Connaissance & Intelligence Artificielle Distribu, F-90010 Belfort, France
来源
IEEE ACCESS | 2022年 / 10卷
关键词
Feature extraction; Diseases; Heart; Principal component analysis; Information filters; Mutual information; Clinical diagnosis; Heart disease; feature selection; mutual information; mRMR; ALGORITHM; CLASSIFICATION; ASSOCIATION; STROKE; PCA;
D O I
10.1109/ACCESS.2022.3207492
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heart disease has become a non-ignorable threat to human health in recent years. Once without timely diagnosis and treatment, patients often suffer disability or even death. However, the diagnosis accuracy directly relies on different doctors' experiences and various factors associated with heart disease bring heavy tasks on them make the situation worse. Therefore, to improve heart disease treatment, introducing computer-aided techniques to assist doctors in diagnosis is a feasible approach. At present, researchers usually use the processed dataset (13 features) selected by doctors from the unprocessed dataset (74 features) (UCI Machine Learning Repository) and apply the feature selection method to the dataset, it's inappropriate because the feature scale is so small. People neglect the unprocessed dataset's value and don't realize it could contain some latent information. A comprehensive comparison is needed to demonstrate the unprocessed dataset's advantages. Besides, the incremental feature combination method should be verified. As the minimum Redundancy - Maximum Relevance (mRMR) gains great success in feature selection, applying it as a feature filter can enhance classification accuracy. Thus, in this research, we introduced the mRMR method as a filter for feature selection and made a comprehensive comparison within several methods like Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Kendall, Random Forest, and other research works in several metrics. By analyzing the results, in most cases, the unprocessed dataset can enhance algorithm's performance. The incremental feature selection method is effective and the mRMR is superior to other methods. Not only does it own the highest accuracies, but also the least supportive features. It has 100% accuracy with 8 features on the Cleveland dataset, 98.3% accuracy with 14 features on Hungarian, and 99% accuracy with 9 features on Long-beach-VA, respectively. Furthermore, we find that some features, which doctors regard as useless, play a part in classification, that should attract some attention from doctors.
引用
收藏
页码:100786 / 100796
页数:11
相关论文
共 50 条
  • [1] Review of Feature Selection, Dimensionality Reduction and Classification for Chronic Disease Diagnosis
    Alhassan, Afnan M.
    Zainon, Wan Mohd Nazmee Wan
    IEEE ACCESS, 2021, 9 : 87310 - 87317
  • [2] mRMR plus : An Effective Feature Selection Algorithm for Classification
    Chowdhury, Hussain A.
    Bhattacharyya, Dhruba K.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 424 - 430
  • [3] Heart Disease Diagnosis Using Decision Trees with Feature Selection Method
    Sheta, Alaa
    El-Ashmawi, Walaa
    Baareh, Abdelkarim
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2024, 21 (03) : 427 - 438
  • [4] Analog Filter Circuits Feature Selection Using MRMR and SVM
    Sun, Yongkui
    Ma, Lei
    Qin, Na
    Zhang, Meilan
    Lv, Qianyong
    2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 1543 - 1547
  • [5] Adapt the mRMR Criterion for Unsupervised Feature Selection
    Xu, Junling
    ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 111 - 121
  • [6] The Cost-Based Feature Selection Model for Coronary Heart Disease Diagnosis System Using Deep Neural Network
    Wiharto
    Suryani, Esti
    Setyawan, Sigit
    Putra, Bintang Pe
    IEEE ACCESS, 2022, 10 : 29687 - 29697
  • [7] Improved Measures of Redundancy and Relevance for mRMR Feature Selection
    Jo, Insik
    Lee, Sangbum
    Oh, Sejong
    COMPUTERS, 2019, 8 (02)
  • [8] Feature Selection Methods in the Framework of mRMR
    Wang, Xiujuan
    Tao, Yuanrui
    Zheng, Kangfeng
    2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 1490 - 1495
  • [9] New hybrid method for heart disease diagnosis utilizing optimization algorithm in feature selection
    Jalil Nourmohammadi-Khiarak
    Mohammad-Reza Feizi-Derakhshi
    Khadijeh Behrouzi
    Samaneh Mazaheri
    Yashar Zamani-Harghalani
    Rohollah Moosavi Tayebi
    Health and Technology, 2020, 10 : 667 - 678
  • [10] New hybrid method for heart disease diagnosis utilizing optimization algorithm in feature selection
    Nourmohammadi-Khiarak, Jalil
    Feizi-Derakhshi, Mohammad-Reza
    Behrouzi, Khadijeh
    Mazaheri, Samaneh
    Zamani-Harghalani, Yashar
    Tayebi, Rohollah Moosavi
    HEALTH AND TECHNOLOGY, 2020, 10 (03) : 667 - 678