Feature Selection by mRMR Method for Heart Disease Diagnosis

被引:10
|
作者
Wang, Gaoshuai [1 ]
Lauri, Fabrice [2 ]
El Hassani, Amir Hajjam [1 ]
机构
[1] Univ Bourgogne Franche Comte, UTBM, Lab Nanomed Imagerie Therapeut EA4662, F-90010 Belfort, France
[2] Univ Bourgogne Franche Comte, UTBM, Connaissance & Intelligence Artificielle Distribu, F-90010 Belfort, France
来源
IEEE ACCESS | 2022年 / 10卷
关键词
Feature extraction; Diseases; Heart; Principal component analysis; Information filters; Mutual information; Clinical diagnosis; Heart disease; feature selection; mutual information; mRMR; ALGORITHM; CLASSIFICATION; ASSOCIATION; STROKE; PCA;
D O I
10.1109/ACCESS.2022.3207492
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heart disease has become a non-ignorable threat to human health in recent years. Once without timely diagnosis and treatment, patients often suffer disability or even death. However, the diagnosis accuracy directly relies on different doctors' experiences and various factors associated with heart disease bring heavy tasks on them make the situation worse. Therefore, to improve heart disease treatment, introducing computer-aided techniques to assist doctors in diagnosis is a feasible approach. At present, researchers usually use the processed dataset (13 features) selected by doctors from the unprocessed dataset (74 features) (UCI Machine Learning Repository) and apply the feature selection method to the dataset, it's inappropriate because the feature scale is so small. People neglect the unprocessed dataset's value and don't realize it could contain some latent information. A comprehensive comparison is needed to demonstrate the unprocessed dataset's advantages. Besides, the incremental feature combination method should be verified. As the minimum Redundancy - Maximum Relevance (mRMR) gains great success in feature selection, applying it as a feature filter can enhance classification accuracy. Thus, in this research, we introduced the mRMR method as a filter for feature selection and made a comprehensive comparison within several methods like Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Kendall, Random Forest, and other research works in several metrics. By analyzing the results, in most cases, the unprocessed dataset can enhance algorithm's performance. The incremental feature selection method is effective and the mRMR is superior to other methods. Not only does it own the highest accuracies, but also the least supportive features. It has 100% accuracy with 8 features on the Cleveland dataset, 98.3% accuracy with 14 features on Hungarian, and 99% accuracy with 9 features on Long-beach-VA, respectively. Furthermore, we find that some features, which doctors regard as useless, play a part in classification, that should attract some attention from doctors.
引用
收藏
页码:100786 / 100796
页数:11
相关论文
共 50 条
  • [21] Automated Epileptic Seizure Detection Method Based on the Multi-attribute EEG Feature Pool and mRMR Feature Selection Method
    Miao, Bo
    Guan, Junling
    Zhang, Liangliang
    Meng, Qingfang
    Zhang, Yulin
    COMPUTATIONAL SCIENCE - ICCS 2019, PT III, 2019, 11538 : 45 - 59
  • [22] Applying Feature Selection and Weight Optimization Techniques to Enhance Artificial Neural Network for Heart Disease Diagnosis
    Khan, Younas
    Qamar, Usman
    Asad, Muhammad
    Zeb, Babar
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2020, 1037 : 340 - 351
  • [23] Segmentation-based linear discriminant analysis with information theoretic feature selection for hyperspectral image classification
    Afjal, Masud Ibn
    Mondal, Md. Nazrul Islam
    Al Mamun, Md.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (11) : 3412 - 3455
  • [24] EFS-MI: an ensemble feature selection method for classification An ensemble feature selection method
    Hoque, Nazrul
    Singh, Mihir
    Bhattacharyya, Dhruba K.
    COMPLEX & INTELLIGENT SYSTEMS, 2018, 4 (02) : 105 - 118
  • [25] A new feature extraction and selection scheme for hybrid fault diagnosis of gearbox
    Li, Bing
    Zhang, Pei-lin
    Tian, Hao
    Mi, Shuang-shan
    Liu, Dong-sheng
    Ren, Guo-quan
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (08) : 10000 - 10009
  • [26] Exploring feature selection and classification methods for predicting heart disease
    Spencer, Robinson
    Thabtah, Fadi
    Abdelhamid, Neda
    Thompson, Michael
    DIGITAL HEALTH, 2020, 6
  • [27] Univariate and Multivariate Filter Feature Selection for Heart Disease Classification
    Benhar, Houda
    Hosni, Mohamed
    Idri, Ali
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2022, 38 (04) : 791 - 803
  • [28] Machinery Fault Diagnosis Scheme Using Redefined Dimensionless Indicators and mRMR Feature Selection
    Hu, Qin
    Si, Xiao-Sheng
    Qin, Ai-Song
    Lv, Yun-Rong
    Zhang, Qing-Hua
    IEEE ACCESS, 2020, 8 : 40313 - 40326
  • [29] COMB: A Hybrid Method for Cross-validated Feature Selection
    Thejas, G. S.
    Jimenez, Daniel
    Iyengar, S. S.
    Miller, Jerry
    Sunitha, N. R.
    Badrinath, Prajwal
    ACMSE 2020: PROCEEDINGS OF THE 2020 ACM SOUTHEAST CONFERENCE, 2020, : 100 - 106
  • [30] Feature Selection Method Based on Differential Correlation Information Entropy
    Wang, Xiujuan
    Yan, Yixuan
    Ma, Xiaoyue
    NEURAL PROCESSING LETTERS, 2020, 52 (02) : 1339 - 1358