Integrating Correlation-Based Feature Selection and Clustering for Improved Cardiovascular Disease Diagnosis

被引:46
作者
Wosiak, Agnieszka [1 ]
Zakrzewska, Danuta [1 ]
机构
[1] Lodz Univ Technol, Inst Informat Technol, PL-90924 Lodz, Poland
关键词
CORONARY-ARTERY-DISEASE; ALGORITHM;
D O I
10.1155/2018/2520706
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Based on the growing problem of heart diseases, their efficient diagnosis is of great importance to the modern world. Statistical inference is the tool that most physicians use for diagnosis, though in many cases it does not appear powerful enough. Clustering of patient instances allows finding out groups for which statistical models can be built more efficiently. However, the performance of such an approach depends on the features used as clustering attributes. In this paper, the methodology that consists of combining unsupervised feature selection and grouping to improve the performance of statistical analysis is considered. We assume that the set of attributes used in clustering and statistical analysis phases should be different and not correlated. Thus, the method consisting of selecting reversed correlated features as attributes of cluster analysis is considered. The proposed methodology has been verified by experiments done on three real datasets of cardiovascular cases. The obtained effects have been evaluated regarding the number of detected dependencies between parameters. Experiment results showed the advantage of the presented approach compared to other feature selection methods and without using clustering to support statistical inference.
引用
收藏
页数:11
相关论文
共 28 条
  • [1] Coronary artery disease detection using computational intelligence methods
    Alizadehsani, Roohallah
    Zangooei, Mohammad Hossein
    Hosseini, Mohammad Javad
    Habibi, Jafar
    Khosravi, Abbas
    Roshanzamir, Mohamad
    Khozeimeh, Fahime
    Sarrafzadegan, Nizal
    Nahavandi, Saeid
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 109 : 187 - 197
  • [2] A data mining approach for diagnosis of coronary artery disease
    Alizadehsani, Roohallah
    Habibi, Jafar
    Hosseini, Mohammad Javad
    Mashayekhi, Hoda
    Boghrati, Reihane
    Ghandeharioun, Asma
    Bahadorian, Behdad
    Sani, Zahra Alizadeh
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2013, 111 (01) : 52 - 61
  • [3] MEASUREMENT IN MEDICINE - THE ANALYSIS OF METHOD COMPARISON STUDIES
    ALTMAN, DG
    BLAND, JM
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1983, 32 (03) : 307 - 317
  • [4] Amin S. U., 2008, INT J ADV RES COMPUT, V2, P218
  • [5] [Anonymous], 1996, GLOBAL BURDEN DIS IN
  • [6] [Anonymous], 1999, CORRELATION BASED FE
  • [7] [Anonymous], 2003, APPL STAT BEHAV SCI
  • [8] Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm
    Arabasadi, Zeinab
    Alizadehsani, Roohallah
    Roshanzamir, Mohamad
    Moosaei, Hossein
    Yarifard, Ali Asghar
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 141 : 19 - 26
  • [9] Mechanical Properties of Human Coronary Arteries
    Claes, E.
    Atienza, J. M.
    Guinea, G. V.
    Rojo, F. J.
    Bernal, J. M.
    Revuelta, J. M.
    Elices, M.
    [J]. 2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 3792 - 3795
  • [10] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38