Integrating Correlation-Based Feature Selection and Clustering for Improved Cardiovascular Disease Diagnosis

Cited by: 46
Authors
Wosiak, Agnieszka [1]
Zakrzewska, Danuta [1]
Affiliations
[1] Lodz Univ Technol, Inst Informat Technol, PL-90924 Lodz, Poland
Keywords
CORONARY-ARTERY-DISEASE; ALGORITHM
DOI
10.1155/2018/2520706
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Given the growing burden of heart disease, efficient diagnosis is of great importance. Statistical inference is the tool most physicians use for diagnosis, though in many cases it is not powerful enough. Clustering patient instances makes it possible to identify groups for which statistical models can be built more efficiently. However, the performance of such an approach depends on the features used as clustering attributes. This paper considers a methodology that combines unsupervised feature selection with clustering to improve the performance of statistical analysis. We assume that the attribute sets used in the clustering and statistical analysis phases should be different and uncorrelated. The method therefore selects reversed correlated features as the attributes for cluster analysis. The proposed methodology has been verified by experiments on three real datasets of cardiovascular cases. The results have been evaluated in terms of the number of detected dependencies between parameters. The experiments showed the advantage of the presented approach over other feature selection methods and over statistical inference performed without clustering.
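The pipeline described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify the selection rule, the clustering algorithm, or the dependency test. The sketch assumes that "reversed correlated features" means the features with the lowest mean absolute Pearson correlation to the others, uses a plain k-means, and counts a "dependency" as a feature pair whose |r| reaches a threshold. All function names and parameters are hypothetical.

```python
import numpy as np


def select_least_correlated(X, n_select):
    """Indices of the n_select columns with the lowest mean |r| to the others.

    Assumption: a stand-in for the 'reversed correlated' selection rule.
    """
    corr = np.abs(np.corrcoef(X, rowvar=False))
    np.fill_diagonal(corr, 0.0)
    mean_corr = corr.sum(axis=0) / (X.shape[1] - 1)
    return np.sort(np.argsort(mean_corr, kind="stable")[:n_select])


def kmeans(X, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns one cluster label per row."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    labels = np.zeros(len(X), dtype=int)
    for _ in range(n_iter):
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels


def count_dependencies(X, threshold=0.5):
    """Number of column pairs with |Pearson r| >= threshold.

    Assumption: a threshold on |r| replaces a formal significance test.
    """
    if X.shape[0] < 3 or X.shape[1] < 2:
        return 0
    corr = np.abs(np.corrcoef(X, rowvar=False))
    upper = np.triu_indices(X.shape[1], k=1)
    return int(np.sum(corr[upper] >= threshold))


def analyse(X, n_cluster_features, k, threshold=0.5):
    """Cluster on weakly correlated features, then count dependencies
    among the remaining features separately inside each cluster."""
    cluster_idx = select_least_correlated(X, n_cluster_features)
    analysis_idx = np.setdiff1d(np.arange(X.shape[1]), cluster_idx)
    Z = X[:, cluster_idx]
    std = Z.std(axis=0)
    std[std == 0] = 1.0  # avoid division by zero for constant columns
    Z = (Z - Z.mean(axis=0)) / std
    labels = kmeans(Z, k)
    per_cluster = {
        int(j): count_dependencies(X[labels == j][:, analysis_idx], threshold)
        for j in np.unique(labels)
    }
    return cluster_idx, analysis_idx, labels, per_cluster
```

Comparing the per-cluster counts with `count_dependencies` applied to the whole dataset gives a rough analogue of the evaluation the abstract describes, where results are judged by the number of dependencies detected between parameters.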
Pages: 11