Data-Driven Visual Characterization of Patient Health-Status Using Electronic Health Records and Self-Organizing Maps

被引:13
作者
Chushig-Muzo, David [1 ]
Soguero-Ruiz, Cristina [1 ]
Engelbrecht, A. P. [2 ,3 ]
de Miguel Bohoyo, Pablo [4 ]
Mora-Jimenez, Inmaculada [1 ]
机构
[1] Rey Juan Carlos Univ, Dept Signal Theory & Commun Telemat & Comp Syst, Fuenlabrada 28943, Spain
[2] Stellenbosch Univ, Dept Ind Engn, ZA-7600 Stellenbosch, South Africa
[3] Stellenbosch Univ, Comp Sci Div, ZA-7600 Stellenbosch, South Africa
[4] Univ Hosp Fuenlabrada, Fuenlabrada 28943, Spain
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Self-organizing feature maps; Drugs; Prototypes; Visualization; Diabetes; Diseases; Clustering methods; Electronic health records; machine learning; self organizing maps; clustering; data visualization; chronic conditions; CLASS IMBALANCE PROBLEM; GESTATIONAL HYPERTENSION; DATA SETS; CLUSTER; CLASSIFICATION; VALIDATION; MANAGEMENT; RISK; ALGORITHMS; PREDICTION;
D O I
10.1109/ACCESS.2020.3012082
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hypertension and diabetes have become a global health and economic issue, being among the major chronic conditions worldwide, particularly in developed countries. To face this global problem, a better knowledge about these diseases becomes crucial to characterize chronic patients. Our aim is two-fold: (1) to provide an efficient visual tool for identifying clinical patterns in high-dimensional data; and (2) to characterize the patient health-status through a data-driven approach using electronic health records of healthy, hypertensive and diabetic populations. We propose a two-stage methodology that uses diagnosis and drug codes of healthy and chronic patients associated to the University Hospital of Fuenlabrada in Spain. The first stage applies the Self-Organizing Map on the aforementioned data to get a set of prototype patients which are projected onto a grid of nodes. Each node has associated a prototype patient that captures relationships among clinical characteristics. In the second stage, clustering methods are applied on the prototype patients to find groups of patients with a similar health-status. Clusters with distinctive patterns linked to chronic conditions were found, being the most remarkable highlights: a cluster of pregnant women emerged among the hypertensive population, and two clusters of diabetic individuals with significant differences in drug-therapy (insulin and non-insulin dependant). The proposed methodology showed to be effective to explore relationships within clinical data and to find patterns related to diabetes and hypertension in a visual way. Our methodology raises as a suitable alternative for building appropriate clinical groups, becoming a promising approach to be applied to any population due to its data-driven philosophy. A thorough analysis of these groups could spawn new and fruitful findings.
引用
收藏
页码:137019 / 137031
页数:13
相关论文
共 50 条
  • [21] Improvements on the visualization of clusters in geo-referenced data using Self-Organizing Maps
    Gorricha, Jorge
    Lobo, Victor
    [J]. COMPUTERS & GEOSCIENCES, 2012, 43 : 177 - 186
  • [22] Early prediction of clinical deterioration using data-driven machine-learning modeling of electronic health records
    Ruiz, Victor M.
    Goldsmith, Michael P.
    Shi, Lingyun
    Simpao, Allan F.
    Galvez, Jorge A.
    Naim, Maryam Y.
    Nadkarni, Vinay
    Gaynor, J. William
    Tsui, Fuchiang
    [J]. JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2022, 164 (01) : 211 - +
  • [23] Regionalization with Self-Organizing Maps for Sharing Higher Resolution Protected Health Information
    Krzyzanowski, Brittany
    Manson, Steven
    [J]. ANNALS OF THE AMERICAN ASSOCIATION OF GEOGRAPHERS, 2022, 112 (07) : 1866 - 1889
  • [24] Visual data mining in spatial interaction analysis with self-organizing maps
    Yan, Jun
    Thill, Jean-Claude
    [J]. ENVIRONMENT AND PLANNING B-PLANNING & DESIGN, 2009, 36 (03) : 466 - 486
  • [25] Coupling self-organizing maps with a Naive Bayesian classifier : Stream classification studies using multiple assessment data
    Fytilis, Nikolaos
    Rizzo, Donna M.
    [J]. WATER RESOURCES RESEARCH, 2013, 49 (11) : 7747 - 7762
  • [26] Data fusion using a hierarchy of self-organizing feature maps
    Knopf, GK
    [J]. SENSORS AND CONTROLS FOR INTELLIGENT MACHINING, AGILE MANUFACTURING, AND MECHATRONICS, 1998, 3518 : 6 - 16
  • [27] Clustering of regional HDI data using Self-Organizing Maps
    Ferreira Costa, Jose Alfredo
    Vieira Pinto, Antonio Paulo
    de Andrade, Joao Ribeiro
    de Medeiros, Marcial Guerra
    [J]. 2017 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2017,
  • [28] Visualization of High-Dimensional Clinically Acquired Geographic Data Using the Self-Organizing Maps
    Oyana, T. J.
    [J]. JOURNAL OF ENVIRONMENTAL INFORMATICS, 2009, 13 (01) : 33 - 44
  • [29] Seismic facies analysis from pre-stack data using self-organizing maps
    Kourki, Meysam
    Riahi, Mohammad Ali
    [J]. JOURNAL OF GEOPHYSICS AND ENGINEERING, 2014, 11 (06)
  • [30] Patient Electronic Health Data-Driven Approach to Clinical Decision Support
    Mane, Ketan K.
    Bizon, Chris
    Owen, Phillips
    Gersing, Ken
    Mostafa, Javed
    Schmitt, Charles
    [J]. CTS-CLINICAL AND TRANSLATIONAL SCIENCE, 2011, 4 (05): : 369 - 371