Data-Driven Visual Characterization of Patient Health-Status Using Electronic Health Records and Self-Organizing Maps

被引:14
作者
Chushig-Muzo, David [1 ]
Soguero-Ruiz, Cristina [1 ]
Engelbrecht, A. P. [2 ,3 ]
de Miguel Bohoyo, Pablo [4 ]
Mora-Jimenez, Inmaculada [1 ]
机构
[1] Rey Juan Carlos Univ, Dept Signal Theory & Commun Telemat & Comp Syst, Fuenlabrada 28943, Spain
[2] Stellenbosch Univ, Dept Ind Engn, ZA-7600 Stellenbosch, South Africa
[3] Stellenbosch Univ, Comp Sci Div, ZA-7600 Stellenbosch, South Africa
[4] Univ Hosp Fuenlabrada, Fuenlabrada 28943, Spain
关键词
Self-organizing feature maps; Drugs; Prototypes; Visualization; Diabetes; Diseases; Clustering methods; Electronic health records; machine learning; self organizing maps; clustering; data visualization; chronic conditions; CLASS IMBALANCE PROBLEM; GESTATIONAL HYPERTENSION; DATA SETS; CLUSTER; CLASSIFICATION; VALIDATION; MANAGEMENT; RISK; ALGORITHMS; PREDICTION;
D O I
10.1109/ACCESS.2020.3012082
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hypertension and diabetes have become a global health and economic issue, being among the major chronic conditions worldwide, particularly in developed countries. To face this global problem, a better knowledge about these diseases becomes crucial to characterize chronic patients. Our aim is two-fold: (1) to provide an efficient visual tool for identifying clinical patterns in high-dimensional data; and (2) to characterize the patient health-status through a data-driven approach using electronic health records of healthy, hypertensive and diabetic populations. We propose a two-stage methodology that uses diagnosis and drug codes of healthy and chronic patients associated to the University Hospital of Fuenlabrada in Spain. The first stage applies the Self-Organizing Map on the aforementioned data to get a set of prototype patients which are projected onto a grid of nodes. Each node has associated a prototype patient that captures relationships among clinical characteristics. In the second stage, clustering methods are applied on the prototype patients to find groups of patients with a similar health-status. Clusters with distinctive patterns linked to chronic conditions were found, being the most remarkable highlights: a cluster of pregnant women emerged among the hypertensive population, and two clusters of diabetic individuals with significant differences in drug-therapy (insulin and non-insulin dependant). The proposed methodology showed to be effective to explore relationships within clinical data and to find patterns related to diabetes and hypertension in a visual way. Our methodology raises as a suitable alternative for building appropriate clinical groups, becoming a promising approach to be applied to any population due to its data-driven philosophy. A thorough analysis of these groups could spawn new and fruitful findings.
引用
收藏
页码:137019 / 137031
页数:13
相关论文
共 50 条
[31]   Patient Electronic Health Data-Driven Approach to Clinical Decision Support [J].
Mane, Ketan K. ;
Bizon, Chris ;
Owen, Phillips ;
Gersing, Ken ;
Mostafa, Javed ;
Schmitt, Charles .
CTS-CLINICAL AND TRANSLATIONAL SCIENCE, 2011, 4 (05) :369-371
[32]   Multiclass fMRI data decoding and visualization using supervised self-organizing maps [J].
Hausfeld, Lars ;
Valente, Giancarlo ;
Formisano, Elia .
NEUROIMAGE, 2014, 96 :54-66
[33]   Structural Health Monitoring with Self-Organizing Maps and Artificial Neural Networks [J].
Avci, Onur ;
Abdeljaber, Osama ;
Kiranyaz, Serkan ;
Inman, Daniel .
TOPICS IN MODAL ANALYSIS & TESTING, VOL 8, 2020, :237-246
[34]   Data-driven identification of heart failure disease states and progression pathways using electronic health records [J].
Nagamine, Tasha ;
Gillette, Brian ;
Kahoun, John ;
Burghaus, Rolf ;
Lippert, Jorg ;
Saxena, Mayur .
SCIENTIFIC REPORTS, 2022, 12 (01)
[35]   Data-driven discovery of seasonally linked diseases from an Electronic Health Records system [J].
Rachel D Melamed ;
Hossein Khiabanian ;
Raul Rabadan .
BMC Bioinformatics, 15
[36]   Data-driven identification of ageing-related diseases from electronic health records [J].
Kuan, Valerie ;
Fraser, Helen C. ;
Hingorani, Melanie ;
Denaxas, Spiros ;
Gonzalez-Izquierdo, Arturo ;
Direk, Kenan ;
Nitsch, Dorothea ;
Mathur, Rohini ;
Parisinos, Constantinos A. ;
Lumbers, R. Thomas ;
Sofat, Reecha ;
Wong, Ian C. K. ;
Casas, Juan P. ;
Thornton, Janet M. ;
Hemingway, Harry ;
Partridge, Linda ;
Hingorani, Aroon D. .
SCIENTIFIC REPORTS, 2021, 11 (01)
[37]   Clustering and Analyzing Embedded Software Development Projects Data Using Self-Organizing Maps [J].
Iwata, Kazunori ;
Nakashima, Toyoshiro ;
Anan, Yoshiyuki ;
Ishii, Naohiro .
SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS 2011, 2012, 377 :47-+
[38]   Using self-organizing maps to visualize high-dimensional data [J].
Penn, BS .
COMPUTERS & GEOSCIENCES, 2005, 31 (05) :531-544
[39]   Analysis and visualization of gene expression data using Self-Organizing Maps [J].
Nikkilä, J ;
Törönen, P ;
Kaski, S ;
Venna, J ;
Castrén, E ;
Wong, G .
NEURAL NETWORKS, 2002, 15 (8-9) :953-966
[40]   Analysis of temporal data of book sale using self-organizing maps [J].
Chen Zeng-qiang ;
Chen Yi-di ;
Yuan Zhu-zhi ;
Zhang Jian-hua .
PROCEEDINGS OF 2005 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1 AND 2, 2005, :1087-+