A fuzzy-based medical system for pattern mining in a distributed environment: Application to diagnostic and co-morbidity

Cited by: 17
Authors
Fernandez-Basso, Carlos [1]
Gutierrez-Batista, Karel [1]
Morcillo-Jimenez, Roberto [1]
Vila, Maria-Amparo [1]
Martin-Bautista, Maria J. [1]
Affiliations
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada 18071, Spain
Keywords
Association rules; Fuzzy logic; Data mining; Medical records; ASSOCIATION RULES; ENERGY EFFICIENCY; CLASSIFICATION; INFORMATION; ALGORITHMS; FRAMEWORK; ONTOLOGY; NETWORK
DOI
10.1016/j.asoc.2022.108870
Chinese Library Classification (CLC) number
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the extraction of hidden knowledge from medical records using data mining techniques, such as association rules, in conjunction with fuzzy logic in a distributed environment. A major challenge in this area is that, although many studies analyse health data, most focus on classification, prediction or knowledge extraction, and very few address the interpretability of the data and of the hidden patterns within it. As a result, end users find the results hard to interpret or understand, either because black-box algorithms are used or because the nature of the data is not represented correctly. The analysis must therefore cover not only knowledge extraction but also the transformation and processing of the data, so that its nature is modelled more faithfully. Techniques such as association rule mining and fuzzy logic help to improve the interpretability of the data and to handle the uncertainty inherent in real-world data. To this end, we propose a system that automatically: a) pre-processes the database, transforming and adapting the data for the data mining process and enriching it to generate more interesting patterns; b) fuzzifies the medical database to represent and analyse real-world medical data with its inherent uncertainty; c) discovers interrelations and patterns amongst different features (diagnostic, hospital discharge, etc.); and d) visualizes the results efficiently to facilitate the analysis and improve the interpretability of the extracted information. The proposed system yields a significant increase in the comprehension and interpretability of medical data for end users, allowing them to analyse the data correctly and make the right decisions.
We present a practical case using two health-related datasets to demonstrate the feasibility of the proposal on real data. © 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
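Steps b) and c) of the abstract can be illustrated with a minimal single-machine sketch. It is not the authors' implementation: the trapezoidal age labels, the record layout, the helper names (`trapezoid`, `fuzzify`, `fuzzy_support`, `fuzzy_confidence`) and the use of the minimum as the conjunction operator are all assumptions chosen for illustration, and the toy records are invented. The sketch fuzzifies a numeric attribute into linguistic labels and then measures the support and confidence of a fuzzy association rule.

```python
from typing import Dict, List

# Hypothetical linguistic labels for age, as trapezoids (a, b, c, d).
AGE_LABELS = {
    "young": (0, 0, 30, 45),
    "middle": (30, 45, 60, 70),
    "old": (60, 70, 120, 120),
}

def trapezoid(x: float, a: float, b: float, c: float, d: float) -> float:
    """Membership degree: 1 on [b, c], 0 outside (a, d), linear in between."""
    if b <= x <= c:
        return 1.0
    if x <= a or x >= d:
        return 0.0
    if x < b:
        return (x - a) / (b - a)
    return (d - x) / (d - c)

def fuzzify(record: dict) -> Dict[str, float]:
    """Turn one record into a fuzzy transaction: item -> membership degree."""
    items = {
        f"age={label}": trapezoid(record["age"], *params)
        for label, params in AGE_LABELS.items()
    }
    # Crisp diagnoses are treated as items with full membership.
    for dx in record["dx"]:
        items[dx] = 1.0
    return items

def fuzzy_support(transactions: List[Dict[str, float]], itemset: List[str]) -> float:
    """Average, over all transactions, of the minimum degree of the itemset's items."""
    total = sum(min(t.get(item, 0.0) for item in itemset) for t in transactions)
    return total / len(transactions)

def fuzzy_confidence(transactions, antecedent: List[str], consequent: List[str]) -> float:
    """Support of antecedent and consequent together, divided by support of the antecedent."""
    return (fuzzy_support(transactions, antecedent + consequent)
            / fuzzy_support(transactions, antecedent))

# Invented toy records, for illustration only.
records = [
    {"age": 72, "dx": ["diabetes", "hypertension"]},
    {"age": 65, "dx": ["hypertension"]},
    {"age": 40, "dx": ["asthma"]},
    {"age": 80, "dx": ["diabetes", "hypertension"]},
]
transactions = [fuzzify(r) for r in records]

support_old = fuzzy_support(transactions, ["age=old"])
conf_old_diabetes = fuzzy_confidence(transactions, ["age=old"], ["diabetes"])
print(support_old, conf_old_diabetes)
```

In this sketch the 65-year-old patient belongs to both "middle" and "old" with degree 0.5, so the record contributes partially to rules about either label instead of being forced into one crisp age bin. A distributed version would compute the same per-transaction minima in parallel and sum them across partitions.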
Pages: 13