Particularities of data mining in medicine: lessons learned from patient medical time series data analysis

Cited by: 0
Authors
Shadi Aljawarneh
Aurea Anguera
John William Atwood
Juan A. Lara
David Lizcano
Affiliations
[1] Jordan University of Science and Technology, Faculty of Computer and Information Technology
[2] Technical University of Madrid, School of Computer Science, Campus de Montegancedo
[3] Concordia University, High Speed Protocols Laboratory
[4] Madrid Open University (UDIMA), School of Computer Science
Source
EURASIP Journal on Wireless Communications and Networking, Volume 2019
Keywords
KDD; Data mining; Physiological signals; Medical data mining; Lessons learned; EEG; Stabilometry; Sensors
DOI
Not available
Abstract
Large amounts of data are now generated in the medical domain. Physiological signals from different organs can be recorded to extract useful information about patients' health. Analyzing these signals is a hard task that requires specific approaches such as the Knowledge Discovery in Databases (KDD) process. Applying this process in medicine carries a series of implications and difficulties, especially when data mining techniques are applied to data, mainly time series, gathered from medical examinations of patients. The goal of this paper is to describe the lessons learned and the experience gathered by the authors in applying data mining techniques to real medical patient data, including time series. We carried out an exhaustive case study on data from two medical fields: stabilometry (15 professional basketball players and 18 elite ice skaters) and electroencephalography (100 healthy patients and 100 epileptic patients). We applied a previously proposed knowledge discovery framework for classification and obtained good results in terms of classification accuracy (greater than 99% in both fields). These results are the groundwork for the lessons learned and recommendations made in this position paper, which is intended as a guide for experts facing similar medical data mining projects.