Applying data mining techniques to medical time series: an empirical case study in electroencephalography and stabilometry

被引:16
作者
Anguera, A. [1 ]
Barreiro, J. M. [1 ]
Lara, J. A. [2 ]
Lizcano, D. [2 ]
机构
[1] Tech Univ Madrid, Sch Comp Sci, Campus Montegancedo S-N, Madrid 28660, Spain
[2] Open Univ Madrid, UDIMA, Fac Ensenanzas Tecn, Ctra Coruna Km 38-500,Via Serv 15, Madrid 28400, Spain
关键词
Medical Data Mining; Electronic Health Record; Time Series; Knowledge Discovery; SYSTEM; DIAGNOSIS; DISEASE; EVENTS; MODELS; RISK;
D O I
10.1016/j.csbj.2016.05.002
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
One of the major challenges in the medical domain today is how to exploit the huge amount of data that this field generates. To do this, approaches are required that are capable of discovering knowledge that is useful for decision making in the medical field. Time series are data types that are common in the medical domain and require specialized analysis techniques and tools, especially if the information of interest to specialists is concentrated within particular time series regions, known as events. This research followed the steps specified by the so-called knowledge discovery in databases (KDD) process to discover knowledge from medical time series derived from stabilometric (396 series) and ectroencephalographic (200) patient electronic health records (EHR). The view offered in the paper is based on the experience gathered as part of the VIIP project.(1) Knowledge discovery in medical time series has a number of difficulties and implications that are highlighted by illustrating the application of several techniques that cover the entire KDD process through two case studies. This paper illustrates the application of different knowledge discovery techniques for the purposes of classification within the above domains. The accuracy of this application for the two classes considered in each case is 99.86% and 98.11% for epilepsy diagnosis in the electroencephalography (EEG) domain and 99.4% and 99.1% for early-age sports talent classification in the stabilometry domain. The KDD techniques achieve better results than other traditional neural network-based classification techniques. (C) 2016 Anguera et al. Published by Elsevier B.V. on behalf of the Research Network of Computational and Structural Biotechnology.
引用
收藏
页码:185 / 199
页数:15
相关论文
共 50 条
[41]   Summary of Clustering Research in Time Series Data Mining [J].
Li H. ;
Zhang L. .
Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2022, 51 (03) :416-424
[42]   Data Mining of Time Series Based on Wave Cluster [J].
Dong Jixue .
2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, :697-699
[43]   A data mining algorithm of time series and the application on traffic [J].
Wang, Xiaoye ;
Zhang, Hua .
DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2007, 14 :499-502
[44]   An efficient distance metric for time series data mining [J].
Shi, Yu-Qing ;
Zhu, Yue-Long .
MANUFACTURING AND ENGINEERING TECHNOLOGY, 2015, :471-474
[45]   Research on feature engineering for time series data mining [J].
Li, Lei ;
Ou, Yihang ;
Wu, Yabin ;
Li, Qi ;
Chen, Daoxin .
PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, :431-435
[46]   A comparison of three data mining time series models in prediction of monthly brucellosis surveillance data [J].
Shirmohammadi-Khorram, Nasrin ;
Tapak, Leili ;
Hamidi, Omid ;
Maryanaji, Zohreh .
ZOONOSES AND PUBLIC HEALTH, 2019, 66 (07) :759-772
[47]   INTELLIGENCE ANALYSIS OF EMPIRICAL DATA BASED ON TIME SERIES [J].
Ivanets, O. B. ;
Khrashchevskyi, R., V ;
Kulik, M. S. ;
Burichenko, M. Yu .
RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2023, (02) :61-71
[48]   An Empirical Study on Multivariate Data in Medical Decision Making Environment [J].
NaliniPriya, G. ;
Kannan, A. .
2013 IEEE INTERNATIONAL CONFERENCE ON SMART STRUCTURES AND SYSTEMS (ICSSS), 2013, :138-144
[49]   Self-labeling techniques for semi-supervised time series classification: an empirical study [J].
Gonzalez, Mabel ;
Bergmeir, Christoph ;
Triguero, Isaac ;
Rodriguez, Yanet ;
Benitez, Jose M. .
KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 55 (02) :493-528
[50]   Data Mining Techniques for Endometriosis Detection in a Data-Scarce Medical Dataset [J].
Caballero, Pablo ;
Gonzalez-Abril, Luis ;
Ortega, Juan A. ;
Simon-Soro, Aurea .
ALGORITHMS, 2024, 17 (03)