Applying data mining techniques to medical time series: an empirical case study in electroencephalography and stabilometry

被引:16
作者
Anguera, A. [1 ]
Barreiro, J. M. [1 ]
Lara, J. A. [2 ]
Lizcano, D. [2 ]
机构
[1] Tech Univ Madrid, Sch Comp Sci, Campus Montegancedo S-N, Madrid 28660, Spain
[2] Open Univ Madrid, UDIMA, Fac Ensenanzas Tecn, Ctra Coruna Km 38-500,Via Serv 15, Madrid 28400, Spain
关键词
Medical Data Mining; Electronic Health Record; Time Series; Knowledge Discovery; SYSTEM; DIAGNOSIS; DISEASE; EVENTS; MODELS; RISK;
D O I
10.1016/j.csbj.2016.05.002
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
One of the major challenges in the medical domain today is how to exploit the huge amount of data that this field generates. To do this, approaches are required that are capable of discovering knowledge that is useful for decision making in the medical field. Time series are data types that are common in the medical domain and require specialized analysis techniques and tools, especially if the information of interest to specialists is concentrated within particular time series regions, known as events. This research followed the steps specified by the so-called knowledge discovery in databases (KDD) process to discover knowledge from medical time series derived from stabilometric (396 series) and ectroencephalographic (200) patient electronic health records (EHR). The view offered in the paper is based on the experience gathered as part of the VIIP project.(1) Knowledge discovery in medical time series has a number of difficulties and implications that are highlighted by illustrating the application of several techniques that cover the entire KDD process through two case studies. This paper illustrates the application of different knowledge discovery techniques for the purposes of classification within the above domains. The accuracy of this application for the two classes considered in each case is 99.86% and 98.11% for epilepsy diagnosis in the electroencephalography (EEG) domain and 99.4% and 99.1% for early-age sports talent classification in the stabilometry domain. The KDD techniques achieve better results than other traditional neural network-based classification techniques. (C) 2016 Anguera et al. Published by Elsevier B.V. on behalf of the Research Network of Computational and Structural Biotechnology.
引用
收藏
页码:185 / 199
页数:15
相关论文
共 50 条
[21]   Fuzzy data mining for time-series data [J].
Chen, Chun-Hao ;
Hong, Tzung-Pei ;
Tseng, Vincent S. .
APPLIED SOFT COMPUTING, 2012, 12 (01) :536-542
[22]   Study on Mining Association Rules from Stock Time Series Data [J].
Xian, Tan .
SENSORS, MEASUREMENT AND INTELLIGENT MATERIALS, PTS 1-4, 2013, 303-306 :1119-1124
[23]   Particularities of data mining in medicine: lessons learned from patient medical time series data analysis [J].
Shadi Aljawarneh ;
Aurea Anguera ;
John William Atwood ;
Juan A. Lara ;
David Lizcano .
EURASIP Journal on Wireless Communications and Networking, 2019
[24]   Clustering stock price time series data to generate stock trading recommendations: An empirical study [J].
Nair, Binoy B. ;
Kumar, P. K. Saravana ;
Sakthivel, N. R. ;
Vipin, U. .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 70 :20-36
[25]   Time Series Data Mining: A Retail Application [J].
Hebert, Daniel ;
Anderson, Billie ;
Olinsky, Alan ;
Hardin, J. Michael .
INTERNATIONAL JOURNAL OF BUSINESS ANALYTICS, 2014, 1 (04) :51-68
[26]   Outliers Mining in Time Series Data Sets [J].
Zheng BinxiangDu Xiuhua Xi Yugeng Institute of Automation Shanghai Jiaotong UniversityShanghai PRChina .
JournalofSystemsEngineeringandElectronics, 2002, (01) :93-97
[27]   A data mining framework for time series estimation [J].
Hu, Xiao ;
Xu, Peng ;
Wu, Shaozhi ;
Asgari, Shadnaz ;
Bergsneider, Marvin .
JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (02) :190-199
[28]   Introducing time series chains: a new primitive for time series data mining [J].
Zhu, Yan ;
Imamura, Makoto ;
Nikovski, Daniel ;
Keogh, Eamonn .
KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (02) :1135-1161
[29]   Introducing time series chains: a new primitive for time series data mining [J].
Yan Zhu ;
Makoto Imamura ;
Daniel Nikovski ;
Eamonn Keogh .
Knowledge and Information Systems, 2019, 60 :1135-1161
[30]   Analytics over Multi-sensor Time Series Data - A Case-Study on Prediction of Mining Hazards [J].
Janusz, Andrzej ;
Slezak, Dominik .
INFORMATION TECHNOLOGY IN GEO-ENGINEERING, 2020, :815-830