Applying data mining techniques to medical time series: an empirical case study in electroencephalography and stabilometry

被引:16
作者
Anguera, A. [1 ]
Barreiro, J. M. [1 ]
Lara, J. A. [2 ]
Lizcano, D. [2 ]
机构
[1] Tech Univ Madrid, Sch Comp Sci, Campus Montegancedo S-N, Madrid 28660, Spain
[2] Open Univ Madrid, UDIMA, Fac Ensenanzas Tecn, Ctra Coruna Km 38-500,Via Serv 15, Madrid 28400, Spain
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2016年 / 14卷
关键词
Medical Data Mining; Electronic Health Record; Time Series; Knowledge Discovery; SYSTEM; DIAGNOSIS; DISEASE; EVENTS; MODELS; RISK;
D O I
10.1016/j.csbj.2016.05.002
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
One of the major challenges in the medical domain today is how to exploit the huge amount of data that this field generates. To do this, approaches are required that are capable of discovering knowledge that is useful for decision making in the medical field. Time series are data types that are common in the medical domain and require specialized analysis techniques and tools, especially if the information of interest to specialists is concentrated within particular time series regions, known as events. This research followed the steps specified by the so-called knowledge discovery in databases (KDD) process to discover knowledge from medical time series derived from stabilometric (396 series) and ectroencephalographic (200) patient electronic health records (EHR). The view offered in the paper is based on the experience gathered as part of the VIIP project.(1) Knowledge discovery in medical time series has a number of difficulties and implications that are highlighted by illustrating the application of several techniques that cover the entire KDD process through two case studies. This paper illustrates the application of different knowledge discovery techniques for the purposes of classification within the above domains. The accuracy of this application for the two classes considered in each case is 99.86% and 98.11% for epilepsy diagnosis in the electroencephalography (EEG) domain and 99.4% and 99.1% for early-age sports talent classification in the stabilometry domain. The KDD techniques achieve better results than other traditional neural network-based classification techniques. (C) 2016 Anguera et al. Published by Elsevier B.V. on behalf of the Research Network of Computational and Structural Biotechnology.
引用
收藏
页码:185 / 199
页数:15
相关论文
共 50 条
  • [21] Study on Mining Association Rules from Stock Time Series Data
    Xian, Tan
    SENSORS, MEASUREMENT AND INTELLIGENT MATERIALS, PTS 1-4, 2013, 303-306 : 1119 - 1124
  • [22] Particularities of data mining in medicine: lessons learned from patient medical time series data analysis
    Shadi Aljawarneh
    Aurea Anguera
    John William Atwood
    Juan A. Lara
    David Lizcano
    EURASIP Journal on Wireless Communications and Networking, 2019
  • [23] Fuzzy data mining for time-series data
    Chen, Chun-Hao
    Hong, Tzung-Pei
    Tseng, Vincent S.
    APPLIED SOFT COMPUTING, 2012, 12 (01) : 536 - 542
  • [24] Clustering stock price time series data to generate stock trading recommendations: An empirical study
    Nair, Binoy B.
    Kumar, P. K. Saravana
    Sakthivel, N. R.
    Vipin, U.
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 70 : 20 - 36
  • [25] A data mining framework for time series estimation
    Hu, Xiao
    Xu, Peng
    Wu, Shaozhi
    Asgari, Shadnaz
    Bergsneider, Marvin
    JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (02) : 190 - 199
  • [26] Outliers Mining in Time Series Data Sets
    Zheng Binxiang
    Journal of Systems Engineering and Electronics, 2002, (01) : 93 - 97
  • [27] Time Series Data Mining: A Retail Application
    Hebert, Daniel
    Anderson, Billie
    Olinsky, Alan
    Hardin, J. Michael
    INTERNATIONAL JOURNAL OF BUSINESS ANALYTICS, 2014, 1 (04) : 51 - 68
  • [28] Introducing time series chains: a new primitive for time series data mining
    Yan Zhu
    Makoto Imamura
    Daniel Nikovski
    Eamonn Keogh
    Knowledge and Information Systems, 2019, 60 : 1135 - 1161
  • [29] Analytics over Multi-sensor Time Series Data - A Case-Study on Prediction of Mining Hazards
    Janusz, Andrzej
    Slezak, Dominik
    INFORMATION TECHNOLOGY IN GEO-ENGINEERING, 2020, : 815 - 830
  • [30] Introducing time series chains: a new primitive for time series data mining
    Zhu, Yan
    Imamura, Makoto
    Nikovski, Daniel
    Keogh, Eamonn
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (02) : 1135 - 1161