DAPs: Mining using change-point detection of epileptic activity time series data

Cited by: 0
Authors
Kim S.-H. [1]
Li L. [2]
Faloutsos C. [3]
Yang H.-J. [4]
Lee S.-W. [1]
Affiliations
[1] Department of Brain and Cognitive Engineering, Korea University, Seoul
[2] Computer Science Division, University of California, Berkeley, 94720, CA
[3] School of Computer Science, Carnegie Mellon University, Pittsburgh, 15213, PA
[4] Department of Computer Science, Chonnam National University, Gwangju
Source
2017 / Institute of Information Science / Vol. 33
Funding
National Research Foundation, Singapore
Keywords
Chaos population model; Compression; Electroencephalography; Gray-box model; Minimum description length; Parameter estimation; Segmentation
DOI
10.1688/JISE.2017.33.2.14
Abstract
The goal of this study is to mine meaningful patterns effectively and efficiently from time series data via change-point detection, with the assistance of domain knowledge and observed data. These patterns support segmentation and compression. We developed a novel gray-box approach for mining such data: Domain Assisted Parameter semi-free wave mining (DAPs). DAPs is intended for mining time series with rich domain-specific knowledge, based on a chaos model. Specifically, it automatically detects a change-point in a time sequence according to the minimum description length principle. The sequence is then segmented at the detected change-point, and each segment is fitted with a consistent model. Experimental results on both synthetic and real EEG data indicated that the method offers a significant improvement in segmentation and compression via pattern detection over existing methods. DAPs reduced the number of bits needed for the observed data by detecting changes in the patterns contained therein, and achieved a higher average compression ratio, 1.6% more than WT (level 5). DAPs offers the advantages of (a) automatically detecting meaningful patterns, (b) being parameter semi-free, and (c) substantially reducing data storage. These findings suggest possible applications in medical devices that produce vast amounts of physiological data requiring monitoring. © 2017 Institute of Information Science. All rights reserved.
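The idea of choosing a change-point by the minimum description length principle can be illustrated with a small sketch. This is not the DAPs algorithm: it assumes a simple constant-mean Gaussian model per segment in place of the chaos population model, and the function names (`segment_bits`, `best_change_point`) and the `min_len` parameter are hypothetical choices made for illustration only.

```python
import math


def segment_bits(xs, min_var=1e-6):
    """Approximate two-part code length (in bits) of one segment.

    Assumption: the segment is modeled as Gaussian noise around a constant
    mean. This stands in for the domain model and is not the DAPs model.
    """
    n = len(xs)
    mean = sum(xs) / n
    # Floor the variance so the logarithm stays finite on flat segments.
    var = max(sum((x - mean) ** 2 for x in xs) / n, min_var)
    # Cost of the data given the model (Gaussian differential code length).
    data_bits = 0.5 * n * math.log2(2 * math.pi * math.e * var)
    # Cost of the model: two parameters, each about 0.5 * log2(n) bits.
    model_bits = 2 * 0.5 * math.log2(n)
    return data_bits + model_bits


def best_change_point(xs, min_len=5):
    """Return (cut, bits): the split index with the lowest total cost.

    cut is None when no split is cheaper than encoding xs as one segment.
    """
    n = len(xs)
    best_cut, best_bits = None, segment_bits(xs)
    for cut in range(min_len, n - min_len + 1):
        # log2(n) bits pay for stating where the change-point lies.
        bits = segment_bits(xs[:cut]) + segment_bits(xs[cut:]) + math.log2(n)
        if bits < best_bits:
            best_cut, best_bits = cut, bits
    return best_cut, best_bits


if __name__ == "__main__":
    # Synthetic signal: the level jumps from about 0 to about 5 at index 50.
    signal = [0.1 * math.sin(i) for i in range(50)]
    signal += [5 + 0.1 * math.sin(i) for i in range(50, 100)]
    cut, bits = best_change_point(signal)
    print("change-point index:", cut)
    print("bits with split:", round(bits, 1))
    print("bits without split:", round(segment_bits(signal), 1))
```

A split is accepted only when the bits saved by fitting each segment separately outweigh the bits spent on the extra model parameters and on the change-point position, which is the trade-off the abstract refers to when it links change detection to compression. The code lengths here come from a continuous density, so they can be negative; only their differences are meaningful.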
Pages: 517-536
Number of pages: 19