Association Rule Mining in Multiple, Multidimensional Time Series Medical Data

被引:12
作者
Pradhan G.N. [1 ]
Prabhakaran B. [2 ]
机构
[1] Mayo Clinic, 13400 E Shea Blvd, Scottsdale, 85259, AZ
[2] University of Texas at Dallas, 800 W Campbell Rd, Richardson, 75080, TX
关键词
Association rules; Clustering; Electromyograms; Multi-attribute; Pattern mining;
D O I
10.1007/s41666-017-0001-x
中图分类号
学科分类号
摘要
Time series pattern mining (TSPM) finds correlations or dependencies in same series or in multiple time series. When numerous instances of multiple time series data are associated with different quantitative attributes, they form a multiple multidimensional framework. In this paper, we consider real-life time series data of muscular activities of human participants obtained from multiple electromyogram (EMG) sensors and discover patterns in these EMG time series data. Each EMG time series data is associated with quantitative attributes such as energy of the signal and onset time, which are required to be mined along with EMG time series patterns. We propose a two-stage approach for this purpose: in the first stage, our emphasis is on discovering frequent patterns in multiple time series by doing sequential mining across time slices. And in the next stage, we focus on the quantitative attributes of only those time series that are present in the patterns discovered in the first stage. Our evaluation with large sets of time series data from multiple EMG sensors demonstrate that our two-stage approach speeds up the process of finding association rules in such multidimensional environment as compared to other methods and scales up linearly in terms of number of time series involved. Our approach is generic in finding association rules in other medical sensor databases containing multiple time series associated with quantitative attributes, which can be used in extending research areas like rehabilitation programs or designing better prosthetic devices. © 2017, Springer International Publishing AG.
引用
收藏
页码:92 / 118
页数:26
相关论文
共 62 条
[11]  
Cano M., Santos M., de Avila A., Romani L., Traina A., Ribeiro M., Sart: A new association rule method for mining sequential patterns in time series of climate data, Computational Science and Its Applications – ICCSA 2012. Lecture Notes in Computer Science, 7335, pp. 743-757, (2012)
[12]  
Chan F.H.Y., Yang Y.S., Lam F.K., Zhang Y.T., Parker P.A., Fuzzy EMG classification for prosthesis control, IEEE Trans Rehabil Eng, 8, 3, pp. 305-311, (2000)
[13]  
Chaudhuri S., Dayal U., An overview of data warehousing and OLAP technology, SIGMOD Rec, 26, 1, pp. 65-74, (1997)
[14]  
Chen T.S., Hsu S.C., Mining frequent tree-like patterns in large datasets, Data Knowl Eng, 62, 1, pp. 65-83, (2007)
[15]  
Das G., Lin K.I., Mannila H., Renganathan G., Smyth P., Rule discovery from time series, Proceedings of KDD, pp. 16-22, (1998)
[16]  
Ester M., Kriegel H.-P., Jorg S., Xu X., A density-based algorithm for discovering clusters in large spatial databases with noise, Proceedings of the 2nd international conference on knowledge discovery and data mining. AAAI Press, pp 226–231, (1996)
[17]  
Gehrke J., Ganti V., Ramakrishnan R., Loh W.Y., Boat optimistic decision tree construction, Proceedings of ACM SIGMOD. New York, NY, USA, pp. 169-180, (1999)
[18]  
Gosain A., Bhugra M., A comprehensive survey of association rules on quantitative data in data mining, IEEE conference on information communication technologies (ICT), 2013, pp 1003–1008, (2013)
[19]  
Gray J., Bosworth A., Layman A., Pirahesh H., Data cube: a relational aggregation operator generalizing group-by, cross-tab, and sub-totals, Data Min Knowl Disc, 1, 1, pp. 29-53, (1997)
[20]  
Guha S., Rastogi R., Shim K., Cure: an efficient clustering algorithm for large databases, Proceedings of ACM SIGMOD, pp. 73-84, (1998)