Mining and Forecasting of Big Time-series Data

被引:35
作者
Sakurai, Yasushi [1 ]
Matsubara, Yasuko [1 ]
Faloutsos, Christos [2 ]
机构
[1] Kumamoto Univ, Kumamoto, Japan
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
来源
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2015年
基金
美国国家科学基金会;
关键词
Time-series; Tensors; Pattern discovery; Forecasting;
D O I
10.1145/2723372.2731081
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a large collection of time series, such as web-click logs, electric medical records and motion capture sensors, how can we efficiently and effectively find typical patterns? How can we statistically summarize all the sequences, and achieve a meaningful segmentation? What are the major tools for forecasting and outlier detection? Time-series data analysis is becoming of increasingly high importance, thanks to the decreasing cost of hardware and the increasing on-line processing capability. The objective of this tutorial is to provide a concise and intuitive overview of the most important tools that can help us find patterns in large-scale time-series sequences. We review the state of the art in four related fields: (1) similarity search and pattern discovery. (2) linear modeling and summarization. (3) non-linear modeling and forecasting, and (4) the extension of time-series mining and tensor analysis. The emphasis of the tutorial is to provide the intuition behind these powerful tools, which is usually lost in the technical literature, as well as to introduce case studies that illustrate their practical use.
引用
收藏
页码:919 / 922
页数:4
相关论文
共 56 条
[41]  
Pan J.Y., 2004, PAKDD
[42]  
Papadimitriou S., 2005, Proceedings of the 31st international conference on Very large data bases, P697
[43]  
Rakthanmanon Thanawin, 2012, KDD, V2012, P262, DOI 10.1145/2339530.2339576
[44]  
Sakurai Y., 2005, Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, PODS '05, P326
[45]  
Sakurai Y., 2005, P ACM SIGMOD INT C M, P599, DOI [10.1145/1066157.1066226, DOI 10.1145/1066157.1066226]
[46]  
Sakurai Yasushi., 2011, Proceedings of the 2011 SIAM International Conference on Data Mining, P759
[47]  
Sun J., 2006, P 12 ACM SIGKDD INT, P374, DOI DOI 10.1145/1150402.1150445
[48]  
Tao Yufei., 2004, SIGMOD 04, P611
[49]   Pattern discovery in data streams under the time warping distance [J].
Toyoda, Machiko ;
Sakurai, Yasushi ;
Ishikawa, Yoshiharu .
VLDB JOURNAL, 2013, 22 (03) :295-318
[50]   Contagious Diseases in the United States from 1888 to the Present [J].
van Panhuis, Willem G. ;
Grefenstette, John ;
Jung, Su Yon ;
Chok, Nian Shong ;
Cross, Anne ;
Eng, Heather ;
Lee, Bruce Y. ;
Zadorozhny, Vladimir ;
Brown, Shawn ;
Cummings, Derek ;
Burke, Donald S. .
NEW ENGLAND JOURNAL OF MEDICINE, 2013, 369 (22) :2152-2158