Correlating burst events on streaming stock market data

Authors
Michail Vlachos
Kun-Lung Wu
Shyh-Kwei Chen
Philip S. Yu
Affiliation
[1] IBM T. J. Watson Research Center
Source
Data Mining and Knowledge Discovery | 2008 / Vol. 16
Keywords
Time-series; Indexing; Burst detection; Correlation
Abstract
We address the problem of monitoring and identifying correlated burst patterns in multi-stream time-series databases. We follow a two-step methodology: first we identify the burst sections in the data, and then we store them in an efficient in-memory index for easy retrieval. The burst detection scheme imposes a variable threshold on the examined data and takes advantage of the skewed distribution that is typically encountered in many applications. The detected bursts are compacted into burst intervals and stored in an interval index. The index facilitates the identification of correlated bursts by performing very efficient overlap operations on the stored burst regions. We present the merits of the proposed indexing scheme through a thorough analysis of its complexity. We also demonstrate the real-time response of our burst indexing technique, and show the usefulness of the approach for correlating surprising trading-volume events using historical stock data from the New York Stock Exchange. While the focus of this work is on financial data, the proposed methods and data structures can also be applied to anomaly or novelty detection in telecommunication, network traffic, and medical data.
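The two-step idea in the abstract (detect bursts against a variable threshold, compact them into intervals, then look for overlapping intervals across streams) can be illustrated with a minimal Python sketch. This is not the authors' method: the trailing-window threshold `mean + k * std`, the parameters `window` and `k`, and the function names `detect_bursts` and `overlapping_bursts` are all assumptions made for illustration, and the overlap lookup is a plain linear scan standing in for the paper's interval index.

```python
from statistics import mean, pstdev


def detect_bursts(values, window=4, k=1.5):
    """Compact above-threshold points into inclusive (start, end) intervals.

    Assumption: the variable threshold is mean + k * std of the previous
    `window` values. The paper's actual thresholding scheme may differ.
    """
    intervals = []
    start = None
    for i, v in enumerate(values):
        history = values[max(0, i - window):i]
        if len(history) < window:
            is_burst = False  # not enough history to set a threshold yet
        else:
            is_burst = v > mean(history) + k * pstdev(history)
        if is_burst and start is None:
            start = i  # a new burst interval opens here
        elif not is_burst and start is not None:
            intervals.append((start, i - 1))  # close the open interval
            start = None
    if start is not None:
        intervals.append((start, len(values) - 1))
    return intervals


def overlapping_bursts(query, indexed):
    """Return (stream_id, interval) pairs whose interval overlaps `query`.

    Assumption: a linear scan over a dict of interval lists; a real
    interval index would answer this without visiting every interval.
    """
    q_start, q_end = query
    hits = []
    for stream_id, intervals in indexed.items():
        for start, end in intervals:
            if start <= q_end and q_start <= end:
                hits.append((stream_id, (start, end)))
    return hits


# Hypothetical trading-volume series for two made-up tickers.
volume_a = [1, 2, 1, 2, 1, 2, 20, 25, 2, 1, 2, 1]
volume_b = [2, 1, 2, 1, 2, 1, 2, 30, 28, 1, 2, 1]

index = {"B": detect_bursts(volume_b)}
for burst in detect_bursts(volume_a):
    print(burst, "overlaps", overlapping_bursts(burst, index))
```

Because the threshold here is recomputed from recent history, a sustained burst raises its own threshold and the interval closes quickly; whether that behavior is desirable depends on the application and is a design choice of this sketch only.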
Pages: 109-133
Page count: 24