Correlation Mining between Time Series Stream and Event Stream

被引:6
|
作者
Minaei-Bidgoli, Behrouz [1 ]
Lajevardi, Seyed Behzad [1 ]
机构
[1] Iran Univ Sci & Technol, Dept Comp Engn, Tehran, Iran
来源
NCM 2008: 4TH INTERNATIONAL CONFERENCE ON NETWORKED COMPUTING AND ADVANCED INFORMATION MANAGEMENT, VOL 2, PROCEEDINGS | 2008年
关键词
Temporal Data mining; Time Series; Event Streams; Equation Discovery; Correlation; Aerology; Lagrange; Time Point; Temporal Data;
D O I
10.1109/NCM.2008.223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There are huge amounts of temporal data in many real-world applications. National climate databases provide examples of this type of temporal data. Two categories of data are held in these databases: the first are time series events such as relative humidity, amount of cloud, visibility, soil temperature, bulb, wind direction, and so forth which are recorded in hourly-bin time intervals, and the second are event steam data like heavy rains, flooding, hurricane, ice storms and so forth. In this work we propose a general framework to explore the correlation between time series streams and events stream using equation discovery method. 4 well designed time series system measures a bunch of parameters while many event may happen independently from time series interval, it means that an event has its own time point. We try to find the effective parameters on events occurrences. Firstly, we find the closest time series record for any events; therefore, we have gathered different parameters value when an event is occurring. Using LA GRANGE's equation discovery method we find the equation between the events and effective parameters. LAGRANGE's equation discovery is one of the well-known equation discovery methods which analyze a great domain of equations. This equation and time series model can predict future events efficiently. Our data set is Tehran daily aerological data set which is digital data set archived at the Iran meteorological organization. This data set has statistics for geopotential height, temperature, specific humidity, Zonal wind, and meridional wind Data has collected from Tehran "Mehrabad" station. The earliest data is from 1961; the latest, from 2005 (44 years). This data set includes 17 types of events. Time series models can predict next time series parameters value and by using these equations the closest event can be predicted. The LAGRANGIAN method is able to find some suite equation with R>0.98 and s<0.008, which R is the correlation coefficient and S is the least-square. This work is the first estimate in the area of correlation mining for a huge data set of aerology and can be extended in many different data sets in any other environments.
引用
收藏
页码:333 / 338
页数:6
相关论文
共 50 条
  • [1] Data and Event Stream Mining
    Naveen, Kavya
    Sathyanarayana, M. V.
    Naveen, N. C.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (04): : 140 - 143
  • [2] Anomaly Detection for Time Series Data Stream
    Wang, Qifan
    Yan, Bo
    Su, Hongyi
    Zheng, Hong
    2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021, : 118 - 122
  • [3] The Application of Stream Data Time-Series Pattern Reliance Mining in Stock Market Analysis
    Yao, Jiayi
    Kong, Shuhui
    IEEE/SOLI'2008: PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATIONS AND LOGISTICS, AND INFORMATICS, VOLS 1 AND 2, 2008, : 159 - 163
  • [4] An improved time series model for monthly stream flows
    Ozcelik, Ceyhun
    Baykan, N. Orhan
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2009, 23 (05) : 587 - 601
  • [5] An improved time series model for monthly stream flows
    Ceyhun Özçelik
    N. Orhan Baykan
    Stochastic Environmental Research and Risk Assessment, 2009, 23 : 587 - 601
  • [6] Incremental and Adaptive Feature Exploration over Time Series Stream
    Zuo, Jingwei
    Zeitouni, Karine
    Taher, Yehia
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 593 - 602
  • [7] Self CNN-based time series stream forecasting
    Zeng, Zhiping
    Xiao, Haidong
    Zhang, Xinpeng
    ELECTRONICS LETTERS, 2016, 52 (22) : 1857 - +
  • [8] DUAL-STREAM CNN FOR STRUCTURED TIME SERIES CLASSIFICATION
    Weng, Shuchen
    Li, Wenbo
    Zhang, Yi
    Lyu, Siwei
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3187 - 3191
  • [9] TS-stream: clustering time series on data streams
    Pereira, Cassio M. M.
    de Mello, Rodrigo F.
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 42 (03) : 531 - 566
  • [10] TS-stream: clustering time series on data streams
    Cássio M. M. Pereira
    Rodrigo F. de Mello
    Journal of Intelligent Information Systems, 2014, 42 : 531 - 566