A Deviant Load Shedding System for Data Stream Mining

被引:2
作者
Desai, Darshana [1 ]
Joshi, Abhijit [1 ]
机构
[1] Dwarkadas J Sanghvi Coll Engn, Bombay, Maharashtra, India
来源
INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING TECHNOLOGIES AND APPLICATIONS (ICACTA) | 2015年 / 45卷
关键词
Data stream; Frequent pattern matching; Data overload; Load shedding;
D O I
10.1016/j.procs.2015.03.103
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Load shedding is imperative for data stream processing systems in numerous functions as data streams are susceptible to sudden spikes in volume. The proposed system is an attempt to seek and resolve four major problems associated with data stream, which include load shedding and anti-shedding time, number of transactions pruned and selecting predicate; using efficient mining system. The frequent pattern discovered in data stream used in the model exploits the synergy between scheduling and load shedding. This paper also proposes various load shedding strategies which reduce and lighten the workload of the system ensuring an acceptable level of mining accuracy using various parameters like transaction, priority and attributes of data mining. A majority chunk of workload in mining algorithm lies in the innumerable item sets, which are counted and enumerated. The approach is based on the frequent pattern matching principle of stream mining which involves reducing the workload to maintain smaller item sets. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:118 / 126
页数:9
相关论文
共 17 条
  • [1] Aggarwal Charu C., 2001, DATA STREAM MODELS A, P10598
  • [2] [Anonymous], ACM SIGMOD INT C MAN
  • [3] [Anonymous], 2003, CS200308
  • [4] Load shedding for aggregation queries over data streams
    Babcock, B
    Datar, M
    Motwani, R
    [J]. 20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 350 - 361
  • [5] Chang Joong Hyuk, 2003, P 9 ACM SIGKDD INT C, P487, DOI DOI 10.1145/956750.956807
  • [6] Charikar M, 2002, LECT NOTES COMPUT SC, V2380, P693
  • [7] Giannella Chris., 2003, Mining Frequent Patterns in Data Streams at Multiple Time Granularities
  • [8] Clustering data streams: Theory and practice
    Guha, S
    Meyerson, A
    Mishra, N
    Motwani, R
    O'Callaghan, L
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (03) : 515 - 528
  • [9] Huang Chongzheng, 2007, IEEE FUTURE GENERATI, V1, P378
  • [10] Hulten G., 2001, KDD-2001. Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P97, DOI 10.1145/502512.502529