Hierarchical Sampling from Sketches: Estimating Functions over Data Streams

被引:0
|
作者
Sumit Ganguly
Lakshminath Bhuvanagiri
机构
[1] Indian Institute of Technology,
[2] Google Inc.,undefined
来源
Algorithmica | 2009年 / 53卷
关键词
Data streams; Frequency moments; Entropy; HSS;
D O I
暂无
中图分类号
学科分类号
摘要
We present a randomized procedure named Hierarchical Sampling from Sketches (Hss) that can be used for estimating a class of functions over the frequency vector f of update streams of the form \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$\varPsi(\mathcal {S})=\sum_{i=1}^{n}\psi(\vert {f_{i}}\vert )$\end{document} . We illustrate this by applying the Hss technique to design nearly space-optimal algorithms for estimating the pth moment of the frequency vector, for real p≥2 and for estimating the entropy of a data stream.
引用
收藏
页码:549 / 582
页数:33
相关论文
共 50 条
  • [41] SPIRE: Efficient Data Inference and Compression over RFID Streams
    Nie, Yanming
    Cocci, Richard
    Cao, Zhao
    Diao, Yanlei
    Shenoy, Prashant
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (01) : 141 - 155
  • [42] Time-slide window join over data streams
    Kim, Hyeon Gyu
    Park, Yoo Hyun
    Cho, Yang Hyun
    Kim, Myoung Ho
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 43 (02) : 323 - 347
  • [43] Adaptive scheduling for shared window joins over data streams
    Jin C.
    Zhou A.
    Yu J.X.
    Huang J.Z.
    Cao F.
    Frontiers of Computer Science in China, 2007, 1 (4): : 468 - 477
  • [44] Interactive mining of high utility patterns over data streams
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    Choi, Ho-Jin
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (15) : 11979 - 11991
  • [45] Load shedding for window queries over continuous data streams
    Kim, Kwang Rak
    Kim, Hyeon Gyu
    Lecture Notes in Electrical Engineering, 2015, 373 : 159 - 164
  • [46] Tracking clusters in evolving data streams over sliding windows
    Aoying Zhou
    Feng Cao
    Weining Qian
    Cheqing Jin
    Knowledge and Information Systems, 2008, 15 : 181 - 214
  • [47] On-line generation association rules over data streams
    Shin, Se Jung
    Lee, Won Suk
    INFORMATION AND SOFTWARE TECHNOLOGY, 2008, 50 (06) : 569 - 578
  • [48] Multi-relational pattern mining over data streams
    Andreia Silva
    Cláudia Antunes
    Data Mining and Knowledge Discovery, 2015, 29 : 1783 - 1814
  • [49] Time-slide window join over data streams
    Hyeon Gyu Kim
    Yoo Hyun Park
    Yang Hyun Cho
    Myoung Ho Kim
    Journal of Intelligent Information Systems, 2014, 43 : 323 - 347
  • [50] Multi-relational pattern mining over data streams
    Silva, Andreia
    Antunes, Claudia
    DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (06) : 1783 - 1814