Time-decaying bloom filters for data streams with skewed distributions

被引:26
|
作者
Cheng, K [1 ]
Xiang, LM [1 ]
Iwaihara, M [1 ]
Xu, HY [1 ]
Mohania, MM [1 ]
机构
[1] Kyushu Sangyo Univ, Fukuoka, Japan
来源
15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications, Proceedings | 2005年
关键词
D O I
10.1109/RIDE.2005.15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bloom Filters are space-efficient data structures for membership queries over sets. To enable queries for multiplicities of multi-sets, the bitmap in a Bloom Filter is replaced by an array of counters whose values increment on each occurrence. In a data stream model, however, data items arrive at varying rates and recent occurrences are often regarded as more significant than past ones. In most data stream applications, it is critical to handle this "time-sensitivity". Furthermore, data streams with skewed distributions are common in many emerging applications, e.g., traffic engineering and billing, intrusion detection, trading surveillance and outlier detection. For such applications, it is inefficient to allocate counters of uniform size to all buckets. In this paper, we present Time-decaying Bloom Filters (TBF), a Bloom Filter that maintains the frequency count for each item in a data stream, and the value of each counter decays with time. For data streams with highly skewed distributions, we proposed further optimization by allowing dynamically allocating free counters to the "large" items. We performed preliminary experiments to verify the optimization.
引用
收藏
页码:63 / 69
页数:7
相关论文
共 50 条
  • [21] Tracking Influential Nodes in Time-Decaying Dynamic Interaction Networks
    Zhao, Junzhou
    Shang, Shuo
    Wang, Pinghui
    Lui, John C. S.
    Zhang, Xiangliang
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1106 - 1117
  • [22] EXISTENCE AND NONEXISTENCE OF WAVE OPERATORS FOR TIME-DECAYING HARMONIC OSCILLATORS
    Ishida, Atsuhide
    Kawamoto, Masaki
    REPORTS ON MATHEMATICAL PHYSICS, 2020, 85 (03) : 335 - 350
  • [23] A generalized voter model with time-decaying memory on a multilayer network
    Zhong, Li-Xin
    Xu, Wen-Juan
    Chen, Rong-Da
    Zhong, Chen-Yang
    Qiu, Tian
    Shi, Yong-Dong
    Wang, Li-Liang
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2016, 458 : 95 - 105
  • [24] Summarizing and Mining Skewed Data Streams
    Cormode, Graham
    Muthukrishnan, S.
    PROCEEDINGS OF THE FIFTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2005, : 44 - 55
  • [25] Theorems for masonry solids with brittle time-decaying tensile limit strength
    Baratta, Alessandro
    Corbi, Ileana
    Corbi, Ottavia
    ACTA MECHANICA, 2017, 228 (03) : 837 - 849
  • [26] Theorems for masonry solids with brittle time-decaying tensile limit strength
    Alessandro Baratta
    Ileana Corbi
    Ottavia Corbi
    Acta Mechanica, 2017, 228 : 837 - 849
  • [27] SKEWED HOLDING TIME DISTRIBUTIONS
    VANDEEMTER, JJ
    CHEMICAL ENGINEERING SCIENCE, 1961, 13 (03) : 190 - 191
  • [28] Time-decaying magnetoelectric effects in multiferroic fibrous composites with a viscous interface
    Pan, E.
    Wang, X.
    Albrecht, J. D.
    JOURNAL OF APPLIED PHYSICS, 2009, 105 (08)
  • [29] Using Bloom Filters for Mining Top-k Frequent Itemsets in Data Streams
    Kim, Younghee
    Cho, Kyungsoo
    Yoon, Jaeyeol
    Kim, Ieejoon
    Kim, Ungmo
    SECURE AND TRUST COMPUTING, DATA MANAGEMENT, AND APPLICATIONS, 2011, 186 : 209 - 216
  • [30] Asymptotic behavior for nonlinear Schrodinger equations with critical time-decaying harmonic potential
    Kawamoto, Masaki
    JOURNAL OF DIFFERENTIAL EQUATIONS, 2021, 303 : 253 - 267