Time-decaying bloom filters for data streams with skewed distributions

被引:26
作者
Cheng, K [1 ]
Xiang, LM [1 ]
Iwaihara, M [1 ]
Xu, HY [1 ]
Mohania, MM [1 ]
机构
[1] Kyushu Sangyo Univ, Fukuoka, Japan
来源
15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications, Proceedings | 2005年
关键词
D O I
10.1109/RIDE.2005.15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bloom Filters are space-efficient data structures for membership queries over sets. To enable queries for multiplicities of multi-sets, the bitmap in a Bloom Filter is replaced by an array of counters whose values increment on each occurrence. In a data stream model, however, data items arrive at varying rates and recent occurrences are often regarded as more significant than past ones. In most data stream applications, it is critical to handle this "time-sensitivity". Furthermore, data streams with skewed distributions are common in many emerging applications, e.g., traffic engineering and billing, intrusion detection, trading surveillance and outlier detection. For such applications, it is inefficient to allocate counters of uniform size to all buckets. In this paper, we present Time-decaying Bloom Filters (TBF), a Bloom Filter that maintains the frequency count for each item in a data stream, and the value of each counter decays with time. For data streams with highly skewed distributions, we proposed further optimization by allowing dynamically allocating free counters to the "large" items. We performed preliminary experiments to verify the optimization.
引用
收藏
页码:63 / 69
页数:7
相关论文
共 50 条
  • [31] Deadline-aware misinformation prevention in social networks with time-decaying influence
    School of Mechano-Electronic Engineering, Xidian University, Xi'an, China
    不详
    Expert Sys Appl,
  • [32] RETRACTED: Improved Decaying Bloom Filter for Duplicate Detection in Data Streams Over Sliding Windows (Retracted Article)
    Wang, Xiujun
    Shen, Hong
    ICCSIT 2010 - 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 4, 2010, : 348 - 353
  • [33] ANISOTROPIC COSMOLOGICAL MODEL WITH NEGATIVE CONSTANT DECELERATION PARAMETER AND TIME-DECAYING A TERM
    Verma, M. K.
    Zeyauddin, Mohd
    Ram, Shri
    ROMANIAN JOURNAL OF PHYSICS, 2011, 56 (3-4): : 616 - 626
  • [34] Accurate Quantile Estimation for Skewed Data Streams
    Lin, Zheng
    Liu, Jun
    Lin, Nan
    2017 IEEE 28TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2017,
  • [35] Two statistical methods for improving the analysis of large climatic data sets:: General skewed Kalman filters and distributions of distributions
    Naveau, P
    Vrac, M
    Genton, MG
    Chédin, A
    Diday, E
    GEOENV IV - GEOSTATISTICS FOR ENVIRONMENTAL APPLICATIONS: PROCEEDINGS, 2004, 13 : 1 - 14
  • [37] Near-optimal Approximate Membership Query over Time-decaying Windows
    Liu, Yang
    Chen, Wenji
    Guan, Yong
    2013 PROCEEDINGS IEEE INFOCOM, 2013, : 1447 - 1455
  • [38] Final state problem for nonlinear Schrodinger equations with time-decaying harmonic oscillators
    Kawamoto, Masaki
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2021, 503 (01)
  • [39] Time-decaying gillespie methods for epidemic and contagion processes on social temporal networks
    Li, Hui
    Luo, You
    Chen, Xin
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 46 - 47
  • [40] Deadline-aware misinformation prevention in social networks with time-decaying influence
    Yang, Lan
    Li, Zhiwu
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238