Learned FBF: Learning-Based Functional Bloom Filter for Key-Value Storage

被引:11
作者
Byun, Hayoung [1 ]
Lim, Hyesook [2 ]
机构
[1] Myongji Univ, Dept Elect Engn, Yongin 17058, South Korea
[2] Ewha Womans Univ, Dept Elect & Elect Engn, Seoul 03760, South Korea
基金
新加坡国家研究基金会;
关键词
Data structures; Data models; Programming; Memory management; Indexes; Task analysis; Neural networks; Key-value storage; functional Bloom filter; deep learning; search failure;
D O I
10.1109/TC.2021.3112079
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As a challenging attempt to replace a traditional data structure with a learned model, this paper proposes a learned functional Bloom filter (L-FBF) for a key-value storage. The learned model in the proposed L-FBF learns the characteristics and the distribution of given data and classifies each input. It is shown through theoretical analysis that the L-FBF provides a lower search failure rate than a single FBF in the same memory size, while providing the same semantic guarantees. For model training, character-level neural networks are used with pretrained embeddings. In experiments, four types of different character-level neural networks are trained: a single gated recurrent unit (GRU), two GRUs, a single long short-term memory (LSTM), and a single one-dimensional convolutional neural network (1D-CNN). Experimental results prove the validity of theoretical results, and show that the L-FBF reduces the search failures by 82.8% to 83.9% when compared with a single FBF under the same amount of memory used.
引用
收藏
页码:1928 / 1938
页数:11
相关论文
共 27 条
  • [1] Principal component analysis
    Abdi, Herve
    Williams, Lynne J.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04): : 433 - 459
  • [2] [Anonymous], GloVe: Global Vectors for Word Representation
  • [3] [Anonymous], WEB DIRECTORY
  • [4] [Anonymous], FREE ONLINE DATASET
  • [5] SPACE/TIME TRADE/OFFS IN HASH CODING WITH ALLOWABLE ERRORS
    BLOOM, BH
    [J]. COMMUNICATIONS OF THE ACM, 1970, 13 (07) : 422 - &
  • [6] Beyond bloom filters: From approximate membership checks to approximate state machines
    Bonomi, Flavio
    Mitzenmacher, Michael
    Panigrahy, Rina
    Singh, Sushil
    Varghese, George
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2006, 36 (04) : 315 - 326
  • [7] Broder A., 2004, Internet Mathematics, P485, DOI [10.1080/15427951.2004.10129096, DOI 10.1080/15427951.2004.10129096]
  • [8] Byun H., 2020, IEIE SUMMER C, P707
  • [9] Addition of a Secondary Functional Bloom Filter
    Byun, Hayoung
    Kim, Sohyun
    Yim, Changhoon
    Lim, Hyesook
    [J]. IEEE COMMUNICATIONS LETTERS, 2020, 24 (10) : 2123 - 2127
  • [10] Comparison on Search Failure between Hash Tables and a Functional Bloom Filter
    Byun, Hayoung
    Lim, Hyesook
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (15):