Learned FBF: Learning-Based Functional Bloom Filter for Key-Value Storage

被引:11
作者
Byun, Hayoung [1 ]
Lim, Hyesook [2 ]
机构
[1] Myongji Univ, Dept Elect Engn, Yongin 17058, South Korea
[2] Ewha Womans Univ, Dept Elect & Elect Engn, Seoul 03760, South Korea
基金
新加坡国家研究基金会;
关键词
Data structures; Data models; Programming; Memory management; Indexes; Task analysis; Neural networks; Key-value storage; functional Bloom filter; deep learning; search failure;
D O I
10.1109/TC.2021.3112079
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As a challenging attempt to replace a traditional data structure with a learned model, this paper proposes a learned functional Bloom filter (L-FBF) for a key-value storage. The learned model in the proposed L-FBF learns the characteristics and the distribution of given data and classifies each input. It is shown through theoretical analysis that the L-FBF provides a lower search failure rate than a single FBF in the same memory size, while providing the same semantic guarantees. For model training, character-level neural networks are used with pretrained embeddings. In experiments, four types of different character-level neural networks are trained: a single gated recurrent unit (GRU), two GRUs, a single long short-term memory (LSTM), and a single one-dimensional convolutional neural network (1D-CNN). Experimental results prove the validity of theoretical results, and show that the L-FBF reduces the search failures by 82.8% to 83.9% when compared with a single FBF under the same amount of memory used.
引用
收藏
页码:1928 / 1938
页数:11
相关论文
共 27 条
  • [11] Vectored-Bloom Filter for IP Address Lookup: Algorithm and Hardware Architectures
    Byun, Hayoung
    Li, Qingling
    Lim, Hyesook
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (21):
  • [12] A New Bloom Filter Architecture for FIB Lookup in Named Data Networking
    Byun, Hayoung
    Lim, Hyesook
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (02):
  • [13] Chung J, 2014, CORR, P1
  • [14] Long short-term memory
    Hochreiter, S
    Schmidhuber, J
    [J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
  • [15] Principal component analysis: a review and recent developments
    Jolliffe, Ian T.
    Cadima, Jorge
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2016, 374 (2065):
  • [16] Kim Yoon, 2014, P C EMP METH NAT LAN, P1746, DOI [DOI 10.48550/ARXIV.1408.5882, 10.3115/v1/D14-1181, DOI 10.3115/V1/D14-1181]
  • [17] The Case for Learned Index Structures
    Kraska, Tim
    Beutel, Alex
    Chi, Ed H.
    Dean, Jeffrey
    Polyzotis, Neoklis
    [J]. SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 489 - 504
  • [18] Mitzenmacher M, 2018, ADV NEUR IN, V31
  • [19] Multi-Granularity Locality-Sensitive Bloom Filter
    Qian, Jiangbo
    Zhu, Qiang
    Chen, Huahui
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (12) : 3500 - 3514
  • [20] When Bloom Filters Are No Longer Compact: Multi-Set Membership Lookup for Network Applications
    Qiao, Yan
    Chen, Shigang
    Mo, Zhen
    Yoon, Myungkeun
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2016, 24 (06) : 3326 - 3339