LBFM: Multi-dimensional Membership Index for Block-level Data Skipping

被引:1
|
作者
Wang, Yong [1 ]
Yun, Xiaochun [2 ]
Wang, Xi [1 ]
Wang, Shupeng [1 ]
Wu, Yongshang [3 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] CNCERT CC, Beijing, Peoples R China
[3] Nanjing Univ, Sch Software, Nanjing, Jiangsu, Peoples R China
来源
2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017) | 2017年
关键词
data skipping; membership index; bloom filter; bitmap;
D O I
10.1109/ISPA/IUCC.2017.00056
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data skipping has been a promising technique to reduce data access in query engines. By maintaining metadata for each block of tuples, a query may skip a block if the metadata indicates that the block does not contain relevant data. Obviously, the key factor is how to build effective metadata by extracting representative features of blocks. In this paper, we propose a multi-dimensional index, Layered Bloom Filter Matrix (LBFM), which adopts a recursively layered framework, and represents the matrix as an ordered hierarchy of hashmap and bitmap to compress space consumption instead of space-consuming bit matrix. Additionally, LBFM supports dimension combination cutting, and optimal indexing strategy could be generated according to it, thus the space efficiency could be further improved. We demonstrate time complexity of LBFM, and theoretically prove that LBFM has lower space consumption than Bloom Filter Matrix algorithm. We proto-typed our index technique on Spark SQL. Our experiments on TPC-H and a real-world workload show that LBFM gains significant improvement in aspect of query response time over traditional methods.
引用
收藏
页码:343 / 351
页数:9
相关论文
共 50 条
  • [21] Multi-dimensional aggregation for temporal data
    Bohen, Michael
    Gamper, Johann
    Jensen, Christian S.
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 257 - 275
  • [22] MULTI-DIMENSIONAL INVERSION OF SEISMIC DATA
    FOSTER, DJ
    MOSHER, CC
    INVERSE PROBLEMS, 1988, 4 (01) : 71 - 85
  • [23] Visualization and level-of-detail control for multi-dimensional bioactive chemical data
    Yamazawa, Maiko
    Itoh, Takayuki
    Yamashita, Fumiyoshi
    PROCEEDINGS OF THE 12TH INTERNATIONAL INFORMATION VISUALISATION, 2008, : 11 - +
  • [24] AN EFFICIENT SNAPSHOT INDEXING METHOD FOR BLOCK-LEVEL BACKUP DATA IN REPLICATION SYSTEM
    Wu, Guangjun
    Fang, Binxing
    Yu, Xiangzhan
    Yun, Xiaochun
    Wang, Shupeng
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2011, 7 (02): : 915 - 925
  • [25] Updatable block-level deduplication of encrypted data with efficient auditing in cloud storage
    Dang Qianlong
    Xie Ying
    Li Donghao
    Hu Gongcheng
    The Journal of China Universities of Posts and Telecommunications, 2019, 26 (03) : 56 - 72
  • [26] Block-Level Message-Locked Encryption with Polynomial Commitment for IoT Data
    Huang, Ke
    Zhang, Xiao-Song
    Wang, Xiao-Fen
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2017, 33 (04) : 891 - 905
  • [27] SOME EXAMPLES OF MULTI-DIMENSIONAL INCOMPLETE BLOCK DESIGNS
    CAUSEY, BD
    ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (05): : 1577 - &
  • [28] Symbol-Level Precoding Made Practical for Multi-Level Modulations via Block-Level Rescaling
    Li, Ang
    Liu, Fan
    Liao, Xuewen
    Shen, Yuanjun
    Masouros, Christos
    SPAWC 2021: 2021 IEEE 22ND INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (IEEE SPAWC 2021), 2020, : 71 - 75
  • [29] Updatable block-level deduplication of encrypted data with efficient auditing in cloud storage
    Qianlong D.
    Ying X.
    Donghao L.
    Gongcheng H.
    Journal of China Universities of Posts and Telecommunications, 2019, 26 (03): : 56 - 72
  • [30] Multi-dimensional multi-level optical pickup head
    Yuan, Gaoqiang
    Tan, Wei Lian
    Ng, Lung Tat
    Chuah, Chong Wei
    Chong, Chun Yang
    Lim, Kian Guan
    Chong, Yeng Leong
    Lim, Yang Beng
    Ting, Lee Hou
    Shi, Luping
    Chong, Tow Chong
    JAPANESE JOURNAL OF APPLIED PHYSICS, 2008, 47 (07) : 5933 - 5935