Big high-dimension data cube designs for hybrid memory systems

被引:2
|
作者
Silva, Rodrigo Rocha [1 ]
Hirata, Celso Massaki [2 ]
Lima, Joubert de Castro [3 ]
机构
[1] Univ Coimbra, Fac Tecnol Sao Paulo, Rua Carlos Barattino,908 Vila Nova Mogilar, BR-08773600 Mogi Das Cruzes, SP, Brazil
[2] Inst Tecnol Aeronaut, Sao Jose Dos Campos, SP, Brazil
[3] Univ Fed Ouro Preto, Ouro Preto, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Multidimensional database; Multidimensional query; Big Data; Data cube; Holistic measure; High dimension; COMPUTATION;
D O I
10.1007/s10115-020-01505-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In Big Data cubes with hundreds of dimensions and billions of tuples, the indexing and query operations are a challenge and the reason is the time-space exponential complexity when a full cube is computed. Therefore, solutions based on RAM may not be practical and the solutions based on hybrid memory (RAM and disk) become viable alternatives. In this paper, we propose a hybrid approach, named bCubing, to index and query high-dimension data cubes with high number of tuples in a single machine and using RAM and disk memory systems. We evaluated bCubing in terms of runtime and memory consumption, comparing it with the Frag-Cubing, HIC and H-Frag approaches. bCubing showed to be faster and used less RAM than Frag-Cubing, HIC and H-Frag. bCubing indexed and allowed to query a data cube with 1.2 billion tuples and 60 dimensions, consuming only 84 GB of RAM, which means 35% less memory than HIC. The complex holistic measures mode and median were computed in multidimensional queries, and bCubing was, on average, 50% faster than HIC.
引用
收藏
页码:4717 / 4746
页数:30
相关论文
共 30 条
  • [1] Big high-dimension data cube designs for hybrid memory systems
    Rodrigo Rocha Silva
    Celso Massaki Hirata
    Joubert de Castro Lima
    Knowledge and Information Systems, 2020, 62 : 4717 - 4746
  • [2] A Hybrid Approach To Processing Big Data Graphs on Memory-Restricted Systems
    Harshvardhan
    West, Brandon
    Fidel, Adam
    Amato, Nancy M.
    Rauchwerger, Lawrence
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 799 - 808
  • [3] Clustering algorithm based on optimal intervals division for high-dimension data streams
    Li, Yinzhao
    Ren, Jiadong
    Hu, Changzheng
    Xu, Lina
    Ren, Jiadong
    ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 783 - +
  • [4] Hybrid Smart Systems for Big Data Analysis
    Kuftinova N.G.
    Ostroukh A.V.
    Karelina M.Y.
    Matyukhina E.N.
    Akhmetzhanova E.U.
    Russian Engineering Research, 2021, 41 (6) : 536 - 538
  • [5] An In-Memory Data-Cube Aware Distributed Data Discovery Across Clouds for Remote Sensing Big Data
    Song, Jie
    Ma, Yan
    Zhang, Zhixin
    Liu, Peng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 4529 - 4548
  • [6] Research of Configurable Hybrid Memory Architecture for Big Data Processing
    Zhou, Hongwei
    Deng, Rangyu
    Feng, Quanyou
    Ni, Xiaoqiang
    Dou, Qiang
    COMPUTER ENGINEERING AND TECHNOLOGY, NCCET 2017, 2018, 600 : 116 - 132
  • [7] A hybrid memory built by SSD and DRAM to support in-memory Big Data analytics
    Zhiguang Chen
    Yutong Lu
    Nong Xiao
    Fang Liu
    Knowledge and Information Systems, 2014, 41 : 335 - 354
  • [8] A hybrid memory built by SSD and DRAM to support in-memory Big Data analytics
    Chen, Zhiguang
    Lu, Yutong
    Xiao, Nong
    Liu, Fang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 335 - 354
  • [9] A new adaptive weighted imbalanced data classifier via improved support vector machines with high-dimension nature
    Qi, Kai
    Yang, Hu
    Hu, Qingyu
    Yang, Dongjun
    KNOWLEDGE-BASED SYSTEMS, 2019, 185
  • [10] Processing Big Data Graphs on Memory-Restricted Systems
    Harshvardhan
    Amato, Nancy M.
    Rauchwerger, Lawrence
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 517 - 518