A new approximate query engine based on intelligent capture and fast transformations of granulated data summaries

被引:0
作者
Dominik Ślęzak
Rick Glick
Paweł Betliński
Piotr Synak
机构
[1] University of Warsaw,Institute of Informatics
[2] Independent Consultant,undefined
[3] Independent Consultant,undefined
来源
Journal of Intelligent Information Systems | 2018年 / 50卷
关键词
Big data processing; Approximate query processing; Data summarization; Granular computing; Machine intelligence;
D O I
暂无
中图分类号
学科分类号
摘要
We outline the processes of intelligent creation and utilization of granulated data summaries in the engine aimed at fast approximate execution of analytical SQL statements. We discuss how to use the introduced engine for the purposes of ad-hoc data exploration over large and quickly increasing data collected in a heterogeneous or distributed fashion. We focus on mechanisms that transform input data summaries into result sets representing query outcomes. We also illustrate how our computational principles can be put together with other paradigms of scaling and harnessing data analytics.
引用
收藏
页码:385 / 414
页数:29
相关论文
共 71 条
  • [1] Abadi D(2013)The design and implementation of modern column-oriented database systems Foundations and Trends in Databases 5 197-280
  • [2] Boncz P(2003)The knowledge grid Communications of the ACM 46 89-93
  • [3] Harizopoulos S(2012)Synopses for massive data: samples, histograms, wavelets, sketches Foundations and Trends in Databases 4 1-294
  • [4] Idreos S(2013)Exploiting compression and approximation paradigms for effective and efficient online analytical processing over sensor network readings in data grid environments Concurrency and Computation: Practice and Experience 25 2016-2035
  • [5] Madden S(2015)Linearized and single-pass belief propagation Proceedings of the VLDB Endowment 8 581-592
  • [6] Cannataro M(2002)Fast incremental maintenance of approximate histograms ACM Transactions on Database Systems 27 261-298
  • [7] Talia D(2007)Learning multiple layers of representation Trends in Cognitive Sciences 11 428-434
  • [8] Cormode G(2013)Granular computing based on Gaussian cloud transformation Fundamenta Informaticae 127 385-398
  • [9] Garofalakis MN(2015)A handbook for building an approximate query engine IEEE Data Engineering Bulletin 38 3-29
  • [10] Haas PJ(2008)Delay aware querying with seaweed The VLDB Journal 17 315-331