A new approximate query engine based on intelligent capture and fast transformations of granulated data summaries

被引:0
作者
Dominik Ślęzak
Rick Glick
Paweł Betliński
Piotr Synak
机构
[1] University of Warsaw,Institute of Informatics
[2] Independent Consultant,undefined
[3] Independent Consultant,undefined
来源
Journal of Intelligent Information Systems | 2018年 / 50卷
关键词
Big data processing; Approximate query processing; Data summarization; Granular computing; Machine intelligence;
D O I
暂无
中图分类号
学科分类号
摘要
We outline the processes of intelligent creation and utilization of granulated data summaries in the engine aimed at fast approximate execution of analytical SQL statements. We discuss how to use the introduced engine for the purposes of ad-hoc data exploration over large and quickly increasing data collected in a heterogeneous or distributed fashion. We focus on mechanisms that transform input data summaries into result sets representing query outcomes. We also illustrate how our computational principles can be put together with other paradigms of scaling and harnessing data analytics.
引用
收藏
页码:385 / 414
页数:29
相关论文
共 71 条
  • [11] Jermaine C(2005)Fast split selection method and its application in decision tree construction from large databases International Journal of Hybrid Intelligent Systems 2 149-160
  • [12] Cuzzocrea A(2005)Flow graphs and intelligent data analysis Fundamenta Informaticae 64 369-377
  • [13] Saccȧ D(2007)Rudiments of rough sets Information Sciences 177 3-27
  • [14] Gatterbauer W(2005)Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy IEEE Transactions on Pattern Analysis and Machine Intelligence 27 1226-1238
  • [15] Günnemann S(2009)Degrees of conditional (in)dependence: a framework for approximate bayesian networks and examples related to the rough set-based feature selection Information Sciences 179 197-209
  • [16] Koutra D(2013)Two database related interpretations of rough approximations: data organization and query execution Fundamenta Informaticae 127 445-459
  • [17] Faloutsos C(2009)Hive – a warehousing solution over a map-reduce framework Proceedings of the VLDB Endowment 2 1626-1629
  • [18] Gibbons PB(2010)Belief propagation: technical perspective Communications of the ACM 53 94-157
  • [19] Matias Y(2016)A triarchic theory of granular computing Granular Computing 1 145-127
  • [20] Poosala V(1997)Toward a theory of fuzzy information granulation and tts centrality in human reasoning and fuzzy logic Fuzzy Sets and Systems 90 111-65