On the Performance of Query Rewriting in Vertically Distributed Cloud Databases

被引:1
作者
Kohler, Jens [1 ,3 ]
Simov, Kiril [1 ]
Fiech, Adrian [2 ]
Specht, Thomas [3 ]
机构
[1] IICT BAS, Linguist Modelling Dept, Sofia, Bulgaria
[2] Mem Univ St Johns, Dept Comp Sci, St John, NF, Canada
[3] Univ Appl Sci Mannheim, Inst Enterprise Comp, Mannheim, Germany
来源
INNOVATIVE APPROACHES AND SOLUTIONS IN ADVANCED INTELLIGENT SYSTEMS | 2016年 / 648卷
关键词
Vertically distributed cloud databases; Query performance; Query rewriting; MODEL;
D O I
10.1007/978-3-319-32207-0_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cloud Computing with its dynamic pay as you go model and scalability characteristics promises computing on demand with associated cost savings compared to traditional computing architectures. This is a promising computing model especially in the context of Big Data. However, renting computing capabilities from a cloud provider means the integration of external resources into the own infrastructure and this requires a great amount of trust and raises new data security and privacy challenges. With respect to these still unsolved problems, this work presents a fixed vertical partitioning and distribution approach that uses traditional relational data models and distributes the corresponding partitions vertically across different cloud providers. So, every cloud provider only gets a small, but defined (and therefore fixed) logically independent chunk of the entire data, which is useless without the other parts. However, the distribution and the subsequent join of the data suffer from great performance losses, which are unbearable in practical usage scenarios. The novelty of our approach combines the well-known vertical database partitioning technique with a distribution logic that stores the vertical partitions at different cloud computing environments. Traditionally, vertical as well as horizontal partitioning approaches are used to improve the access to database data, but these approaches use dynamic and automated partitioning algorithms and schemes based on query workloads, data volumes, network bandwidth, etc. In contrast to this, our approach uses a fixed user-defined vertical partitioning approach, where no two critical attributes of a relation should be stored in a single partition. Thus, our approach aims at improving data security and privacy especially in public Cloud Computing environments, but raises the challenging research question of how to improve the data access to such fixed user-partitioned and distributed database environments. In this paper, we outline a query rewriting approach that parallelizes queries and joins in order to improve the query performance. We implemented our fixed partitioning and distribution approach based on the TPC-W benchmark and we finally present the performance results in this work.
引用
收藏
页码:59 / 73
页数:15
相关论文
共 22 条
  • [1] [Anonymous], 2010, TODS
  • [2] Balkesen C., 2014, PVLDB, V7, P85
  • [3] Bezenek T, 2000, P 3 WORKSH COMP ARCH
  • [4] Answering queries using materialized views with minimum size
    Chirkova, Rada
    Li, Chen
    Li, Jia
    [J]. VLDB JOURNAL, 2006, 15 (03) : 191 - 210
  • [5] CODD EF, 1970, COMMUN ACM, V13, P377, DOI 10.1145/357980.358007
  • [6] Doshi P., 2011, 2011 3rd International Conference on Electronics Computer Technology (ICECT 2011), P145, DOI 10.1109/ICECTECH.2011.5942069
  • [7] Perm: Processing provenance and data on the same data model through query rewriting
    Glavic, Boris
    Alonso, Gustavo
    [J]. ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 174 - +
  • [8] Hachani N., 2006, 17 INT C DAT EXP SYS
  • [9] Answering queries using views: A survey
    Halevy, AY
    [J]. VLDB JOURNAL, 2001, 10 (04) : 270 - 294
  • [10] Hossain M., 2012, PROC 5 EUROPEAN C ST, P1