Parallel query processing in a polystore

被引:0
|
作者
Pavlos Kranas
Boyan Kolev
Oleksandra Levchenko
Esther Pacitti
Patrick Valduriez
Ricardo Jiménez-Peris
Marta Patiño-Martinez
机构
[1] LeanXcale,
[2] Distributed Systems Lab at Universidad Politécnica de Madrid,undefined
[3] Inria,undefined
[4] University of Montpellier,undefined
[5] CNRS,undefined
[6] LIRMM,undefined
来源
Distributed and Parallel Databases | 2021年 / 39卷
关键词
Database integration; Heterogeneous databases; Distributed and parallel databases; Polystores; Query languages; Query processing;
D O I
暂无
中图分类号
学科分类号
摘要
The blooming of different data stores has made polystores a major topic in the cloud and big data landscape. As the amount of data grows rapidly, it becomes critical to exploit the inherent parallel processing capabilities of underlying data stores and data processing platforms. To fully achieve this, a polystore should: (i) preserve the expressivity of each data store’s native query or scripting language and (ii) leverage a distributed architecture to enable parallel data integration, i.e. joins, on top of parallel retrieval of underlying partitioned datasets. In this paper, we address these points by: (i) using the polyglot approach of the CloudMdsQL query language that allows native queries to be expressed as inline scripts and combined with SQL statements for ad-hoc integration and (ii) incorporating the approach within the LeanXcale distributed query engine, thus allowing for native scripts to be processed in parallel at data store shards. In addition, (iii) efficient optimization techniques, such as bind join, can take place to improve the performance of selective joins. We evaluate the performance benefits of exploiting parallelism in combination with high expressivity and optimization through our experimental validation.
引用
收藏
页码:939 / 977
页数:38
相关论文
共 50 条
  • [1] Parallel query processing in a polystore
    Kranas, Pavlos
    Kolev, Boyan
    Levchenko, Oleksandra
    Pacitti, Esther
    Valduriez, Patrick
    Jimenez-Peris, Ricardo
    Patino-Martinez, Marta
    DISTRIBUTED AND PARALLEL DATABASES, 2021, 39 (04) : 939 - 977
  • [2] Efficient parallel query processing by graph ranking
    Dereniowski, D
    Kubale, M
    FUNDAMENTA INFORMATICAE, 2006, 69 (03) : 273 - 285
  • [3] A novel approach to resource scheduling for parallel query processing on computational grids
    Anastasios Gounaris
    Rizos Sakellariou
    Norman W. Paton
    Alvaro A. A. Fernandes
    Distributed and Parallel Databases, 2006, 19 : 87 - 106
  • [4] A novel approach to resource scheduling for parallel query processing on computational grids
    Gounaris, Anastasios
    Sakellariou, Rizos
    Paton, Norman W.
    Fernandes, Alvaro A. A.
    DISTRIBUTED AND PARALLEL DATABASES, 2006, 19 (2-3) : 87 - 106
  • [5] CBM: A Compact Representation and its Parallel Search for Query Processing on GPU
    Choksuchat, Chidchanok
    Gorlatch, Sergei
    Chantrapornchai, Chantana
    33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 1065 - 1067
  • [6] Efficient Parallel Skyline Query Processing for High-Dimensional Data
    Tang, Mingjie
    Yu, Yongyang
    Aref, Walid G.
    Malluhi, Qutaibah M.
    Ouzzani, Mourad
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (10) : 1838 - 1851
  • [7] Parallel Polyglot Query Processing on Heterogeneous Cloud Data Stores with LeanXcale
    Kolev, Boyan
    Levchenko, Oleksandra
    Pacitti, Esther
    Valduriez, Patrick
    Vilaca, Ricardo
    Goncalves, Rui
    Jimenez-Peris, Ricardo
    Kranas, Pavlos
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1757 - 1766
  • [8] PERFORMANCE ANALYSIS OF PARALLEL OBJECT-ORIENTED QUERY-PROCESSING ALGORITHMS
    THAKORE, AK
    SU, SYW
    DISTRIBUTED AND PARALLEL DATABASES, 1994, 2 (01) : 59 - 100
  • [9] OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures
    Zhang, Shuhao
    He, Jiong
    He, Bingsheng
    Lu, Mian
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (12): : 1374 - 1377
  • [10] Performance evaluation of parallel query processing techniques in object-oriented database
    Wang, YJ
    Wang, YJ
    Hu, SR
    CHINESE JOURNAL OF ELECTRONICS, 2000, 9 (02): : 224 - 228