Parallel query processing in a polystore

被引:0
|
作者
Pavlos Kranas
Boyan Kolev
Oleksandra Levchenko
Esther Pacitti
Patrick Valduriez
Ricardo Jiménez-Peris
Marta Patiño-Martinez
机构
[1] LeanXcale,
[2] Distributed Systems Lab at Universidad Politécnica de Madrid,undefined
[3] Inria,undefined
[4] University of Montpellier,undefined
[5] CNRS,undefined
[6] LIRMM,undefined
来源
Distributed and Parallel Databases | 2021年 / 39卷
关键词
Database integration; Heterogeneous databases; Distributed and parallel databases; Polystores; Query languages; Query processing;
D O I
暂无
中图分类号
学科分类号
摘要
The blooming of different data stores has made polystores a major topic in the cloud and big data landscape. As the amount of data grows rapidly, it becomes critical to exploit the inherent parallel processing capabilities of underlying data stores and data processing platforms. To fully achieve this, a polystore should: (i) preserve the expressivity of each data store’s native query or scripting language and (ii) leverage a distributed architecture to enable parallel data integration, i.e. joins, on top of parallel retrieval of underlying partitioned datasets. In this paper, we address these points by: (i) using the polyglot approach of the CloudMdsQL query language that allows native queries to be expressed as inline scripts and combined with SQL statements for ad-hoc integration and (ii) incorporating the approach within the LeanXcale distributed query engine, thus allowing for native scripts to be processed in parallel at data store shards. In addition, (iii) efficient optimization techniques, such as bind join, can take place to improve the performance of selective joins. We evaluate the performance benefits of exploiting parallelism in combination with high expressivity and optimization through our experimental validation.
引用
收藏
页码:939 / 977
页数:38
相关论文
共 50 条
  • [41] ZNS - Efficient query processing with ZurichNoSQL
    Stockinger, Kurt
    Bodi, Richard
    Heitz, Jonas
    Weinmann, Thomas
    DATA & KNOWLEDGE ENGINEERING, 2017, 112 : 38 - 54
  • [42] Temporal Query Processing in Social Network
    Chen, Xiaoying
    Zhang, Chong
    Ge, Bin
    Xiao, Weidong
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 49 (02) : 147 - 166
  • [43] Elasticity in Cloud Databases and Their Query Processing
    Graefe, Goetz
    Nica, Anisoara
    Stolze, Knut
    Neumann, Thomas
    Eavis, Todd
    Petrov, Ilia
    Pourabbas, Elaheh
    Fekete, David
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2013, 9 (02) : 1 - 20
  • [44] Approximate Query Processing with Error Guarantees
    Ni, Tianjia
    Sugiura, Kento
    Ishikawa, Yoshiharu
    Lu, Kejing
    BIG-DATA-ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2021, 2022, 13167 : 268 - 278
  • [45] Diversification on big data in query processing
    Zhang, Meifan
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (04)
  • [46] Query Processing for Streaming RDF Data
    Shah, Ruchita
    Pandat, Ami
    Bhise, Minal
    2018 4TH IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (IEEE WIECON-ECE 2018), 2018, : 75 - 78
  • [47] Query Processing in INM Database System
    Hu, Jie
    Fu, Qingchuan
    Liu, Mengchi
    WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2010, 6184 : 525 - 536
  • [48] Processing-in-Memory for Databases: Query Processing and Data Transfer
    Baumstark, Alexander
    Jibril, Muhammad Attahir
    Sattler, Kai-Uwe
    19TH INTERNATIONAL WORKSHOP ON DATA MANAGEMENT ON NEW HARDWARE, DAMON 2023, 2023, : 107 - 111
  • [49] Top-N query:: Query language, distance function, and processing strategies
    Chen, YX
    Meng, WY
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2003, 2762 : 458 - 470
  • [50] Sort-sharing-aware query processing
    Yu Cao
    Ramadhana Bramandia
    Chee-Yong Chan
    Kian-Lee Tan
    The VLDB Journal, 2012, 21 : 411 - 436