Query grouping-based multi-query optimization framework for interactive SQL query engines on Hadoop

被引:3
|
作者
Chen, Ling [1 ,2 ]
Lin, Yan [1 ]
Wang, Jingchang [3 ]
Huang, Heqing [1 ]
Chen, Donghui [1 ]
Wu, Yong [3 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China
[2] Alibaba Zhejiang Univ, Joint Inst Frontier Technol, Hangzhou 310027, Zhejiang, Peoples R China
[3] Zhejiang Hongcheng Comp Syst Co Ltd, Hangzhou 310053, Zhejiang, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
grouping method; Impala system; multi-query optimization; GENETIC ALGORITHM;
D O I
10.1002/cpe.4676
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In the past few years, executing high-concurrency queries with interactive SQL query engines on Hadoop has become an important activity for many organizations. However, these systems do not adopt Multi-Query Optimization (MQO) to accelerate the process. There are two major concerns. Firstly, traditional MQO researches assume that multiple queries have high similarity. However, these systems usually serve a variety of applications. Although queries from the same application have high similarity, queries from different applications may have low similarity, so using traditional MQO will be inefficient and time consuming. Secondly, integrating MQO may lead to lots of system modifications. To integrate MQO into interactive SQL query engines on Hadoop efficiently, a query grouping-based MQO framework is proposed. A lightweight mechanism is used to represent SQL queries, on which a grouping method is exploited to speed up the optimization process. A cost model is integrated to estimate the execution cost of interactive SQL query engines on Hadoop. By using the proposed framework, we modify Impala system to support MQO, and the experimental results on TPC-DS show significant performance improvements.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Multi-query Verifiable PIR and Its Application
    Hayashi, Ryuya
    Hayata, Junichiro
    Hara, Keisuke
    Nomura, Kenta
    Kamizono, Masaki
    Hanaoka, Goichiro
    CRYPTOLOGY AND NETWORK SECURITY, PT II, CANS 2024, 2025, 14906 : 166 - 190
  • [42] Preventing Multi-query Attack in Location-based Services
    Talukder, Nilothpal
    Ahamed, Sheikh Iqbal
    WISEC 10: PROCEEDINGS ON THE THIRD ACM CONFERENCE ON WIRELESS NETWORK SECURITY, 2010, : 25 - 35
  • [43] Multi query optimization using query pack trees
    Dekeyser, S
    XML-BASED DATA MANAGEMENT AND MULTIMEDIA ENGINEERING-EDBT 2002 WORKSHOPS, 2002, 2490 : 544 - 554
  • [44] Multi-Query Optimization of Incrementally Evaluated Sliding-Window Aggregations
    Shein, Anatoli U.
    Chrysanthis, Panos K.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3899 - 3911
  • [45] A Distributed Engine for Multi-query Processing Based on Predicates with Spark
    Zhang, Bin
    Sun, Ximin
    Bi, Liwei
    Zhao, Changhao
    Chen, Xin
    Li, Xin
    Sun, Lei
    WEB AND BIG DATA, 2021, 1505 : 27 - 36
  • [46] Continuous multi-query optimization for subgraph matching over dynamic graphs
    Wang, Xi
    Zhang, Qianzhen
    Guo, Deke
    Zhao, Xiang
    SEMANTIC WEB, 2022, 13 (04) : 601 - 622
  • [47] SDCS : Secure Data Centric Sensor Networks with Multi-query Optimization
    Tanuja, R.
    Sukeerthi, B. J.
    Raju, Apoorva
    Manjula, S. H.
    Venugopal, K. R.
    Patnaik, L. M.
    2013 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2013,
  • [48] Minimizing the Make Span of Diagnostic Multi-Query Graphs Using Graph Pruning and Query Merging
    Tabassam, Nadra
    Obermaisser, Roman
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2018, : 368 - 375
  • [49] On query completion in web search engines based on query stream mining
    Barouni-Ebrahimi, M.
    Ghorbani, Ali A.
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 317 - 320
  • [50] A Graph-Based Framework for Analyzing SQL Query Logs
    Wahl, Andreas M.
    Endler, Gregor
    Schwab, Peter K.
    Rith, Julian
    Herbst, Sebastian
    Lenz, Richard
    GRADES-NDA '18: PROCEEDINGS OF THE 1ST ACM SIGMOD JOINT INTERNATIONAL WORKSHOP ON GRAPH DATA MANAGEMENT EXPERIENCES & SYSTEMS (GRADES) AND NETWORK DATA ANALYTICS (NDA) 2018 (GRADES-NDA 2018), 2018,