Generating Custom Learned Cost Model for Query Optimizer of DBMS

被引:0
作者
Ouared, Abdelkader [1 ]
Amrani, Moussa [2 ]
Schobbens, Pierre-Yves [2 ]
机构
[1] Univ Tiaret, Dept Comp Sci, BP 78 Zaaroura, Tiaret 14000, Algeria
[2] Univ Namur, Fac Comp Sci, NaDI, Rue Grandgagnage 21, B-5000 Namur, Belgium
来源
MODEL-DRIVEN ENGINEERING AND SOFTWARE DEVELOPMENT, MODELSWARD 2023 | 2024年 / 2106卷
关键词
Learned database cost model; Database management system; Meta-Learning; Generic interface; Code generator; METRICS;
D O I
10.1007/978-3-031-66339-0_2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Designers and developers of database management systems (DBMS) build increasingly specialized efficient query optimizer in order to deliver improved performance. This query optimizer uses a database Cost Model (CM) that is embedded inside these DBMSs. Currently, database CMs have to be manually calibrated for each database technology evolution and new requirements which is a costly and error prone process. Moreover, writing CM programs on DBMS distribution code like PostgreSQL and MySQL for specific platforms is hard since this requires the code to be tuned at a low level with architecture of the query optimizer that are fundamentally based on general-purpose programming languages. In this paper, we propose a framework to design and calibrate automatically the cost model of the query optimizer automatically. Our framework provides a code generation that allow user applications to fully generate the cost of database operation of a target DBMS by changing the model specification (e.g. the data primitives of the used data layout, data indexing, and the chosen algorithm of database operations). In addition, to CM code specialization for a specific hardware platform requires a machine learning component that is used as an external tool to automatically calibrate the CM under design. A proof-of-concept prototype is implemented by considering the CM that exists in PostgreSQL optimizer.
引用
收藏
页码:29 / 53
页数:25
相关论文
共 30 条
[1]  
Bellatreche L., 2013, P 17 INT DAT ENG APP, P142
[2]  
Brahimi L, 2016, INT CONF RES CHAL, P315
[3]   Generating custom code for efficient query execution on heterogeneous processors [J].
Bress, Sebastian ;
Koecher, Bastian ;
Funke, Henning ;
Zeuch, Steffen ;
Rabl, Tilmann ;
Markl, Volker .
VLDB JOURNAL, 2018, 27 (06) :797-822
[4]  
Chaudhuri S., 1998, Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. PODS 1998, P34, DOI 10.1145/275487.275492
[5]   Meta-Modelling Meta-Learning [J].
Hartmann, Thomas ;
Moawad, Assaad ;
Schockaert, Cedric ;
Fouquet, Francois ;
Le Traon, Yves .
2019 ACM/IEEE 22ND INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS (MODELS 2019), 2019, :300-305
[6]  
Hilprecht B., 2020, CIDR
[7]   Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction [J].
Hilprecht, Benjamin ;
Binnig, Carsten .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (11) :2361-2374
[8]  
Hilprecht B, 2022, Arxiv, DOI arXiv:2105.00642
[9]  
Karatzoglou A., 2004, Journal of Statistical Software, V11, DOI DOI 10.18637/JSS.V011.I09
[10]  
Kipf A, 2018, Arxiv, DOI arXiv:1809.00677