Sample-Efficient Cardinality Estimation Using Geometric Deep Learning

Cited by: 4
Authors
Reiner, Silvan [1]
Grossniklaus, Michael [1]
Affiliations
[1] Univ Konstanz, Constance, Germany
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023 / Vol. 17 / Issue 04
Keywords
QUERY; OPTIMIZER;
DOI
10.14778/3636218.3636229
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In database systems, accurate cardinality estimation is a cornerstone of effective query optimization. In this context, estimators that use machine learning have shown significant promise. Despite their potential, the effectiveness of these learned estimators strongly depends on their ability to learn from small training sets. This paper presents a novel approach for learned cardinality estimation that addresses this issue by enhancing sample efficiency. We propose a neural network architecture informed by geometric deep learning principles that represents queries as join graphs. Furthermore, we introduce an innovative encoding for complex predicates, treating their encoding as a feature selection problem. Additionally, we devise a regularization term that employs equalities of the relational algebra and three-valued logic, augmenting the training process without requiring additional ground truth cardinalities. We rigorously evaluate our model across multiple benchmarks, examining q-errors, runtimes, and the impact of workload distribution shifts. Our results demonstrate that our model significantly improves the end-to-end runtimes of PostgreSQL, even with cardinalities gathered from as little as 100 query executions.
Pages: 740-752
Page count: 13