AI Meets AI: Leveraging Query Executions to Improve Index Recommendations

被引:83
作者
Ding, Bailu [1 ]
Das, Sudipto [1 ]
Marcus, Ryan [1 ,2 ]
Wu, Wentao [1 ]
Chaudhuri, Surajit [1 ]
Narasayya, Vivek R. [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Brandeis Univ, Waltham, MA 02254 USA
来源
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2019年
关键词
Automated indexing; autonomous database management; performance tuning; relational database-as-a-service; DESIGN;
D O I
10.1145/3299869.3324957
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
State-of-the-art index tuners rely on query optimizer's cost estimates to search for the index configuration with the largest estimated execution cost improvement'. Due to well-known limitations in optimizer's estimates, in a significant fraction of cases, an index estimated to improve a query's execution cost, e.g., CPU time, makes that worse when implemented. Such errors are a major impediment for automated indexing in production systems. We observe that comparing the execution cost of two plans of the same query corresponding to different index configurations is a key step during index tuning. Instead of using optimizer's estimates for such comparison, our key insight is that formulating it as a classification task in machine learning results in significantly higher accuracy. We present a study of the design space for this classification problem. We further show how to integrate this classifier into the state-of-the-art index tuners with minimal modifications, i.e., how artificial intelligence (AI) can benefit automated indexing (AI). Our evaluation using industry-standard benchmarks and a large number of real customer workloads demonstrates up to 5x reduction in the errors in identifying the cheaper plan in a pair, which eliminates almost all query execution cost regressions when the model is used in index tuning.
引用
收藏
页码:1241 / 1258
页数:18
相关论文
共 65 条
[61]   Automatic Database Management System Tuning Through Large-scale Machine Learning [J].
Van Aken, Dana ;
Pavlo, Andrew ;
Gordon, Geoffrey J. ;
Zhang, Bohan .
SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, :1009-1024
[62]  
Weikum G., 2002, Proceedings of the Twenty-eighth International Conference on Very Large Data Bases, P20
[63]   Sampling-Based Query Re-Optimization [J].
Wu, Wentao ;
Naughton, Jeffrey F. ;
Singh, Harneet .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :1721-1736
[64]  
Wu WT, 2013, PROC INT CONF DATA, P1081, DOI 10.1109/ICDE.2013.6544899
[65]   Tree2Vector: Learning a Vectorial Representation for Tree-Structured Data [J].
Zhang, Haijun ;
Wang, Shuang ;
Xu, Xiaofei ;
Chow, Tommy W. S. ;
Wu, Q. M. Jonathan .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) :5304-5318