No DBA? No Regret! Multi-Armed Bandits for Index Tuning of Analytical and HTAP Workloads With Provable Guarantees

被引:4
作者
Perera, R. Malinga [1 ]
Oetomo, Bastian [1 ]
Rubinstein, Benjamin I. P. [1 ]
Borovica-Gajic, Renata [1 ]
机构
[1] Univ Melbourne, Parkville, Vic 3010, Australia
基金
澳大利亚研究理事会;
关键词
Indexes; Databases; Tuning; Physical design; Costs; Design tools; Uncertainty; HTAP; index tuning; multi-armed bandits; physical design tuning; reinforcement learning; SELECTION; DATABASE;
D O I
10.1109/TKDE.2023.3271664
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automating physical database design has remained a long-term interest in database research due to substantial performance gains afforded by optimised structures. Despite significant progress, a majority of today's commercial solutions are highly manual, requiring offline invocation by database administrators (DBAs). This status quo is untenable: identifying representative static workloads is no longer realistic; and physical design tools remain susceptible to the query optimiser's cost misestimates. Furthermore, modern application environments like hybrid transactional and analytical processing (HTAP) systems render analytical modelling next to impossible. We propose a self-driving approach to online index selection that does not depend on the DBA and query optimiser, and instead learns the benefits of viable structures through strategic exploration and direct performance observation. We view the problem as one of sequential decision making under uncertainty, specifically within the bandit learning setting. Multi-armed bandits balance exploration and exploitation to provably guarantee average performance that converges to policies that are optimal with perfect hindsight. Our comprehensive empirical evaluation against a state-of-the-art commercial tuning tool demonstrates up to 75% speed-up in analytical processing environments and 59% speed-up in HTAP environments. Lastly, our bandit framework outperforms a Monte Carlo tree search (MCTS)-based database optimiser, providing up to 24% speed-up.
引用
收藏
页码:12855 / 12872
页数:18
相关论文
共 47 条
[1]  
Abbasi-Yadkori Y., 2011, P ADV NEUR INF PROC, V24
[2]  
Aboulnaga A, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P181, DOI 10.1145/304181.304198
[3]  
Agrawal Sanjay, 2004, P 30 INT C VER LARG, P1110
[4]  
[Anonymous], 2009, Star schema benchmark
[5]  
[Anonymous], 2012, Proceedings of the 2012 international conference on Management of Data, DOI [10.1145/2213836.2213864, DOI 10.1145/2213836.2213864]
[6]  
Appuswamy R., 2017, CIDR
[7]   Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads [J].
Arulraj, Joy ;
Pavlo, Andrew ;
Menon, Prashanth .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :583-598
[8]   Optimal Column Layout for Hybrid Workloads [J].
Athanassoulis, Manos ;
Bogh, Kenneth S. ;
Idreos, Stratos .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (13) :2393-2407
[9]  
Borovica R., 2012, P 5 INT WORKSH TEST, P1
[10]   Smooth Scan: robust access path selection without cardinality estimation [J].
Borovica-Gajic, Renata ;
Idreos, Stratos ;
Ailamaki, Anastasia ;
Zukowski, Marcin ;
Fraser, Campbell .
VLDB JOURNAL, 2018, 27 (04) :521-545