An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning

被引:199
作者
Zhang, Ji [1 ]
Liu, Yu [1 ]
Zhou, Ke [1 ]
Li, Guoliang [2 ]
Xiao, Zhili [3 ]
Cheng, Bin [3 ]
Xing, Jiashu [3 ]
Wang, Yangtao [1 ]
Cheng, Tianheng [1 ]
Liu, Li [1 ]
Ran, Minwei [1 ]
Li, Zekang [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan, Hubei, Peoples R China
[2] Tsinghua Univ, Beijing, Peoples R China
[3] Tencent Inc, Shenzhen, Guangdong, Peoples R China
来源
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2019年
基金
中国国家自然科学基金;
关键词
D O I
10.1145/3299869.3300085
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Configuration tuning is vital to optimize the performance of database management system (DBMS). It becomes more tedious and urgent for cloud databases (CDB) due to the diverse database instances and query workloads, which make the database administrator (DBA) incompetent. Although there are some studies on automatic DBMS configuration tuning, they have several limitations. Firstly, they adopt a pipelined learning model but cannot optimize the overall performance in an end-to-end manner. Secondly, they rely on large-scale high-quality training samples which are hard to obtain. Thirdly, there are a large number of knobs that are in continuous space and have unseen dependencies, and they cannot recommend reasonable configurations in such high-dimensional continuous space. Lastly, in cloud environment, they can hardly cope with the changes of hardware configurations and workloads, and have poor adaptability. To address these challenges, we design an end-to-end automatic CDB tuning system, CDBTune, using deep reinforcement learning (RL). CDBTune utilizes the deep deterministic policy gradient method to find the optimal configurations in high-dimensional continuous space. CDBTune adopts a try-and-error strategy to learn knob settings with a limited number of samples to accomplish the initial training, which alleviates the difficulty of collecting massive high-quality samples. CDBTune adopts the reward-feedback mechanism in RL instead of traditional regression, which enables end-to-end learning and accelerates the convergence speed of our model and improves efficiency of online tuning. We conducted extensive experiments under 6 different workloads on real cloud databases to demonstrate the superiority of CDBTune. Experimental results showed that CDBTune had a good adaptability and significantly outperformed the state-of-the-art tuning tools and DBA experts.
引用
收藏
页码:415 / 432
页数:18
相关论文
共 57 条
[21]   Merging What's Cracked, Cracking What's Merged: Adaptive Indexing in Main-Memory Column-Stores [J].
Idreos, Stratos ;
Manegold, Stefan ;
Kuno, Harumi ;
Graefe, Goetz .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (09) :586-597
[22]  
Idreos S, 2009, ACM SIGMOD/PODS 2009 CONFERENCE, P297
[23]  
Ilyas Ihab F., 2004, P 2004 ACM SIGMOD IN, P647, DOI [10.1145/1007568.1007641, DOI 10.1145/1007568.1007641]
[24]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[25]  
Kumar S., 2003, Oracle database 10g: The self-managing database
[26]  
Kwan Eva, 2002, P IBM PERF TECHN REP
[27]   Deep learning [J].
LeCun, Yann ;
Bengio, Yoshua ;
Hinton, Geoffrey .
NATURE, 2015, 521 (7553) :436-444
[28]  
Lightstone S., 2004, P 30 INT C VER LARG, P1170
[29]  
Lillicrap TP, 2015, ARXIV150902971
[30]   A Q-Learning Scheme for Fair Coexistence Between LTE and Wi-Fi in Unlicensed Spectrum [J].
Maglogiannis, Vasilis ;
Naudts, Dries ;
Shahid, Adnan ;
Moerman, Ingrid .
IEEE ACCESS, 2018, 6 :27278-27293