Machine Learning for Data Management: A System View

被引:4
作者
Li, Guoliang [1 ]
Zhou, Xuanhe [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Beijing, Peoples R China
来源
2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022) | 2022年
关键词
data management; machine learning; TUNING SYSTEM; INDEX;
D O I
10.1109/ICDE53745.2022.00297
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning techniques have been proposed to optimize data management in recent years. Compared with traditional empirical data management, learning-based methods extract knowledge from historical tasks, generalize the extracted knowledge to similar new tasks, and can achieve better performance in many scenarios (e.g., knob tuning, cardinality estimation). However, data management systems require to handle various and dynamic workloads in different scenarios, and there are some challenges in applying machine learning techniques for data management systems. First, with various workloads and hundreds of system metrics, how to select and characterize effective features for data management problems? Second, with diversified machine learning models, how to design the proper models? Third, with various data management requirements, how to validate whether the machine learning models can meet the requirements? In this tutorial, we discuss existing learning-based data management studies and how they solve the above challenges, and provide some future research directions.
引用
收藏
页码:3198 / 3201
页数:4
相关论文
共 57 条
[11]  
github, HTTPSGITHUBCOMTSINGH
[12]   An Autonomous Materialized View Management System with Deep Reinforcement Learning [J].
Han, Yue ;
Li, Guoliang ;
Yuan, Haitao ;
Sun, Ji .
2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, :2159-2164
[13]   Learning a Partitioning Advisor for Cloud Databases [J].
Hilprecht, Benjamin ;
Binnig, Carsten ;
Rohm, Uwe .
SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, :143-157
[14]   DeepDB: Learn from Data, not from Queries! [J].
Hilprecht, Benjamin ;
Schmidt, Andreas ;
Kulessa, Moritz ;
Molina, Alejandro ;
Kersting, Kristian ;
Binnig, Carsten .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (07) :992-1005
[15]  
Idreos S, 2019, CIDR
[16]  
Kipf Andreas, 2019, CIDR
[17]   Magic mirror in my hand, which is the best in the land? An Experimental Evaluation of Index Selection Algorithms [J].
Kossmann, Jan ;
Halfpap, Stefan ;
Jankrift, Marcel ;
Schlosser, Rainer .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (11) :2382-2395
[18]   The Case for Learned Index Structures [J].
Kraska, Tim ;
Beutel, Alex ;
Chi, Ed H. ;
Dean, Jeffrey ;
Polyzotis, Neoklis .
SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, :489-504
[19]  
Kraska Tim, 2019, P CIDR
[20]   Black or White? How to Develop an AutoTuner for Memory-based Analytics [J].
Kunjir, Mayuresh ;
Babu, Shivnath .
SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, :1667-1683