Machine Learning for Cloud Data Systems: the Progress so far and the Path Forward

被引:3
作者
Jindal, Alekh [1 ]
Interlandi, Matteo [1 ]
机构
[1] Microsoft, Redmond, WA 98052 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021年 / 14卷 / 12期
关键词
BENCHMARKING; QUERIES;
D O I
10.14778/3476311.3476408
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of this tutorial is to educate the audience about the state of the art in ML for cloud data systems, both in research and in practice. The tutorial is divided in two parts: the progress, and the path forward. Part I covers the recent successes in deploying machine learning solutions for cloud data systems. We will discuss the practical considerations taken into account and the progress made at various levels. The goal is to compare and contrast the promise of ML for systems with the ground actually covered in industry. Finally, Part II discusses practical issues of machine learning in the enterprise covering the generation of explanations, model debugging, model deployment, model management, constraints on eyes-on data usage and anonymization, and a discussion of the technical debt that can accrue through machine learning and models in the enterprise.
引用
收藏
页码:3202 / 3205
页数:4
相关论文
共 62 条
  • [1] Adya Atul, 2016, SLICER AUTOSHARDING
  • [2] Ahmed Zeeshan, 2019, SIGKDD
  • [3] Learning-based Query Performance Modeling and Prediction
    Akdere, Mert
    Cetintemel, Ugur
    Riondato, Matteo
    Upfal, Eli
    Zdonik, Stanley B.
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 390 - 401
  • [4] Albrecht C., 2013, USENIX ANN TECHNICAL, P91
  • [5] Alipourfard O, 2017, PROCEEDINGS OF NSDI '17: 14TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, P469
  • [6] Barr Jeff, 2018, NEWPREDICTIVE SCALIN
  • [7] Toward ML-Centric Cloud Platforms
    Bianchini, Ricardo
    Fontoura, Marcus
    Cortez, Eli
    Bonde, Anand
    Muzio, Alexandre
    Constantin, Ana-Maria
    Moscibroda, Thomas
    Magalhaes, Gabriel
    Bablani, Girish
    Russinovich, Mark
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (02) : 50 - 59
  • [8] Chi E. H., 2020, P WORKSHOP ML SYSTEM
  • [9] Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms
    Cortez, Eli
    Bonde, Anand
    Muzio, Alexandre
    Russinovich, Mark
    Fontoura, Marcus
    Bianchini, Ricardo
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES (SOSP '17), 2017, : 153 - 167
  • [10] Crankshaw D, 2017, PROCEEDINGS OF NSDI '17: 14TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, P613