Demystifying deep learning in predictive monitoring for cloud-native SLOs

被引:1
|
作者
Morichetta, Andrea [1 ]
Pujol, Victor Casamayor [1 ]
Nastic, Stefan [1 ]
Pusztai, Thomas [1 ]
Raith, Philipp [1 ]
Dustdar, Schahram [1 ]
Vij, Deepak [2 ]
Xiong, Ying [2 ]
Zhang, Zhaobo [2 ]
机构
[1] TU Wien, Distributed Syst Grp, Vienna, Austria
[2] Futurewei Technol Inc, Santa Clara, CA USA
关键词
workload prediction; neural networks; cloud; LSTM; Transformers; HOST LOAD PREDICTION; WORKLOAD; MODEL;
D O I
10.1109/CLOUD60044.2023.00013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The complexity inherent in managing cloud computing systems calls for novel solutions that can effectively enforce high-level Service Level Objectives (SLOs) promptly. Unfortunately, most of the current SLO management solutions rely on reactive approaches, i.e., correcting SLO violations only after they have occurred. Further, the few methods that explore predictive techniques to prevent SLO violations focus solely on forecasting low-level system metrics, such as CPU and Memory utilization. Although valid in some cases, these metrics do not necessarily provide clear and actionable insights into application behavior. This paper presents a novel approach that directly predicts high-level SLOs using low-level system metrics. We target this goal by training and optimizing two state-of-the-art neural network models, a Short-Term Long Memory LSTM-, and a Transformer-based model. Our models provide actionable insights into application behavior by establishing proper connections between the evolution of low-level workload-related metrics and the high-level SLOs. We demonstrate our approach to selecting and preparing the data. We show in practice how to optimize LSTM and Transformer by targeting efficiency as a high-level SLO metric and performing a comparative analysis. We show how these models behave when the input workloads come from different distributions. Consequently, we demonstrate their ability to generalize in heterogeneous systems. Finally, we operationalize our two models by integrating them into the Polaris framework we have been developing to enable a performance-driven SLO-native approach to Cloud computing.
引用
收藏
页码:24 / 34
页数:11
相关论文
共 50 条
  • [31] Deep Reinforcement Learning based Cloud-native Network Function Placement in Private 5G Networks
    Kim, Joonwoo
    Lee, Jaewook
    Kim, Taeyun
    Pack, Sangheon
    2020 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2020,
  • [32] Approaches for migrating non cloud-native applications to the cloud
    Shastry, Abhigna L.
    Nair, Devika S.
    Prathima, B.
    Ramya, C. P.
    Hallymysore, Phalachandra
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 632 - 638
  • [33] A New Cloud-Native Tool for Pharmacogenetic Analysis
    Yuan, David Yu
    Park, Jun Hyuk
    Li, Zhenyu
    Thomas, Rohan
    Hwang, David M.
    Fu, Lei
    GENES, 2024, 15 (03)
  • [34] Enhancement of Cloud-native applications with Autonomic Features
    Kosinska, Joanna
    Zielinski, Krzysztof
    JOURNAL OF GRID COMPUTING, 2023, 21 (03)
  • [35] Enabling Cloud-native IoT Device Management
    Nanos, Anastassios
    Plakas, Ioannis
    Ntoutsos, Georgios
    Mainas, Charalampos
    PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON METAOS FOR THE CLOUD-EDGE-IOT CONTINUUM, MECC 2024, 2024, : 42 - 47
  • [36] Cloud-native Deploy-ability: An Analysis of Required Features of Deployment Technologies to Deploy Arbitrary Cloud-native Applications
    Wurster, Michael
    Breitenbuecher, Uwe
    Brogi, Antonio
    Leymann, Frank
    Soldani, Jacopo
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE (CLOSER), 2020, : 171 - 180
  • [37] FortisEDoS: A Deep Transfer Learning-Empowered Economical Denial of Sustainability Detection Framework for Cloud-Native Network Slicing
    Benzaid, Chafika
    Taleb, Tarik
    Sami, Ashkan
    Hireche, Othmane
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2024, 21 (04) : 2818 - 2835
  • [38] Enriching Cloud-native Applications with Sustainability Features
    Vitali, Monica
    Schmiedmayer, Paul
    Bootz, Valentin
    2023 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING, IC2E, 2023, : 21 - 31
  • [39] Autonomic Management Framework for Cloud-Native Applications
    Kosinska, Joanna
    Zielinski, Krzysztof
    JOURNAL OF GRID COMPUTING, 2020, 18 (04) : 779 - 796
  • [40] Enhancement of Cloud-native applications with Autonomic Features
    Joanna Kosińska
    Krzysztof Zieliński
    Journal of Grid Computing, 2023, 21