Performance Diagnosis in Cloud Microservices Using Deep Learning

被引:18
|
作者
Wu, Li [1 ,2 ]
Bogatinovski, Jasmin [2 ]
Nedelkoski, Sasho [2 ]
Tordsson, Johan [1 ,3 ]
Kao, Odej [2 ]
机构
[1] Elastisys AB, Umea, Sweden
[2] TU Berlin, Distributed & Operating Syst Grp, Berlin, Germany
[3] Umea Univ, Dept Comp Sci, Umea, Sweden
来源
SERVICE-ORIENTED COMPUTING, ICSOC 2020 | 2021年 / 12632卷
基金
欧盟地平线“2020”;
关键词
Performance diagnosis; Root cause analysis; Microservices; Cloud computing; Autoencoder;
D O I
10.1007/978-3-030-76352-7_13
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microservice architectures are increasingly adopted to design large-scale applications. However, the highly distributed nature and complex dependencies of microservices complicate automatic performance diagnosis and make it challenging to guarantee service level agreements (SLAs). In particular, identifying the culprits of a microservice performance issue is extremely difficult as the set of potential root causes is large and issues can manifest themselves in complex ways. This paper presents an application-agnostic system to locate the culprits for microservice performance degradation with fine granularity, including not only the anomalous service from which the performance issue originates but also the culprit metrics that correlate to the service abnormality. Our method first finds potential culprit services by constructing a service dependency graph and next applies an autoencoder to identify abnormal service metrics based on a ranked list of reconstruction errors. Our experimental evaluation based on injection of performance anomalies to a microservice benchmark deployed in the cloud shows that our system achieves a good diagnosis result, with 92% precision in locating culprit service and 85.5% precision in locating culprit metrics.
引用
收藏
页码:85 / 96
页数:12
相关论文
共 50 条
  • [1] Enabling microservices management for Deep Learning applications across the Edge-Cloud Continuum
    Houmani, Zeina
    Balouek-Thomert, Daniel
    Caron, Eddy
    Parashar, Manish
    2021 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2021), 2021, : 137 - 146
  • [2] DeepMRA: An Efficient Microservices Resource Allocation Framework with Deep Reinforcement Learning in the Cloud
    Si, Qi
    Shi, Jilin
    Li, Weiyi
    Lu, Xuesong
    Pu, Peng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14863 : 455 - 466
  • [3] cCube: A Cloud Microservices Architecture for Evolutionary Machine Learning Classification
    Salza, Pasquale
    Hemberg, Erik
    Ferrucci, Filomena
    O'reilly, Una-May
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 137 - 138
  • [4] Extracting microservices from monolithic systems using deep reinforcement learning
    Sellami, Khaled
    Saied, Mohamed Aymen
    EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (01)
  • [5] Learning Predictive Auto scaling Policies for Cloud-hosted Microservices Using Trace-driven Modeling
    Abdullah, Muhammad
    Iqbal, Waheed
    Erradi, Abdelkarim
    Bukhari, Faisal
    11TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2019), 2019, : 119 - 126
  • [6] Microservices vs Serverless: A Performance Comparison on a Cloud-native Web Application
    Fan, Chen-Fu
    Jindal, Anshul
    Gerndt, Michael
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE (CLOSER), 2020, : 204 - 215
  • [7] On revisiting energy and performance in microservices applications: A cloud elasticity-driven approach
    de Nardin, Igor Fontana
    Righi, Rodrigo da Rosa
    Lima Lopes, Thiago Roberto
    da Costa, Cristiano Andre
    Yeom, Heon Young
    Koestler, Harald
    PARALLEL COMPUTING, 2021, 108
  • [8] Seer: Leveraging Big Data to Navigate the Complexity of Performance Debugging in Cloud Microservices
    Gan, Yu
    Zhang, Yanqi
    Hu, Kelvin
    Cheng, Dailun
    He, Yuan
    Pancholi, Meghna
    Delimitrou, Christina
    TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, : 19 - 33
  • [9] A layered framework for root cause diagnosis of microservices
    Bento, Andre
    Correia, Jaime
    Duraes, Joao
    Soares, Joao
    Ribeiro, Luis
    Ferreira, Antonio
    Carreira, Rita
    Araujo, Filipe
    Barbosa, Raul
    2021 IEEE 20TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2021,
  • [10] Sentiment Analysis using Deep Learning in Cloud
    Raza, Muhammad Raheel
    Hussain, Walayat
    Tanyildizi, Erkan
    Varol, Asaf
    9TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS'21), 2021,