Performance Diagnosis in Cloud Microservices Using Deep Learning

被引:18
|
作者
Wu, Li [1 ,2 ]
Bogatinovski, Jasmin [2 ]
Nedelkoski, Sasho [2 ]
Tordsson, Johan [1 ,3 ]
Kao, Odej [2 ]
机构
[1] Elastisys AB, Umea, Sweden
[2] TU Berlin, Distributed & Operating Syst Grp, Berlin, Germany
[3] Umea Univ, Dept Comp Sci, Umea, Sweden
来源
SERVICE-ORIENTED COMPUTING, ICSOC 2020 | 2021年 / 12632卷
基金
欧盟地平线“2020”;
关键词
Performance diagnosis; Root cause analysis; Microservices; Cloud computing; Autoencoder;
D O I
10.1007/978-3-030-76352-7_13
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microservice architectures are increasingly adopted to design large-scale applications. However, the highly distributed nature and complex dependencies of microservices complicate automatic performance diagnosis and make it challenging to guarantee service level agreements (SLAs). In particular, identifying the culprits of a microservice performance issue is extremely difficult as the set of potential root causes is large and issues can manifest themselves in complex ways. This paper presents an application-agnostic system to locate the culprits for microservice performance degradation with fine granularity, including not only the anomalous service from which the performance issue originates but also the culprit metrics that correlate to the service abnormality. Our method first finds potential culprit services by constructing a service dependency graph and next applies an autoencoder to identify abnormal service metrics based on a ranked list of reconstruction errors. Our experimental evaluation based on injection of performance anomalies to a microservice benchmark deployed in the cloud shows that our system achieves a good diagnosis result, with 92% precision in locating culprit service and 85.5% precision in locating culprit metrics.
引用
收藏
页码:85 / 96
页数:12
相关论文
共 50 条
  • [31] Migrating to Cloud-Native Architectures Using Microservices: An Experience Report
    Balalaie, Armin
    Heydarnoori, Abbas
    Jamshidi, Pooyan
    ADVANCES IN SERVICE-ORIENTED AND CLOUD COMPUTING (ESOCC 2015), 2016, 567 : 201 - 215
  • [32] Intelligent microservices autoscaling module using reinforcement learning
    Khaleq, Abeer Abdel
    Ra, Ilkyeun
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (05): : 2789 - 2800
  • [33] Bearing Fault Diagnosis Using Machine Learning and Deep Learning Techniques
    Dhanush, N. Sai
    Ambika, P. S.
    FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 1, CIS 2023, 2024, 868 : 309 - 321
  • [34] Anomaly Detection and Diagnosis for Container-Based Microservices with Performance Monitoring
    Du, Qingfeng
    Xie, Tiandi
    He, Yu
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT IV, 2018, 11337 : 560 - 572
  • [35] Quantitative DevSecOps Metrics for Cloud-Based Web Microservices
    Zhang, Jin Yu
    Zhang, Yuting
    IEEE ACCESS, 2024, 12 : 160317 - 160342
  • [36] Intelligent microservices autoscaling module using reinforcement learning
    Abeer Abdel Khaleq
    Ilkyeun Ra
    Cluster Computing, 2023, 26 : 2789 - 2800
  • [37] Cloud-based deep learning-assisted system for diagnosis of sports injuries
    Wu, Xiaoe
    Zhou, Jincheng
    Zheng, Maoxing
    Chen, Shanwei
    Wang, Dan
    Anajemba, Joseph
    Zhang, Guangnan
    Abdelhaq, Maha
    Alsaqour, Raed
    Uddin, Mueen
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):
  • [38] Resource Scheduling for Offline Cloud Computing Using Deep Reinforcement Learning
    El-Boghdadi, Hatem M.
    Ramadan, Rabie A.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (04): : 54 - 60
  • [39] Detecting epileptic seizures using deep learning with cloud and fog computing
    Rocha, Elisson
    Monteiro, Kayo
    Silva, Emerson
    Santos, Guto Leoni
    Santos, Wylliams
    Endo, Patricia Takako
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING COMPANION (UCC COMPANION), 2018, : 19 - 20
  • [40] CoScal: Multifaceted Scaling of Microservices With Reinforcement Learning
    Xu, Minxian
    Song, Chenghao
    Ilager, Shashikant
    Gill, Sukhpal Singh
    Zhao, Juanjuan
    Ye, Kejiang
    Xu, Chengzhong
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 3995 - 4009