Workflow-Aware Automatic Fault Diagnosis for Microservice-Based Applications With Statistics

被引:33
|
作者
Wang, Tao [1 ]
Zhang, Wenbo [1 ]
Xu, Jiwei [2 ]
Gu, Zeyu [3 ]
机构
[1] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R China
[2] Univ Coll Dublin, Sch Comp Sci, Dublin D02 PN40 4, Ireland
[3] Xia Mobile Software Co Ltd, Xia Internet Dept, Beijing 100085, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金; 北京市自然科学基金;
关键词
Fault diagnosis; Time factors; Computer architecture; Software systems; Internet; workflow; microservice; execution traces; statistics; ANOMALY DETECTION; ONLINE;
D O I
10.1109/TNSM.2020.3022028
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Microservice architectures bring many benefits, e.g., faster delivery, improved scalability, and greater autonomy, so they are widely adopted to develop and operate Internet-based applications. How to effectively diagnose the faults of applications with lots of dynamic microservices has become a key to guarantee applications' performance and reliability. As a microservice performs various behaviors in different workflows of processing requests, existing approaches often cannot accurately locate the root cause of an application with interactive microservices in a dynamic deployment environment. We propose a workflow-aware automatic fault diagnosis approach for microservice-based applications with statistics. We characterize traces across microservices with calling trees, and then learn trace patterns as baselines. For the faults affecting the workflows of processing requests, we estimate the workflows' anomaly degrees, and then locate the microservices causing anomalies by comparing the difference between current traces and learned baselines with tree edit distance. For performance anomalies causing significantly increased response time, we employ principal component analysis to extract suspicious microservices with large fluctuation in response time. Finally, we evaluate our approach on three typical microservice-based applications with a series of experiments. The results show that our approach can accurately locate the microservices causing anomalies.
引用
收藏
页码:2350 / 2363
页数:14
相关论文
共 50 条
  • [41] Microservice-Based Architecture for the Integration of Data Backends and Dashboard Applications in the Energy and Environment Domains
    Sidler, Jannik
    Braun, Eric
    Schmitt, Christian
    Schlachter, Thorsten
    Hagenmeyer, Veit
    ADVANCES AND NEW TRENDS IN ENVIRONMENTAL INFORMATICS: A BOGEYMAN OR SAVIOUR FOR THE UN SUSTAINABILITY GOALS?, 2022, : 37 - 48
  • [42] Performance Diagnosis for Microservice-based Systems via Intra-/Inter-trace Analysis
    Liang, Zheheng
    Wu, Guoquan
    Cui, Lei
    Long, Zhenyue
    PROCEEDINGS OF THE 2023 30TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC 2023, 2023, : 604 - 606
  • [43] An artificial intelligence strategy for the deployment of future microservice-based applications in 6G networks
    Ssemakula J.B.
    Gorricho J.-L.
    Kibalya G.
    Serrat-Fernandez J.
    Neural Computing and Applications, 2024, 36 (18) : 10971 - 10997
  • [44] Extending the SEMAT Kernel for the practice of designing and implementing Microservice-based applications using Domain Driven Design
    Ray, Parthasarathi
    Pal, Pinakpani
    2020 IEEE 32ND CONFERENCE ON SOFTWARE ENGINEERING EDUCATION AND TRAINING (CSEE&T), 2020, : 305 - 308
  • [45] Energy-Aware Microservice-Based SaaS Deployment in a Cloud Data Center Using Hybrid Particle Swarm Optimization
    Alzahrani, A.
    Tang, M.
    IEEE ACCESS, 2024, 12 : 140884 - 140899
  • [46] Correction: An artificial intelligence strategy for the deployment of future microservice-based applications in 6G networks
    John Bosco Ssemakula
    Juan-Luis Gorricho
    Godfrey Kibalya
    Joan Serrat-Fernandez
    Neural Computing and Applications, 2025, 37 (10) : 7443 - 7443
  • [47] Heterogeneous Data-Driven Failure Diagnosis for Microservice-Based Industrial Clouds Toward Consumer Digital Ecosystems
    Xu, Yueshen
    Qiu, Zhibo
    Gao, Honghao
    Zhao, Xinkui
    Wang, Lu
    Li, Rui
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 2027 - 2037
  • [48] ModelCoder: A Fault Model based Automatic Root Cause Localization Framework for Microservice Systems
    Cai, Yang
    Han, Biao
    Li, Jie
    Zhao, Na
    Su, Jinshu
    2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
  • [49] PERT-GNN: Latency Prediction for Microservice-based Cloud-Native Applications via Graph Neural Networks
    Tam, Da Sun Handason
    Liu, Yang
    Xu, Huanle
    Xie, Siyue
    Lau, Wing Cheong
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 2155 - 2165
  • [50] SWITCH workbench: A novel approach for the development and deployment of time-critical microservice-based cloud-native applications
    Stefanic, Polona
    Cigale, Matej
    Jones, Andrew C.
    Knight, Louise
    Taylor, Ian
    Istrate, Cristiana
    Suciu, George
    Ulisses, Alexandre
    Stankovski, Vlado
    Taherizadeh, Salman
    Flores Salado, Guadalupe
    Koulouzis, Spiros
    Martin, Paul
    Zhao, Zhiming
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 99 : 197 - 212