Murphy: Performance Diagnosis of Distributed Cloud Applications

被引:1
|
作者
Harsh, Vipul [1 ,2 ]
Zhou, Wenxuan [2 ]
Ashok, Sachin [1 ]
Mysore, Radhika Niranjan [3 ]
Godfrey, P. Brighten [1 ,2 ]
Banerjee, Sujata [3 ]
机构
[1] Univ Illinois, Chicago, IL 60680 USA
[2] VMware, Palo Alto, CA 94304 USA
[3] VMware Res, Palo Alto, CA USA
来源
PROCEEDINGS OF THE 2023 ACM SIGCOMM 2023 CONFERENCE, SIGCOMM 2023 | 2023年
关键词
performance diagnosis; cyclic dependencies; enterprise networks; microservices;
D O I
10.1145/3603269.3604877
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Modern cloud-based applications have complex inter-dependencies on both distributed application components as well as network infrastructure, making it difficult to reason about their performance. As a result, a rich body of work seeks to automate performance diagnosis of enterprise networks and such cloud applications. However, existing methods either ignore inter-dependencies which results in poor accuracy, or require causal acyclic dependencies which cannot model common enterprise environments. We describe the design and implementation of Murphy, an automated performance diagnosis system, that can work with commonly available telemetry in practical enterprise environments, while achieving high accuracy. Murphy utilizes loosely-defined associations between entities obtained from commonly available monitoring data. Its learning algorithm is based on a Markov Random Field (MRF) that can take advantage of such loose associations to reason about how entities affect each other in the context of a specific incident. We evaluate Murphy in an emulated microservice environment and in real incidents from a large enterprise. Compared to past work, Murphy is able to reduce diagnosis error by approximate to 1.35x in restrictive environments supported by past work, and by >= 4.7x in more general environments.
引用
收藏
页码:438 / 451
页数:14
相关论文
共 50 条
  • [41] ECO: Edge-Cloud Optimization of 5G applications
    Rao, Kunal
    Coviello, Giuseppe
    Hsiung, Wang-Pin
    Chakradhar, Srimat
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 649 - 659
  • [42] Evaluating the Monolithic and the Microservice Architecture Pattern to Deploy Web Applications in the Cloud
    Villamizar, Mario
    Garces, Oscar
    Castro, Harold
    Verano, Mauricio
    Salamanca, Lorena
    Casallas, Rubby
    Gil, Santiago
    2015 10TH COMPUTING COLOMBIAN CONFERENCE (10CCC), 2015, : 583 - 590
  • [43] Performance Diagnosis and Optimization for Hyperledger Fabric
    Zhang, Shenbin
    Hua, Song
    Pi, Bingfeng
    Sun, Jun
    Yamashita, Kazuhiro
    Nomura, Yoshihide
    2020 2ND CONFERENCE ON BLOCKCHAIN RESEARCH & APPLICATIONS FOR INNOVATIVE NETWORKS AND SERVICES (BRAINS), 2020, : 210 - 211
  • [44] Predictive Container Auto-Scaling for Cloud-Native Applications
    Zhao, Hanqing
    Lim, Hyunwoo
    Hanif, Muhammad
    Lee, Choonhwa
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1280 - 1282
  • [45] A Survey on Graph Neural Networks for Microservice-Based Cloud Applications
    Nguyen, Hoa Xuan
    Zhu, Shaoshu
    Liu, Mingming
    SENSORS, 2022, 22 (23)
  • [46] An avatar cloud service based method for supervising and interacting with containerized applications
    Barron-Lugo, J. Armando
    Lopez-Arevalo, Ivan
    Gonzalez-Compean, Jose L.
    Morin-Garcia, Jose C.
    Crespo-Sanchez, Melesio
    Carretero, Jesus
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 269
  • [47] A performance modeling framework for microservices-based cloud infrastructures
    Thiago Felipe da Silva Pinheiro
    Paulo Pereira
    Bruno Silva
    Paulo Maciel
    The Journal of Supercomputing, 2023, 79 : 7762 - 7803
  • [48] Container Mapping and its Impact on Performance in Containerized Cloud Environments
    Ambrosino, Gaia
    Fioccola, Giovanni B.
    Canonico, Roberto
    Ventre, Giorgio
    2020 14TH IEEE INTERNATIONAL CONFERENCE ON SERVICE-ORIENTED SYSTEM ENGINEERING (SOSE 2020), 2020, : 57 - 64
  • [49] A performance modeling framework for microservices-based cloud infrastructures
    Pinheiro, Thiago Felipe da silva
    Pereira, Paulo
    Silva, Bruno
    Maciel, Paulo
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (07) : 7762 - 7803
  • [50] Toward the Observability of Cloud-Native Applications: The Overview of the State-of-the-Art
    Kosinska, Joanna
    Balis, Bartosz
    Konieczny, Marek
    Malawski, Maciej
    Zielinski, Slawomir
    IEEE ACCESS, 2023, 11 : 73036 - 73052