Graph-based root cause analysis for service-oriented and microservice architectures

被引:69
作者
Brandon, Alvaro [1 ]
Sole, Marc [2 ]
Huelamo, Alberto [2 ]
Solans, David [2 ]
Perez, Maria S. [1 ]
Muntes-Mulero, Victor [2 ]
机构
[1] Univ Politecn Madrid, Madrid, Spain
[2] CA Technol, Barcelona, Spain
基金
欧盟地平线“2020”;
关键词
SOA; Microservices; Root Cause Analysis; Containers; Graphs; DIAGNOSIS;
D O I
10.1016/j.jss.2019.110432
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Service-oriented architectures and microservices define two ways of designing software with the aim of dividing an application into loosely-coupled services that communicate among each other. This translates into rapid development, where each service is developed and deployed by small teams, enabling continuous shipping of new features and fast-evolving applications. However, the underlying complexity of this type of architecture can hinder observability and maintenance by the user. In particular, identifying the root cause of an anomaly detected in the application can be a difficult and time-consuming task, considering the numerous services and connections to be examined. In this work, we present a root cause analysis framework, based on graph representations of these architectures. The graphs can be used to compare any anomalous situation that happens in the system with a library of anomalous graphs that serves as a knowledge base for the user troubleshooting those anomalies. We use the Grid'5000 testbed to deploy three different architectures and inject a set of anomalies. The results show how our graph-based approach is 19.41% more effective than a machine learning method that does not take into account the relationship between elements. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页数:17
相关论文
共 59 条
[1]   Graph based anomaly detection and description: a survey [J].
Akoglu, Leman ;
Tong, Hanghang ;
Koutra, Danai .
DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (03) :626-688
[2]  
Akoglu Leman., 2013, Proceedings of the 2013 SIAM International Conference on Data Mining, P37
[3]  
[Anonymous], MEASUREMENTS
[4]  
[Anonymous], INF SYST
[5]  
[Anonymous], 2015, Using Docker: Developing and Deploying Software with Containers
[6]  
[Anonymous], 2003, ICML
[7]  
[Anonymous], ARXIVCS0309030
[8]  
[Anonymous], ARXIV170906686
[9]  
[Anonymous], P NETD 0 1
[10]  
[Anonymous], 2012, P 9 USENIX C NETW SY