Graph mining for discovering infrastructure patterns in configuration management databases

被引:12
作者
Anchuri, Pranay [1 ]
Zaki, Mohammed J. [1 ]
Barkol, Omer [2 ]
Bergman, Ruth [2 ]
Felder, Yifat [2 ]
Golan, Shahar [2 ]
Sityon, Arik [2 ]
机构
[1] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12180 USA
[2] Technion Israel Inst Technol, HP Labs, IL-32000 Haifa, Israel
关键词
Single graph mining; Frequent subgraphs; Sparse graph mining; Configuration management databases; FREQUENT PATTERNS; ALGORITHM;
D O I
10.1007/s10115-012-0528-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A configuration management database (CMDB) can be considered to be a large graph representing the IT infrastructure entities and their interrelationships. Mining such graphs is challenging because they are large, complex, and multi-attributed and have many repeated labels. These characteristics pose challenges for graph mining algorithms, due to the increased cost of subgraph isomorphism (for support counting) and graph isomorphism (for eliminating duplicate patterns). The notion of pattern frequency or support is also more challenging in a single graph, since it has to be defined in terms of the number of its (potentially, exponentially many) embeddings. We present CMDB-Miner, a novel two-step method for mining infrastructure patterns from CMDB graphs. It first samples the set of maximal frequent patterns and then clusters them to extract the representative infrastructure patterns. We demonstrate the effectiveness of CMDB-Miner on real-world CMDB graphs, as well as synthetic graphs.
引用
收藏
页码:491 / 522
页数:32
相关论文
共 34 条
[1]  
[Anonymous], 5 INT WORKSH MIN LEA
[2]  
[Anonymous], P 35 INT C VER LARG
[3]  
[Anonymous], IEEE T KNOWL DATA EN
[4]  
[Anonymous], 12 PAC AS C KNOWL DI
[5]  
[Anonymous], ICDM P
[6]  
[Anonymous], P WORKSH DAT MIN BIO
[7]  
[Anonymous], IEEE INT C DAT MIN
[8]  
[Anonymous], 1997, Eigenspaces of graphs
[9]  
[Anonymous], SIAM J MATRIX ANAL A
[10]  
[Anonymous], 2011, 15 EUR C PRINC PRACT