Classification and analysis of frequent subgraphs mining algorithms

被引:7
作者
Keyvanpour, Mohammad Reza [1 ]
Azizani, Fereshteh [2 ]
机构
[1] Department of Computer Engineering, Alzahra University, Tehran
[2] Department of Computer Engineering, Islamic Azad University, Qazvin Branch, Qazvin
关键词
Data mining; Frequent subgraph; Graph database; Graph mining;
D O I
10.4304/jsw.7.1.220-227
中图分类号
学科分类号
摘要
In recent years, data mining in graphs or graph mining have attracted much attention due to explosive growth in generating graph databases. The graph database is one type of database that consists of either a single large graph or a number of relatively small graphs. Some applications that produce graph database are as follows: Biological networks, semantic web and behavioral modeling. Among all patterns occurring in graph database, mining frequent subgraphs is of great importance. The frequent subgraph is the one that occurs frequently in the graph database. Frequent subgraphs not only are important themselves but also are applicable in other aspects of data analysis and data mining tasks, such as similarity search in graph database, graph clustering, classification, indexing, etc. So far, numerous algorithms have been proposed for mining frequent subgraphs. This study aims to create overall view of the algorithms through the analysis and comparison of their characterizations. To achieve the aim, the existing algorithms are classified based on their graph database and their subgraph generation way. The proposed classification can be effective in choosing applications appropriate algorithms and determination of graph mining new methods in this regard. © 2012 ACADEMY PUBLISHER.
引用
收藏
页码:220 / 227
页数:7
相关论文
共 41 条
[1]  
Agrawal R., Srikant R., Fast Algorithms For Mining Association Rules, Proceeding of the 20th Very Large Dada Base, pp. 487-499, (1994)
[2]  
Borgelt C., Berthold M.R., Mining Molecular Fragments: Finding Relevant Substructures of Molecules, Proceeding of the International Conference On Data Mining, pp. 211-218, (2002)
[3]  
Chakrabarti D., Faloutsos C., Graph Mining: Laws, Generators, and Algorithms, ACM Computing Surveys, pp. 2-69, (2006)
[4]  
Cook J., Holder L., Substructure Discovery Using Minimum Description Length and Background Knowledge, Journal of Artificial Intelligence Research, pp. 231-255, (1994)
[5]  
Damiani E., Oliboni B., Quintarelli E., Tanca L., Modeling Semistructured Data by Using Graph-Based Constraints, Proceedings of OTM Workshops, pp. 22-23, (2003)
[6]  
Dehaspe L., Toivonen H., Discovery of Frequent Datalog Patterns, Data Mining and Knowledge Discovery, pp. 7-36, (1999)
[7]  
Deshpande M., Kuramochi M., Karypis G., Frequent sub-structure-based approaches for classifying chemical compounds, Proceedings of the International Conference On Data Mining (ICDM'03), pp. 35-42, (2003)
[8]  
Doulamis A.D., Doulamis N.D., Kollias N.D., A Pyramidal Graph Representation for Efficient Image Content Description, IEEE International Workshop On Multimedia Signal Processing (MMSP), pp. 109-114, (1999)
[9]  
Fortin S., The Graph Isomorphism Problem, (1996)
[10]  
Garey M.R., Johnson D.S., Computers and Intractability:A Guide to The Theory of NP-Completeness, (1979)