Frequent subgraph mining in outerplanar graphs

被引:27
作者
Horvath, Tamas [1 ,3 ]
Ramon, Jan [2 ]
Wrobel, Stefan [1 ,3 ]
机构
[1] Univ Bonn, Dept Comp Sci 3, D-5300 Bonn, Germany
[2] Katholieke Univ Leuven, Dept Comp Sci, Leuven, Belgium
[3] Fraunhofer Inst IAIS, Schloss Birlinghoven, Sankt Augustin, Germany
关键词
Graph mining; Frequent pattern mining; Algorithms; Complexity; Applications; ISOMORPHISM;
D O I
10.1007/s10618-009-0162-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years there has been an increased interest in frequent pattern discovery in large databases of graph structured objects. While the frequent connected subgraph mining problem for tree datasets can be solved in incremental polynomial time, it becomes intractable for arbitrary graph databases. Existing approaches have therefore resorted to various heuristic strategies and restrictions of the search space, but have not identified a practically relevant tractable graph class beyond trees. In this paper, we consider the class of outerplanar graphs, a strict generalization of trees, develop a frequent subgraph mining algorithm for outerplanar graphs, and show that it works in incremental polynomial time for the practically relevant subclass of well-behaved outerplanar graphs, i.e., which have only polynomially many simple cycles. We evaluate the algorithm empirically on chemo- and bioinformatics applications.
引用
收藏
页码:472 / 508
页数:37
相关论文
共 41 条
[1]  
Agrawal R., 1996, ADV KNOWLEDGE DISCOV, V12, P307, DOI DOI 10.1007/978-3-319-31750-2.
[2]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theory of NP-Completeness
[3]  
[Anonymous], 1974, P 6 ANN ACM S THEORY
[4]   A partial k-arboretum of graphs with bounded treewidth [J].
Bodlaender, HL .
THEORETICAL COMPUTER SCIENCE, 1998, 209 (1-2) :1-45
[5]   Mining molecular fragments: Finding relevant substructures of molecules [J].
Borgelt, C ;
Berthold, MR .
2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, :51-58
[6]  
CALDERS T, 2008, P 2008 IEEE INT C DA, P73
[7]  
CHARTRAND G, 1967, ANN I H POINCARE B, V3, P433
[8]  
Chi Y, 2005, FUND INFORM, V66, P161
[9]   Canonical forms for labelled trees and their applications in frequent subtree mining [J].
Chi, Y ;
Yang, YR ;
Muntz, RR .
KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 8 (02) :203-234
[10]   Substructure Discovery Using Minimum Description Length and Background Knowledge [J].
Cook, Diane J. ;
Holder, Lawrence B. .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1993, 1 :231-255