Complexity issues in vertex-colored graph pattern matching

被引:24
作者
Dondi, Riccardo [1 ]
Fertin, Guillaume [2 ]
Vialette, Stephane [3 ]
机构
[1] Univ Bergamo, Dipartimento Sci Linguaggi Comunicaz & Studi Cult, I-24129 Bergamo, Italy
[2] Univ Nantes, LINA, UMR CNRS 6241, F-44322 Nantes 3, France
[3] Univ Paris Est Marne La Vallee, LIGM, CNRS UMR 8049, 5 Bd Descartes, F-77454 Marne La Vallee, France
关键词
Graph motifs; Vertex-colored graphs; Algorithmic complexity; Parameterized complexity;
D O I
10.1016/j.jda.2010.09.002
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Searching for motifs in graphs has become a crucial problem in the analysis of biological networks. In the context of metabolic network analysis, Lacroix et al. [V. Lacroix, C.G. Fernandes, M.-F. Sagot, IEEE/ACM Transactions on Computational Biology and Bioinformatics 3 (4) (2006) 360-368] introduced the NP-hard general problem of finding occurrences of motifs in vertex-colored graphs, where a motif M is a multiset of colors and an occurrence of M in a vertex-colored graph G, called the target graph, is a subset of vertices that induces a connected graph and the multiset of colors induced by this subset is exactly the motif. Pursuing the line of research pioneered by Lacroix et al. and aiming at dealing with approximate solutions, we consider in this paper the above-mentioned problem in two of its natural optimization forms, referred hereafter as the Min-CC and the Maximum Motif problems. The Min-CC problem seeks for an occurrence of a motif M in a vertex-colored graph G that induces a minimum number of connected components whereas the Maximum Motif problem is concerned with finding a maximum cardinality submotif M subset of M that occurs as a connected motif in G. We prove the Min-CC problem to be APX-hard even in the extremal case where the motif is a set and the target graph is a path. We complement this result by giving a polynomialtime algorithm in case the motif is built upon a fixed number of colors and the target graph is a path. Also, extending [M. Fellows, G. Fertin, D. Hermelin, S. Vialette, in: Proc. 34th International Colloquium on Automata, Languages and Programming (ICALP), Lecture Notes in Computer Science, vol. 4596, Springer, 2007, pp. 340-351], we prove the Min-CC problem to be fixed-parameter tractable when parameterized by the size of the motif, and we give a faster algorithm in case the target graph is a tree. Furthermore, we prove the MIN-CC problem for trees not to be approximable within ratio c logn for some constant c > 0, where n is the order of the target graph, and to be W[2]-hard when parameterized by the number of connected components in the occurrence of the motif. Finally, we give an exact exponential-time algorithm for the Min-CC problem in case the target graph is a tree. We prove that the Maximum Motif problem is APX-hard even in the case where the target graph is a tree of maximum degree 3, the motif is actually a set and each color occurs at most twice in the tree. Next, we strengthen this result by proving that the problem is not approximable within factor 2(log delta n), for any constant delta < 1, unless NP subset of DTIME(2(poly logn)). We complement these results by presenting two fixed-parameter algorithms for the problem, where the parameter is the size of the solution. Finally, we give exact exponential-time algorithms for this problem. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:82 / 99
页数:18
相关论文
共 33 条
[1]   COLOR-CODING [J].
ALON, N ;
YUSTER, R ;
ZWICK, U .
JOURNAL OF THE ASSOCIATION FOR COMPUTING MACHINERY, 1995, 42 (04) :844-856
[2]  
Betzler N, 2008, LECT NOTES COMPUT SC, V5029, P31, DOI 10.1007/978-3-540-69068-9_6
[3]   Fourier Meets Mobius: Fast Subset Convolution [J].
Bjorklund, Andreas ;
Husfeldt, Thore ;
Kaski, Petteri ;
Koivisto, Mikko .
STOC 07: PROCEEDINGS OF THE 39TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, 2007, :67-74
[4]   Complexity results on restricted instances of a paint shop problem for words [J].
Bonsma, P ;
Epping, T ;
Hochstättler, W .
DISCRETE APPLIED MATHEMATICS, 2006, 154 (09) :1335-1343
[5]  
Bonsma P., 2003, 1681 U TWENT DEP APP
[6]   Torque: topology-free querying of protein interaction networks [J].
Bruckner, Sharon ;
Huffner, Falk ;
Karp, Richard M. ;
Shamir, Ron ;
Sharan, Roded .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W106-W108
[7]  
Bruckner S, 2009, LECT NOTES COMPUT SC, V5541, P74, DOI 10.1007/978-3-642-02008-7_6
[8]  
Cormen TH, 2009, INTRO ALGORITHMS
[9]  
Dondi R., 2007, P 10 IT C THEOR COMP, P27
[10]  
Dondi R, 2009, LECT NOTES COMPUT SC, V5577, P221, DOI 10.1007/978-3-642-02441-2_20