Clustering with partial information

被引:9
作者
Bodlaender, Hans L. [3 ]
Fellows, Michael R. [2 ]
Heggernes, Pinar [1 ]
Mancini, Federico [1 ]
Papadopoulos, Charis [1 ]
Rosamond, Frances [2 ]
机构
[1] Univ Bergen, Dept Informat, N-5020 Bergen, Norway
[2] Univ Newcastle, Off DVC Res, PCRU, Callaghan, NSW 2308, Australia
[3] Univ Utrecht, Dept Informat & Comp Sci, NL-3508 TC Utrecht, Netherlands
关键词
Graphs; Parameterized algorithms; Cluster editing; Kernels; GRAPH; ALGORITHMS;
D O I
10.1016/j.tcs.2009.12.016
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The CORRELATION CLUSTERING problem, also known as the CLUSTER EDITING problem, seeks to edit a given graph by adding and deleting edges to obtain a collection of vertex-disjoint cliques, such that the editing cost is minimized. The EDGE CLIQUE PARTITIONING problem seeks to partition the edges of a given graph into edge-disjoint cliques, such that the number of cliques is minimized. Both problems are known to be NP-hard, and they have been previously studied with respect to approximation and fixed-parameter tractability. In this paper we study these two problems in a more general setting that we term fuzzy graphs, where the input graphs may have missing information, meaning that whether or not there is an edge between some pairs of vertices of the input graph can be undecided. For fuzzy graphs the CORRELATION CLUSTERING and EDGE CLIQUE PARTITIONING problems have previously been studied only with respect to approximation. Here we give parameterized algorithms based on kernelization for both problems. We prove that the CORRELATION CLUSTERING problem is fixed-parameter tractable on fuzzy graphs when parameterized by (k, r), where k is the editing cost and r is the minimum number of vertices required to cover the undecided edges. In particular we show that it has a polynomial-time reduction to a problem kernel on O(k(2) + r) vertices. We provide an analogous result for the EDGE CLIQUE PARTITIONING problem on fuzzy graphs. Using (k, r) as parameters, where k bounds the size of the partition, and r is the minimum number of vertices required to cover the undecided edges, we describe a polynomial-time kernelization to a problem kernel on O(k(4).3(r)) vertices. This implies fixed-parameter tractability for this parameterization. Furthermore we also show that parameterizing only by the number of cliques k, is not enough to obtain fixed-parameter tractability. The problem remains, in fact, NP-hard for each fixed k > 2. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:1202 / 1211
页数:10
相关论文
共 29 条
[1]   Aggregating Inconsistent Information: Ranking and Clustering [J].
Ailon, Nir ;
Charikar, Moses ;
Newman, Alantha .
JOURNAL OF THE ACM, 2008, 55 (05)
[2]   Correlation clustering [J].
Bansal, N ;
Blum, A ;
Chawla, S .
MACHINE LEARNING, 2004, 56 (1-3) :89-113
[3]  
Bansal N, 2002, ANN IEEE SYMP FOUND, P238, DOI 10.1109/SFCS.2002.1181947
[4]   Clustering gene expression patterns [J].
Ben-Dor, A ;
Shamir, R ;
Yakhini, Z .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :281-297
[5]  
BOCKER S, 2009, THEORET COM IN PRESS
[6]  
Bodlaender HL, 2008, LECT NOTES COMPUT SC, V5162, P144
[7]   Fixed-parameter tractability of graph modification problems for hereditary properties [J].
Cai, LZ .
INFORMATION PROCESSING LETTERS, 1996, 58 (04) :171-176
[8]   Clustering with qualitative information [J].
Charikar, M ;
Guruswami, V ;
Wirth, A .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2005, 71 (03) :360-383
[9]   Computing phylogenetic roots with bounded degrees and errors [J].
Chen, ZZ ;
Jiang, T ;
Lin, GH .
SIAM JOURNAL ON COMPUTING, 2003, 32 (04) :864-879
[10]  
DEBRUIJN NG, 1948, INDAG MATH, V10, P421