A binary linear programming formulation of the graph edit distance

被引:120
作者
Justice, Derek [1 ]
Hero, Alfred [1 ]
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
graph algorithms; similarity measures; structural pattern recognition; graphs and networks; linear programming; continuation (homotopy) methods;
D O I
10.1109/TPAMI.2006.152
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A binary linear programming formulation of the graph edit distance for unweighted, undirected graphs with vertex attributes is derived and applied to a graph recognition problem. A general formulation for editing graphs is used to derive a graph edit distance that is proven to be a metric, provided the cost function for individual edit operations is a metric. Then, a binary linear program is developed for computing this graph edit distance, and polynomial time methods for determining upper and lower bounds on the solution of the binary program are derived by applying solution methods for standard linear programming and the assignment problem. A recognition problem of comparing a sample input graph to a database of known prototype graphs in the context of a chemical information system is presented as an application of the new method. The costs associated with various edit operations are chosen by using a minimum normalized variance criterion applied to pairwise distances between nearest neighbors in the database of prototypes. The new metric is shown to perform quite well in comparison to existing metrics when applied to a database of chemical graphs.
引用
收藏
页码:1200 / 1214
页数:15
相关论文
共 52 条
[1]   A LINEAR-PROGRAMMING APPROACH FOR THE WEIGHTED GRAPH MATCHING PROBLEM [J].
ALMOHAMAD, HA ;
DUFFUAA, SO .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (05) :522-525
[2]  
[Anonymous], 2004, LP SOLVE OPEN SOURCE
[3]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[4]  
[Anonymous], 1989, FUNDAMENTALS PATTERN
[5]  
BERGAMINI P, 2000, P JOINT IAPR INT WOR, P246
[6]  
Boyd S., 2004, CONVEX OPTIMIZATION
[7]   A graph distance metric based on the maximal common subgraph [J].
Bunke, H ;
Shearer, K .
PATTERN RECOGNITION LETTERS, 1998, 19 (3-4) :255-259
[8]  
Bunke H, 2000, INT C PATT RECOG, P117, DOI 10.1109/ICPR.2000.906030
[9]   Error correcting graph matching: On the influence of the underlying cost function [J].
Bunke, H .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (09) :917-922
[10]   MOLECULAR-STRUCTURE COMPARISON PROGRAM FOR IDENTIFICATION OF MAXIMAL COMMON SUBSTRUCTURES [J].
CONE, MM ;
VENKATARAGHAVAN, R ;
MCLAFFERTY, FW .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1977, 99 (23) :7668-7671