Top-k Similar Graph Matching Using TraM in Biological Networks

被引:14
作者
Amin, Mohammad Shafkat [1 ]
Finley, Russell L., Jr. [2 ]
Jamil, Hasan M. [3 ]
机构
[1] Wayne State Univ, Dept Comp Sci, Sunnyvale, CA 94086 USA
[2] Wayne State Univ, Ctr Mol Med & Genet, Detroit, MI 48201 USA
[3] Univ Idaho, Dept Comp Sci, Moscow, ID 83844 USA
基金
美国国家科学基金会;
关键词
Graphs and networks; knowledge and data engineering tools and techniques; bioinformatics; graph and tree search strategies; biology and genetics; PROTEIN-INTERACTION NETWORKS; INTERACTOME NETWORK; DISEASE; COMPLEXES; TOOL; PRIORITIZATION; PREDICTION; ALGORITHM; PATHWAYS; PATTERNS;
D O I
10.1109/TCBB.2012.90
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Many emerging database applications entail sophisticated graph-based query manipulation, predominantly evident in large-scale scientific applications. To access the information embedded in graphs, efficient graph matching tools and algorithms have become of prime importance. Although the prohibitively expensive time complexity associated with exact subgraph isomorphism techniques has limited its efficacy in the application domain, approximate yet efficient graph matching techniques have received much attention due to their pragmatic applicability. Since public domain databases are noisy and incomplete in nature, inexact graph matching techniques have proven to be more promising in terms of inferring knowledge from numerous structural data repositories. In this paper, we propose a novel technique called TraM for approximate graph matching that off-loads a significant amount of its processing on to the database making the approach viable for large graphs. Moreover, the vector space embedding of the graphs and efficient filtration of the search space enables computation of approximate graph similarity at a throw-away cost. We annotate nodes of the query graphs by means of their global topological properties and compare them with neighborhood biased segments of the data-graph for proper matches. We have conducted experiments on several real data sets, and have demonstrated the effectiveness and efficiency of the proposed method
引用
收藏
页码:1790 / 1804
页数:15
相关论文
共 60 条
[21]  
Hishigaki H, 2001, YEAST, V18, P523, DOI 10.1002/yea.706.abs
[22]   Protein networks in disease [J].
Ideker, Trey ;
Sharan, Roded .
GENOME RESEARCH, 2008, 18 (04) :644-652
[23]  
Jiang HL, 2007, PROC INT CONF DATA, P541
[24]   KEGG for linking genomes to life and the environment [J].
Kanehisa, Minoru ;
Araki, Michihiro ;
Goto, Susumu ;
Hattori, Masahiro ;
Hirakawa, Mika ;
Itoh, Masumi ;
Katayama, Toshiaki ;
Kawashima, Shuichi ;
Okuda, Shujiro ;
Tokimatsu, Toshiaki ;
Yamanishi, Yoshihiro .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D480-D484
[25]   Protein interactions and disease: computational approaches to uncover the etiology of diseases [J].
Kann, Maricel G. .
BRIEFINGS IN BIOINFORMATICS, 2007, 8 (05) :333-346
[26]   Walking the interactome for prioritization of candidate disease genes [J].
Koehler, Sebastian ;
Bauer, Sebastian ;
Horn, Denise ;
Robinson, Peter N. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 82 (04) :949-958
[27]   Detecting conserved interaction patterns in biological networks [J].
Koyuturk, Mehmet ;
Kim, Yohan ;
Subramaniam, Shankar ;
Szpankowski, Wojciech ;
Grama, Ananth .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (07) :1299-1322
[28]   Integrative Network Biology: Graph Prototyping for Co-Expression Cancer Networks [J].
Kugler, Karl G. ;
Mueller, Laurin A. J. ;
Graber, Armin ;
Dehmer, Matthias .
PLOS ONE, 2011, 6 (07)
[29]   A human phenome-interactome network of protein complexes implicated in genetic disorders [J].
Lage, Kasper ;
Karlberg, E. Olof ;
Storling, Zenia M. ;
Olason, Pall I. ;
Pedersen, Anders G. ;
Rigina, Olga ;
Hinsby, Anders M. ;
Tumer, Zeynep ;
Pociot, Flemming ;
Tommerup, Niels ;
Moreau, Yves ;
Brunak, Soren .
NATURE BIOTECHNOLOGY, 2007, 25 (03) :309-316
[30]   A map of the interactome network of the metazoan C-elegans [J].
Li, SM ;
Armstrong, CM ;
Bertin, N ;
Ge, H ;
Milstein, S ;
Boxem, M ;
Vidalain, PO ;
Han, JDJ ;
Chesneau, A ;
Hao, T ;
Goldberg, DS ;
Li, N ;
Martinez, M ;
Rual, JF ;
Lamesch, P ;
Xu, L ;
Tewari, M ;
Wong, SL ;
Zhang, LV ;
Berriz, GF ;
Jacotot, L ;
Vaglio, P ;
Reboul, J ;
Hirozane-Kishikawa, T ;
Li, QR ;
Gabel, HW ;
Elewa, A ;
Baumgartner, B ;
Rose, DJ ;
Yu, HY ;
Bosak, S ;
Sequerra, R ;
Fraser, A ;
Mango, SE ;
Saxton, WM ;
Strome, S ;
van den Heuvel, S ;
Piano, F ;
Vandenhaute, J ;
Sardet, C ;
Gerstein, M ;
Doucette-Stamm, L ;
Gunsalus, KC ;
Harper, JW ;
Cusick, ME ;
Roth, FP ;
Hill, DE ;
Vidal, M .
SCIENCE, 2004, 303 (5657) :540-543