SMETANA: Accurate and Scalable Algorithm for Probabilistic Alignment of Large-Scale Biological Networks

被引:71
作者
Sahraeian, Sayed Mohammad Ebrahim [1 ]
Yoon, Byung-Jun [2 ]
机构
[1] Univ Calif, Dept Plant & Microbial Biol, Berkeley, CA USA
[2] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
来源
PLOS ONE | 2013年 / 8卷 / 07期
基金
美国国家科学基金会;
关键词
PROTEIN-INTERACTION NETWORKS; GLOBAL ALIGNMENT; SYSTEMATIC IDENTIFICATION; CONSERVED PATHWAYS; SIMILARITY; DATABASE; YEAST;
D O I
10.1371/journal.pone.0067995
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper we introduce an efficient algorithm for alignment of multiple large-scale biological networks. In this scheme, we first compute a probabilistic similarity measure between nodes that belong to different networks using a semi-Markov random walk model. The estimated probabilities are further enhanced by incorporating the local and the cross-species network similarity information through the use of two different types of probabilistic consistency transformations. The transformed alignment probabilities are used to predict the alignment of multiple networks based on a greedy approach. We demonstrate that the proposed algorithm, called SMETANA, outperforms many state-of-the-art network alignment techniques, in terms of computational efficiency, alignment accuracy, and scalability. Our experiments show that SMETANA can easily align tens of genome-scale networks with thousands of nodes on a personal computer without any difficulty. The source code of SMETANA is available upon request. The source code of SMETANA can be downloaded from http://www.ece.tamu.edu/similar to bjyoon/SMETANA/.
引用
收藏
页数:12
相关论文
共 51 条
  • [31] PINALOG: a novel approach to align protein interaction networks-implications for complex detection and function prediction
    Phan, Hang T. T.
    Sternberg, Michael J. E.
    [J]. BIOINFORMATICS, 2012, 28 (09) : 1239 - 1245
  • [32] Human Protein Reference Database-2009 update
    Prasad, T. S. Keshava
    Goel, Renu
    Kandasamy, Kumaran
    Keerthikumar, Shivakumar
    Kumar, Sameer
    Mathivanan, Suresh
    Telikicherla, Deepthi
    Raju, Rajesh
    Shafreen, Beema
    Venugopal, Abhilash
    Balakrishnan, Lavanya
    Marimuthu, Arivusudar
    Banerjee, Sutopa
    Somanathan, Devi S.
    Sebastian, Aimy
    Rani, Sandhya
    Ray, Somak
    Kishore, C. J. Harrys
    Kanth, Sashi
    Ahmed, Mukhtar
    Kashyap, Manoj K.
    Mohmood, Riaz
    Ramachandra, Y. L.
    Krishna, V.
    Rahiman, B. Abdul
    Mohan, Sujatha
    Ranganathan, Prathibha
    Ramabadran, Subhashri
    Chaerkady, Raghothama
    Pandey, Akhilesh
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D767 - D772
  • [33] Effective Identification of Conserved Pathways in Biological Networks Using Hidden Markov Models
    Qian, Xiaoning
    Yoon, Byung-Jun
    [J]. PLOS ONE, 2009, 4 (12):
  • [34] A Network Synthesis Model for Generating Protein Interaction Network Families
    Sahraeian, Sayed Mohammad Ebrahim
    Yoon, Byung-Jun
    [J]. PLOS ONE, 2012, 7 (08):
  • [35] RESQUE: Network reduction using semi-Markov random walk scores for efficient querying of biological networks
    Sahraeian, Sayed Mohammad Ebrahim
    Yoon, Byung-Jun
    [J]. BIOINFORMATICS, 2012, 28 (16) : 2129 - 2136
  • [36] A Novel Low-Complexity HMM Similarity Measure
    Sahraeian, Sayed Mohammad Ebrahim
    Yoon, Byung-Jun
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (02) : 87 - 90
  • [37] PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences
    Sahraeian, Sayed Mohammad Ebrahim
    Yoon, Byung-Jun
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 (15) : 4917 - 4928
  • [38] The Database of Interacting Proteins: 2004 update
    Salwinski, L
    Miller, CS
    Smith, AJ
    Pettit, FK
    Bowie, JU
    Eisenberg, D
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D449 - D451
  • [39] Conserved patterns of protein interaction in multiple species
    Sharan, R
    Suthram, S
    Kelley, RM
    Kuhn, T
    McCuine, S
    Uetz, P
    Sittler, T
    Karp, RM
    Ideker, T
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (06) : 1974 - 1979
  • [40] Modeling cellular machinery through biological network comparison
    Sharan, R
    Ideker, T
    [J]. NATURE BIOTECHNOLOGY, 2006, 24 (04) : 427 - 433