Accurate Quantification of Functional Analogy among Close Homologs

被引:25
作者
Chikina, Maria D. [1 ]
Troyanskaya, Olga G. [2 ,3 ]
机构
[1] Princeton Univ, Dept Mol Biol, Princeton, NJ 08544 USA
[2] Princeton Univ, Lewis Sigler Inst Integrat Genom, Princeton, NJ 08544 USA
[3] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
关键词
PROTEIN-INTERACTION NETWORKS; GLOBAL ALIGNMENT; GENE-EXPRESSION; LAMIN-C; SNAP-25; DROSOPHILA; GENOME; IDENTIFICATION; ORGANIZATION; ANNOTATION;
D O I
10.1371/journal.pcbi.1001074
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Correctly evaluating functional similarities among homologous proteins is necessary for accurate transfer of experimental knowledge from one organism to another, and is of particular importance for the development of animal models of human disease. While the fact that sequence similarity implies functional similarity is a fundamental paradigm of molecular biology, sequence comparison does not directly assess the extent to which two proteins participate in the same biological processes, and has limited utility for analyzing families with several parologous members. Nevertheless, we show that it is possible to provide a cross-organism functional similarity measure in an unbiased way through the exclusive use of high-throughput gene-expression data. Our methodology is based on probabilistic cross-species mapping of functionally analogous proteins based on Bayesian integrative analysis of gene expression compendia. We demonstrate that even among closely related genes, our method is able to predict functionally analogous homolog pairs better than relying on sequence comparison alone. We also demonstrate that the landscape of functional similarity is often complex and that definitive "functional orthologs" do not always exist. Even in these cases, our method and the online interface we provide are designed to allow detailed exploration of sources of inferred functional similarity that can be evaluated by the user.
引用
收藏
页数:11
相关论文
共 55 条
[1]   Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (06) :3351-3356
[2]   Systematic identification of functional orthologs based on protein network comparison [J].
Bandyopadhyay, S ;
Sharan, R ;
Ideker, T .
GENOME RESEARCH, 2006, 16 (03) :428-435
[3]  
Barrett T, 2005, NUCLEIC ACIDS RES, V33, pD562
[4]   Cross-species analysis of biological networks by Bayesian alignment [J].
Berg, Johannes ;
Lassig, Michael .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (29) :10967-10972
[5]   Assessing functional annotation transfers with inter-species conserved coexpression: application to Plasmodium falciparum [J].
Brehelin, Laurent ;
Florent, Isabelle ;
Gascuel, Olivier ;
Marechal, Eric .
BMC GENOMICS, 2010, 11
[6]   The GRID: The General Repository for Interaction Datasets [J].
Breitkreutz, BJ ;
Stark, C ;
Tyers, M .
GENOME BIOLOGY, 2003, 4 (03)
[7]   Yeast Two-Hybrid, a Powerful Tool for Systems Biology [J].
Brueckner, Anna ;
Polge, Cecile ;
Lentze, Nicolas ;
Auerbach, Daniel ;
Schlattner, Uwe .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2009, 10 (06) :2763-2788
[8]   Organization of the secretory machinery in the rodent brain: distribution of the t-SNAREs, SNAP-25 and SNAP-23 [J].
Chen, D ;
Minger, SL ;
Honer, WG ;
Whiteheart, SW .
BRAIN RESEARCH, 1999, 831 (1-2) :11-24
[9]   SNAP-23 functions in docking/fusion of granules at low Ca2+ [J].
Chieregatti, E ;
Chicka, MC ;
Chapman, ER ;
Baldini, G .
MOLECULAR BIOLOGY OF THE CELL, 2004, 15 (04) :1918-1930
[10]   Global Prediction of Tissue-Specific Gene Expression and Context-Dependent Gene Networks in Caenorhabditis elegans [J].
Chikina, Maria D. ;
Huttenhower, Curtis ;
Murphy, Coleen T. ;
Troyanskaya, Olga G. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (06)