SVD-phy: improved prediction of protein functional associations through singular value decomposition of phylogenetic profiles

被引:85
作者
Franceschini, Andrea [1 ,2 ]
Lin, Jianyi [3 ]
von Mering, Christian [1 ,2 ]
Jensen, Lars Juhl [4 ]
机构
[1] Univ Zurich, Inst Mol Life Sci, Winterthurerstr 190, CH-8057 Zurich, Switzerland
[2] Swiss Inst Bioinformat, Batiment Genopode, CH-1015 Lausanne, Switzerland
[3] Univ Milan, Dept Comp Sci, Via Comelico 39, I-20135 Milan, Italy
[4] Univ Copenhagen, Ctr Prot Res, Novo Nordisk Fdn, DK-2200 Copenhagen N, Denmark
关键词
IDENTIFICATION; DISCOVERY; PATHWAYS; SYSTEMS;
D O I
10.1093/bioinformatics/btv696
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A successful approach for predicting functional associations between non-homologous genes is to compare their phylogenetic distributions. We have devised a phylogenetic profiling algorithm, SVD-Phy, which uses truncated singular value decomposition to address the problem of uninformative profiles giving rise to false positive predictions. Benchmarking the algorithm against the KEGG pathway database, we found that it has substantially improved performance over existing phylogenetic profiling methods.
引用
收藏
页码:1085 / 1087
页数:3
相关论文
共 16 条
[1]   SIMAP-the database of all-against-all protein sequence similarities and annotations with new interfaces and increased coverage [J].
Arnold, Roland ;
Goldenberg, Florian ;
Mewes, Hans-Werner ;
Rattei, Thomas .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D279-D284
[2]   ProtPhylo: identification of protein-phenotype and protein-protein functional associations via phylogenetic profiling [J].
Cheng, Yiming ;
Perocchi, Fabiana .
NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) :W160-W168
[3]  
Croft D, 2014, NUCLEIC ACIDS RES, V42, pD472, DOI [10.1093/nar/gkt1102, 10.1093/nar/gkz1031]
[4]   Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages [J].
Date, SV ;
Marcotte, EM .
NATURE BIOTECHNOLOGY, 2003, 21 (09) :1055-1062
[5]   Genomic context analysis reveals dense interaction network between vertebrate ultraconserved non-coding elements [J].
Dimitrieva, Slavica ;
Bucher, Philipp .
BIOINFORMATICS, 2012, 28 (18) :I395-I401
[6]   Annotation of bacterial genomes using improved phylogenomic profiles [J].
Enault, F. ;
Suhre, K. ;
Abergel, C. ;
Poirot, O. ;
Claverie, J. -M. .
BIOINFORMATICS, 2003, 19 :i105-i107
[7]   Data, information, knowledge and principle: back to metabolism in KEGG [J].
Kanehisa, Minoru ;
Goto, Susumu ;
Sato, Yoko ;
Kawashima, Masayuki ;
Furumichi, Miho ;
Tanabe, Mao .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D199-D205
[8]   EcoCyc: fusing model organism databases with systems biology [J].
Keseler, Ingrid M. ;
Mackie, Amanda ;
Peralta-Gil, Martin ;
Santos-Zavaleta, Alberto ;
Gama-Castro, Socorro ;
Bonavides-Martinez, Cesar ;
Fulcher, Carol ;
Huerta, Araceli M. ;
Kothari, Anamika ;
Krummenacker, Markus ;
Latendresse, Mario ;
Muniz-Rascado, Luis ;
Ong, Quang ;
Paley, Suzanne ;
Schroeder, Imke ;
Shearer, Alexander G. ;
Subhraveti, Pallavi ;
Travers, Mike ;
Weerasinghe, Deepika ;
Weiss, Verena ;
Collado-Vides, Julio ;
Gunsalus, Robert P. ;
Paulsen, Ian ;
Karp, Peter D. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D605-D612
[9]   Expansion of Biological Pathways Based on Evolutionary Inference [J].
Li, Yang ;
Calvo, Sarah E. ;
Gutman, Roee ;
Liu, Jun S. ;
Mootha, Vamsi K. .
CELL, 2014, 158 (01) :213-225
[10]   NAPP: the Nucleic Acid Phylogenetic Profile Database [J].
Ott, Alban ;
Idali, Anouar ;
Marchais, Antonin ;
Gautheret, Daniel .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D205-D209