Assessing the relationship between conservation of function and conservation of sequence using photosynthetic proteins

Citations: 13
Authors
Ashkenazi, Shaul [1]
Snir, Rotem [1]
Ofran, Yanay [1]
Affiliations
[1] Bar Ilan Univ, Goodman Fac Life Sci, IL-52900 Ramat Gan, Israel
Keywords
FUNCTION PREDICTION; GENOME ANNOTATION; DATABASE; METAGENOMICS; ALIGNMENTS; FAMILIES; DOMAINS; MOTIFS; ERRORS; MEME;
DOI
10.1093/bioinformatics/bts608
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
MOTIVATION: Assessing the false positive rate of function prediction methods is difficult, as it is hard to establish that a protein does not have a certain function. To determine to what extent proteins with similar sequences have a common function, we focused on photosynthesis-related proteins. A protein that comes from a non-photosynthetic organism is, undoubtedly, not involved in photosynthesis. RESULTS: We show that function diverges very rapidly: 70% of the close homologs of photosynthetic proteins come from non-photosynthetic organisms. Therefore, high sequence similarity, in most cases, is not tantamount to similar function. However, we found that functionally similar proteins often share short sequence elements, which may correspond to a functional site and could reveal functional similarities more accurately than sequence similarity. CONCLUSIONS: These results shed light on the way biological function is conserved in evolution and may help improve large-scale analysis of protein function.
Pages: 3203-3210
Page count: 8