Assessing the relationship between conservation of function and conservation of sequence using photosynthetic proteins

Cited by: 12
Authors
Ashkenazi, Shaul [1]
Snir, Rotem [1]
Ofran, Yanay [1]
Affiliations
[1] Bar Ilan Univ, Goodman Fac Life Sci, IL-52900 Ramat Gan, Israel
Keywords
FUNCTION PREDICTION; GENOME ANNOTATION; DATABASE; METAGENOMICS; ALIGNMENTS; FAMILIES; DOMAINS; MOTIFS; ERRORS; MEME;
DOI
10.1093/bioinformatics/bts608
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: Assessing the false positive rate of function prediction methods is difficult, as it is hard to establish that a protein does not have a certain function. To determine to what extent proteins with similar sequences have a common function, we focused on photosynthesis-related proteins. A protein that comes from a non-photosynthetic organism is, undoubtedly, not involved in photosynthesis. Results: We show that function diverges very rapidly: 70% of the close homologs of photosynthetic proteins come from non-photosynthetic organisms. Therefore, high sequence similarity, in most cases, is not tantamount to similar function. However, we found that many functionally similar proteins often share short sequence elements, which may correspond to a functional site and could reveal functional similarities more accurately than sequence similarity. Conclusions: These results shed light on the way biological function is conserved in evolution and may help improve large-scale analysis of protein function.
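The counting argument in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name, the (hit ID, organism) input format, and the toy organism lists are all assumptions made for illustration, and the numbers it produces are not results from the paper. It only shows how, given a set of close homologs and a list of organisms known to be photosynthetic, one might compute the fraction of homologs that cannot be involved in photosynthesis.

```python
from typing import Iterable, Set, Tuple


def non_photosynthetic_fraction(
    hits: Iterable[Tuple[str, str]],
    photosynthetic_organisms: Set[str],
) -> float:
    """Return the fraction of homolog hits whose source organism is
    not in the set of photosynthetic organisms.

    `hits` is an iterable of (hit_id, organism_name) pairs. This input
    format is a hypothetical choice for the sketch, not one taken from
    the paper.
    """
    hits = list(hits)
    if not hits:
        raise ValueError("no homolog hits supplied")
    outside = sum(
        1 for _, organism in hits if organism not in photosynthetic_organisms
    )
    return outside / len(hits)


# Toy, made-up input used only to exercise the function.
toy_hits = [
    ("hit1", "Arabidopsis thaliana"),
    ("hit2", "Escherichia coli"),
    ("hit3", "Homo sapiens"),
    ("hit4", "Synechocystis sp."),
]
toy_photosynthetic = {"Arabidopsis thaliana", "Synechocystis sp."}

fraction = non_photosynthetic_fraction(toy_hits, toy_photosynthetic)
print(f"{fraction:.0%} of toy hits come from non-photosynthetic organisms")
```

Under the abstract's premise, every hit counted here would be a false positive if function were transferred from the photosynthetic query by sequence similarity alone.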
Pages: 3203-3210
Page count: 8