Transferring functional annotations of membrane transporters on the basis of sequence similarity and sequence motifs

被引:14
作者
Barghash, Ahmad [1 ]
Helms, Volkhard [1 ]
机构
[1] Univ Saarland, Ctr Bioinformat, D-66041 Saarbrucken, Germany
关键词
Membrane transporter; Functional classification; BLAST; HMMER; MEME; Substrate; TC classification system; Sequence homology; DATABASE; CLASSIFICATION; ARABIDOPSIS; PHOSPHATE; RESOURCE; SYSTEM; PLANTS;
D O I
10.1186/1471-2105-14-343
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Membrane transporters catalyze the transport of small solute molecules across biological barriers such as lipid bilayer membranes. Experimental identification of the transported substrates is very tedious. Once a particular transport mechanism has been identified in one organism, it is thus highly desirable to transfer this information to related transporter sequences in different organisms based on bioinformatics evidence. Results: We present a thorough benchmark at which level of sequence identity membrane transporters from Escherichia coli, Saccharomyces cerevisiae, and Arabidopsis thaliana belong to the same families of the Transporter Classification (TC) system, and at what level these membrane transporters mediate the transport of the same substrate. We found that two membrane transporter sequences from different organisms that are aligned with normalized BLAST expectation value better than E-value 1e-8 are highly likely to belong to the same TC family (F-measure around 90%). Enriched sequence motifs identified by MEME at thresholds below 1e-12 support accurate classification into TC families for about two thirds of the sequences (F-measure 80% and higher). For the comparison of transported substrates, we focused on the four largest substrate classes of amino acids, sugars, metal ions, and phosphate. At similar identity thresholds, the nature of the transported substrates was more divergent (F-measure 40-75% at the same thresholds) than the TC family membership. Conclusions: We suggest an acceptable threshold of 1e-8 for BLAST and HMMER where at least three quarters of the sequences are classified according to the TC system with a reasonably high accuracy. Researchers who wish to apply these thresholds in their studies should multiply these thresholds by the size of the database they search against. Our findings should be useful to those who wish to transfer transporter functional annotations across species.
引用
收藏
页数:11
相关论文
共 36 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   Update on activities at the Universal Protein Resource (UniProt) in 2013 [J].
Apweiler, Rolf ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Alpi, Emanuela ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Casanova, Elisabet Barrera ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chan, Wei Mun ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dimmer, Emily ;
Fazzini, Francesco ;
Gane, Paul ;
Fedotov, Alexander ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Jacobsen, Julius ;
Jones, Rachel ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightingale, Andrew ;
Orchard, Sandra ;
Patient, Samuel ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Sawford, Tony ;
Sehra, Harminder ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D43-D47
[3]   Assessing the relationship between conservation of function and conservation of sequence using photosynthetic proteins [J].
Ashkenazi, Shaul ;
Snir, Rotem ;
Ofran, Yanay .
BIOINFORMATICS, 2012, 28 (24) :3203-3210
[4]  
Bailey T L, 1994, Proc Int Conf Intell Syst Mol Biol, V2, P28
[5]   Combining evidence using p-values: application to sequence homology searches [J].
Bailey, TL ;
Gribskov, M .
BIOINFORMATICS, 1998, 14 (01) :48-54
[6]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[7]   The Transporter Classification (TC) system, 2002 [J].
Busch, W ;
Saier, MH .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 2002, 37 (05) :287-337
[8]   Phylogeny as a guide to structure and function of membrane transport proteins (Review) [J].
Chang, AB ;
Lin, R ;
Studley, WK ;
Tran, CV ;
Saier, MH .
MOLECULAR MEMBRANE BIOLOGY, 2004, 21 (03) :171-181
[9]   Assessing Performance of Orthology Detection Strategies Applied to Eukaryotic Genomes [J].
Chen, Feng ;
Mackey, Aaron J. ;
Vermunt, Jeroen K. ;
Roos, David S. .
PLOS ONE, 2007, 2 (04)
[10]   Saccharomyces Genome Database: the genomics resource of budding yeast [J].
Cherry, J. Michael ;
Hong, Eurie L. ;
Amundsen, Craig ;
Balakrishnan, Rama ;
Binkley, Gail ;
Chan, Esther T. ;
Christie, Karen R. ;
Costanzo, Maria C. ;
Dwight, Selina S. ;
Engel, Stacia R. ;
Fisk, Dianna G. ;
Hirschman, Jodi E. ;
Hitz, Benjamin C. ;
Karra, Kalpana ;
Krieger, Cynthia J. ;
Miyasato, Stuart R. ;
Nash, Rob S. ;
Park, Julie ;
Skrzypek, Marek S. ;
Simison, Matt ;
Weng, Shuai ;
Wong, Edith D. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D700-D705