Computational Analysis and Experimental Validation of Gene Predictions in Toxoplasma gondii

Cited by: 26
Authors
Dybas, Joseph M. [1, 2, 3]
Madrid-Aliste, Carlos J. [1, 2, 3]
Che, Fa-Yun [1, 5, 8]
Nieves, Edward [1, 5, 8]
Rykunov, Dmitry [1, 2, 3]
Angeletti, Ruth Hogue [1, 3, 5, 8]
Weiss, Louis M. [1, 4, 6]
Kim, Kami [1, 6, 7]
Fiser, Andras [1, 2, 3]
Affiliations
[1] Albert Einstein Coll Med, Biodef Prote Res Ctr, Bronx, NY 10467 USA
[2] Albert Einstein Coll Med, Dept Syst & Comput Biol, Bronx, NY USA
[3] Albert Einstein Coll Med, Dept Biochem, Bronx, NY USA
[4] Albert Einstein Coll Med, Dept Pathol, Bronx, NY USA
[5] Albert Einstein Coll Med, Dept Dev & Mol Biol, Bronx, NY USA
[6] Albert Einstein Coll Med, Dept Med, Bronx, NY USA
[7] Albert Einstein Coll Med, Dept Microbiol & Immunol, Bronx, NY USA
[8] Albert Einstein Coll Med, Lab Macromol Anal & Proteom, Bronx, NY USA
DOI
10.1371/journal.pone.0003899
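Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract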
Background: Toxoplasma gondii is an obligate intracellular protozoan that infects 20 to 90% of the population. It can cause both acute and chronic infections, many of which are asymptomatic, and in immunocompromised hosts it can cause fatal infection through reactivation of an asymptomatic chronic infection. Accurately characterizing the T. gondii proteome is an essential step towards understanding the molecular mechanisms that control transitions between life stages and towards identifying candidate drug targets.
Methodology/Principal Findings: We explored the proteome of T. gondii tachyzoites with high-throughput proteomics experiments and by comparison to publicly available cDNA sequence data. Mass spectrometry analysis validated 2,477 gene coding regions with 6,438 possible alternative gene predictions, approximately one third of the T. gondii proteome. The proteomics survey identified 609 proteins that are unique to Toxoplasma when compared with any known species, including other Apicomplexa. Computational analysis identified 787 cases of possible gene duplication events and located at least 6,089 gene coding regions. Commonly used gene prediction algorithms produce very disparate sets of protein sequences, with pairwise overlaps ranging from 1.4% to 12%. Through this experimental and computational exercise we benchmarked gene prediction methods and observed false negative rates of 31 to 43%.
Conclusions/Significance: This study not only provides the largest proteomics exploration of the T. gondii proteome to date, but also illustrates how high-throughput proteomics experiments can elucidate correct gene structures in genomes.
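The abstract reports pairwise overlaps between prediction sets and false negative rates without saying how either was computed. The sketch below shows one plausible way to compute such figures. It assumes two predictions match only when their protein sequences are identical, and it uses the Jaccard index as the overlap measure. The predictor names, the toy sequences, and both function names are hypothetical and do not come from the paper.

```python
from itertools import combinations


def pairwise_overlap(a: set, b: set) -> float:
    """Jaccard index: shared sequences divided by all distinct sequences."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def false_negative_rate(validated: set, predicted: set) -> float:
    """Fraction of experimentally validated sequences the predictor missed."""
    if not validated:
        return 0.0
    return len(validated - predicted) / len(validated)


# Toy protein sequences standing in for real gene predictions (hypothetical).
predictions = {
    "predictor_A": {"MKT", "MLS", "MAA", "MGG"},
    "predictor_B": {"MKT", "MPP", "MQQ"},
    "predictor_C": {"MLS", "MKT", "MRR"},
}
# Sequences assumed to be confirmed by mass spectrometry (hypothetical).
validated = {"MKT", "MLS", "MPP", "MAA"}

# Overlap for every unordered pair of predictors.
overlaps = {
    (x, y): pairwise_overlap(predictions[x], predictions[y])
    for x, y in combinations(sorted(predictions), 2)
}
# False negative rate of each predictor against the validated set.
fn_rates = {
    name: false_negative_rate(validated, seqs)
    for name, seqs in predictions.items()
}

for pair, value in overlaps.items():
    print(pair, f"{value:.1%}")
for name, value in fn_rates.items():
    print(name, f"{value:.1%}")
```

Exact sequence identity is a strict criterion: two predictions for the same locus that differ in a single exon boundary count as distinct, which is one reason overlaps between predictors can come out very low.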
Pages: 12
Related papers
36 entries in total
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Outbreak of toxoplasmosis associated with municipal drinking water [J].
Bowie, WR ;
King, AS ;
Werker, DH ;
Isaac-Renton, JL ;
Bell, A ;
Eng, SB ;
Marion, SA .
LANCET, 1997, 350 (9072) :173-177
[3]   Proteomic analysis of rhoptry organelles reveals many novel constituents for host-parasite interactions in Toxoplasma gondii [J].
Bradley, PJ ;
Ward, C ;
Cheng, SJ ;
Alexander, DL ;
Coller, S ;
Coombs, GH ;
Dunn, JD ;
Ferguson, DJ ;
Sanderson, SJ ;
Wastling, JM ;
Boothroyd, JC .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (40) :34245-34258
[4]  
Carruthers Vern B., 1999, Parasitology International, V48, P1
[5]   Creating a honey bee consensus gene set [J].
Elsik, Christine G. ;
Mackey, Aaron J. ;
Reese, Justin T. ;
Milshina, Natalia V. ;
Roos, David S. ;
Weinstock, George M. .
GENOME BIOLOGY, 2007, 8 (01)
[6]   Proteomics and glycomics analyses of N-glycosylated structures involved in Toxoplasma gondii-host cell interactions [J].
Fauquenoy, Sylvain ;
Morelle, Willy ;
Hovasse, Agnes ;
Bednarczyk, Audrey ;
Slomianny, Christian ;
Schaeffer, Christine ;
Van Dorsselaer, Alain ;
Tomavo, Stanislas .
MOLECULAR & CELLULAR PROTEOMICS, 2008, 7 (05) :891-910
[7]   Pfam: clans, web tools and services [J].
Finn, Robert D. ;
Mistry, Jaina ;
Schuster-Bockler, Benjamin ;
Griffiths-Jones, Sam ;
Hollich, Volker ;
Lassmann, Timo ;
Moxon, Simon ;
Marshall, Mhairi ;
Khanna, Ajay ;
Durbin, Richard ;
Eddy, Sean R. ;
Sonnhammer, Erik L. L. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D247-D251
[8]   ToxoDB: an integrated Toxoplasma gondii database resource [J].
Gajria, Bindu ;
Bahl, Amit ;
Brestelli, John ;
Dommer, Jennifer ;
Fischer, Steve ;
Gao, Xin ;
Heiges, Mark ;
Iodice, John ;
Kissinger, Jessica C. ;
Mackey, Aaron J. ;
Pinney, Deborah F. ;
Roos, David S. ;
Stoeckert, Christian J., Jr. ;
Wang, Haiming ;
Brunk, Brian P. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D553-D556
[9]   Proteogenomic mapping as a complementary method to perform genome annotation [J].
Jaffe, JD ;
Berg, HC ;
Church, GM .
PROTEOMICS, 2004, 4 (01) :59-77
[10]   Toxoplasma gondii infection in the United States: Seroprevalence and risk factors [J].
Jones, JL ;
Kruszon-Moran, D ;
Wilson, M ;
McQuillan, G ;
Navin, T ;
McAuley, JB .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2001, 154 (04) :357-365