Digital Gene Expression Analysis Based on Integrated De Novo Transcriptome Assembly of Sweet Potato [Ipomoea batatas (L.) Lam.]

Cited by: 153
Authors
Tao, Xiang [1]
Gu, Ying-Hong [1]
Wang, Hai-Yan [1]
Zheng, Wen [1]
Li, Xiao [1]
Zhao, Chuan-Wu [1]
Zhang, Yi-Zheng [1]
Affiliations
[1] Sichuan Univ, Key Lab Bioresources & Ecoenvironm, Minist Educ, Coll Life Sci, Sichuan Key Lab Mol Biol & Biotechnol, Ctr Funct G, Chengdu 610064, Sichuan, Peoples R China
Keywords
RNA-SEQ DATA; SEQUENCING ANALYSIS; GENOME SEQUENCE; CODON USAGE; SHORT READS; GENERATION; PROTEIN; TOOL; DISCOVERY; ALIGNMENT;
DOI
10.1371/journal.pone.0036234
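Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification codes
07; 0710; 09
Abstract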
Background: Sweet potato [Ipomoea batatas (L.) Lam.] ranks among the six most important food crops in the world. It is grown worldwide and offers high and stable yield, strong adaptability, rich nutrient content, and multiple uses. However, little is known about the molecular biology of this important non-model organism because genomic resources are lacking. Studies based on high-throughput sequencing are therefore needed to build a comprehensive, integrated genomic resource and to better understand gene expression patterns in different tissues and at various developmental stages.
Methodology/Principal Findings: Illumina paired-end (PE) RNA sequencing generated 48.7 million 75 bp PE reads. Using a combined assembly strategy, these reads were assembled de novo into 128,052 transcripts (>= 100 bp), corresponding to 41.1 million base pairs. Transcripts were annotated with Blast2GO: 51,763 transcripts had BLASTX hits, of which 39,677 had GO terms and 14,117 had EC numbers associated with 147 KEGG pathways. Transcriptome differences among seven tissues were then analyzed by Illumina digital gene expression (DGE) tag profiling, which identified numerous differentially and specifically expressed transcripts. The expression characteristics of genes related to viral genomes, starch metabolism, and potential stress tolerance and insect resistance were also identified.
Conclusions/Significance: The combined de novo transcriptome assembly strategy can be applied to other organisms for which no reference genome is available. The data provided here represent the most comprehensive and integrated genomic resources for cloning and identifying genes of interest in sweet potato. Characterization of the sweet potato transcriptome provides an effective tool for better understanding the molecular mechanisms of cellular processes, including the development of leaves and storage roots, tissue-specific gene expression, and potential biotic and abiotic stress responses.
Pages: 14