Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato

被引:22
作者
Clark, Sarah [1 ]
Yu, Feng [2 ]
Gu, Lianfeng [3 ]
Min, Xiang Jia [1 ]
机构
[1] Youngstown State Univ, Dept Biol Sci, Youngstown, OH 44555 USA
[2] Youngstown State Univ, Dept Comp Sci & Informat Syst, Youngstown, OH 44555 USA
[3] Fujian Agr & Forestry Univ, Coll Forestry, Basic Forestry & Prote Ctr, Fuzhou, Fujian, Peoples R China
关键词
alternative splicing; gene expression; tomato; mRNA; plant; Solanum lycopersicum; transcriptome; GENOME-WIDE ANALYSIS; LONG NONCODING RNAS; SOLANUM-LYCOPERSICON; PROVIDES INSIGHTS; LANDSCAPE; REVEALS; COMPLEXITY; EVENTS; GENES; PROTEOME;
D O I
10.3389/fpls.2019.00689
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Tomato (Solanum lycopersicum) is an important vegetable and fruit crop. Its genome was completely sequenced and there are also a large amount of available expressed sequence tags (ESTs) and short reads generated by RNA sequencing (RNA-seq) technologies. Mapping transcripts including mRNA sequences, ESTs, and RNA-seq reads to the genome allows identifying pre-mRNA alternative splicing (AS), a post-transcriptional process generating two or more RNA isoforms from one pre-mRNA transcript. We comprehensively analyzed the AS landscape in tomato by integrating genome mapping information of all available mRNA and ESTs with mapping information of RNA-seq reads which were collected from 27 published projects. A total of 369,911 AS events were identified from 34,419 genomic loci involving 161,913 transcripts. Within the basic AS events, intron retention is the prevalent type (18.9%), followed by alternative acceptor site (12.9%) and alternative donor site (7.3%), with exon skipping as the least type (6.0%). Complex AS types having two or more basic event accounted for 54.9% of total AS events. Within 35,768 annotated protein-coding gene models, 23,233 gene models were found having pre-mRNAs generating AS isoform transcripts. Thus the estimated AS rate was 65.0% in tomato. The list of identified AS genes with their corresponding transcript isoforms serves as a catalog for further detailed examination of gene functions in tomato biology. The post-transcriptional information is also expected to be useful in improving the predicted gene models in tomato. The sequence and annotation information can be accessed at plant alternative splicing database (http://proteomics.ysu.edu/altsplice).
引用
收藏
页数:12
相关论文
共 40 条
[31]   A Novel Computational Framework to Predict Disease-Related Copy Number Variations by Integrating Multiple Data Sources [J].
Yuan, Lin ;
Sun, Tao ;
Zhao, Jing ;
Shen, Zhen .
FRONTIERS IN GENETICS, 2021, 12
[32]   Identification and differential expression of human collagenase-3 mRNA species derived from internal deletion, alternative splicing, and different polyadenylation and transcription initiation sites [J].
Tardif, G ;
Dupuis, M ;
Reboul, P ;
Geng, CS ;
Pelletier, JP ;
Ranger, P ;
Martel-Pelletier, J .
OSTEOARTHRITIS AND CARTILAGE, 2003, 11 (07) :524-537
[33]   Discovering Perturbation of Modular Structure in HIV Progression by Integrating Multiple Data Sources Through Non-Negative Matrix Factorization [J].
Ray, Sumanta ;
Maulik, Ujjwal .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (03) :869-877
[34]   Transcription and alternative splicing in the yir multigene family of the malaria parasite Plasmodium y. yoelii:: Identification of motifs suggesting epigenetic and post-transcriptional control of RNA expression [J].
Fonager, Jannik ;
Cunningham, Deirdre ;
Jarra, William ;
Koernig, Sandra ;
Henneman, Alex A. ;
Langhorne, Jean ;
Preiser, Peter .
MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 2007, 156 (01) :1-11
[35]   Identification of Alternative Splicing-Related Genes CYB561 and FOLH1 in the Tumor-Immune Microenvironment for Endometrial Cancer Based on TCGA Data Analysis [J].
Sun, Dan ;
Zhang, Aiqian ;
Gao, Bingsi ;
Zou, Lingxiao ;
Huang, Huan ;
Zhao, Xingping ;
Xu, Dabao .
FRONTIERS IN GENETICS, 2022, 13
[36]   Identification of important long non-coding RNAs and highly recurrent aberrant alternative splicing events in hepatocellular carcinoma through integrative analysis of multiple RNA-Seq datasets [J].
Zhang, Lu ;
Liu, Xiaoqiao ;
Zhang, Xuegong ;
Chen, Ronghua .
MOLECULAR GENETICS AND GENOMICS, 2016, 291 (03) :1035-1051
[37]   Identification of important long non-coding RNAs and highly recurrent aberrant alternative splicing events in hepatocellular carcinoma through integrative analysis of multiple RNA-Seq datasets [J].
Lu Zhang ;
Xiaoqiao Liu ;
Xuegong Zhang ;
Ronghua Chen .
Molecular Genetics and Genomics, 2016, 291 :1035-1051
[38]   Identification of CaPs locus involving in purple stripe formation on unripe fruit, reveals allelic variation and alternative splicing of R2R3-MYB transcription factor in pepper (Capsicum annuum L.) [J].
Li, Ning ;
Liu, Yabo ;
Yin, Yanxu ;
Gao, Shenghua ;
Wu, Fangyuan ;
Yu, Chuying ;
Wang, Fei ;
Kang, Byoung-Cheorl ;
Xu, Kai ;
Jiao, Chunhai ;
Yao, Minghua .
FRONTIERS IN PLANT SCIENCE, 2023, 14
[39]   Genome-wide data ( ChIP-seq) enabled identification of cell wall-related and aquaporin genes as targets of tomato ASR1, a drought stress-responsive transcription factor [J].
Ricardi, Martiniano M. ;
Gonazlez, Rodrigo M. ;
Zhong, Silin ;
Dominguez, Pia G. ;
Duffy, Tomas ;
Turjanski, Pablo G. ;
Salter, Juan D. Salgado ;
Alleva, Karina ;
Carrari, Fernando ;
Giovannoni, James J. ;
Estevez, Jose M. ;
Iusem, Norberto D. .
BMC PLANT BIOLOGY, 2014, 14
[40]   Genome-wide data (ChIP-seq) enabled identification of cell wall-related and aquaporin genes as targets of tomato ASR1, a drought stress-responsive transcription factor [J].
Martiniano M Ricardi ;
Rodrigo M González ;
Silin Zhong ;
Pía G Domínguez ;
Tomas Duffy ;
Pablo G Turjanski ;
Juan D Salgado Salter ;
Karina Alleva ;
Fernando Carrari ;
James J Giovannoni ;
José M Estévez ;
Norberto D Iusem .
BMC Plant Biology, 14