Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato

被引:22
作者
Clark, Sarah [1 ]
Yu, Feng [2 ]
Gu, Lianfeng [3 ]
Min, Xiang Jia [1 ]
机构
[1] Youngstown State Univ, Dept Biol Sci, Youngstown, OH 44555 USA
[2] Youngstown State Univ, Dept Comp Sci & Informat Syst, Youngstown, OH 44555 USA
[3] Fujian Agr & Forestry Univ, Coll Forestry, Basic Forestry & Prote Ctr, Fuzhou, Fujian, Peoples R China
关键词
alternative splicing; gene expression; tomato; mRNA; plant; Solanum lycopersicum; transcriptome; GENOME-WIDE ANALYSIS; LONG NONCODING RNAS; SOLANUM-LYCOPERSICON; PROVIDES INSIGHTS; LANDSCAPE; REVEALS; COMPLEXITY; EVENTS; GENES; PROTEOME;
D O I
10.3389/fpls.2019.00689
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Tomato (Solanum lycopersicum) is an important vegetable and fruit crop. Its genome was completely sequenced and there are also a large amount of available expressed sequence tags (ESTs) and short reads generated by RNA sequencing (RNA-seq) technologies. Mapping transcripts including mRNA sequences, ESTs, and RNA-seq reads to the genome allows identifying pre-mRNA alternative splicing (AS), a post-transcriptional process generating two or more RNA isoforms from one pre-mRNA transcript. We comprehensively analyzed the AS landscape in tomato by integrating genome mapping information of all available mRNA and ESTs with mapping information of RNA-seq reads which were collected from 27 published projects. A total of 369,911 AS events were identified from 34,419 genomic loci involving 161,913 transcripts. Within the basic AS events, intron retention is the prevalent type (18.9%), followed by alternative acceptor site (12.9%) and alternative donor site (7.3%), with exon skipping as the least type (6.0%). Complex AS types having two or more basic event accounted for 54.9% of total AS events. Within 35,768 annotated protein-coding gene models, 23,233 gene models were found having pre-mRNAs generating AS isoform transcripts. Thus the estimated AS rate was 65.0% in tomato. The list of identified AS genes with their corresponding transcript isoforms serves as a catalog for further detailed examination of gene functions in tomato biology. The post-transcriptional information is also expected to be useful in improving the predicted gene models in tomato. The sequence and annotation information can be accessed at plant alternative splicing database (http://proteomics.ysu.edu/altsplice).
引用
收藏
页数:12
相关论文
共 40 条
[1]   Identification of alternative splicing events by RNA sequencing in early growth tomato fruits [J].
Sun, Yuan ;
Xiao, Han .
BMC GENOMICS, 2015, 16
[2]   Identification and genetic analysis of alternative splicing of long non-coding RNAs in tomato initial flowering stage [J].
Yang, Zhenchao ;
Yang, Zhao ;
Yang, Chengcheng ;
Wang, Zhengyan ;
Chen, Danyan ;
Xie, Yingge ;
Wu, Yongjun .
GENOMICS, 2020, 112 (01) :897-907
[3]   Identification of alternative splicing events by RNA sequencing in early growth tomato fruits [J].
Yuan Sun ;
Han Xiao .
BMC Genomics, 16
[4]   Identification of hepatopoietin dimerization, its interacting regions and alternative splicing of its transcription [J].
Li, Y ;
Wei, KH ;
Lu, CR ;
Li, YX ;
Li, M ;
Xing, GC ;
Wei, HD ;
Wang, QM ;
Chen, JZ ;
Wu, CS ;
Chen, HP ;
Yang, SC ;
He, FC .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 2002, 269 (16) :3888-3893
[5]   nagnag: Identification and quantification of NAGNAG alternative splicing using RNA-Seq data [J].
Yan, Xiaoyan ;
Sablok, Gaurav ;
Feng, Gang ;
Ma, Jiaxin ;
Zhao, Hongwei ;
Sun, Xiaoyong .
FEBS LETTERS, 2015, 589 (15) :1766-1770
[6]   Identification of cancer subtypes by integrating multiple types of transcriptomics data with deep learning in breast cancer [J].
Guo, Yang ;
Shang, Xuequn ;
Li, Zhanhuai .
NEUROCOMPUTING, 2019, 324 :20-30
[7]   The identification of switch-like alternative splicing exons among multiple samples with RNA-Seq data [J].
Qin, Zhiyi ;
Zhang, Xuegong .
PLOS ONE, 2017, 12 (05)
[8]   Integrating multiple evidence sources to predict transcription factor binding in the human genome [J].
Ernst, Jason ;
Plasterer, Heather L. ;
Simon, Itamar ;
Bar-Joseph, Ziv .
GENOME RESEARCH, 2010, 20 (04) :526-536
[9]   Multiple Alternative Promoters and Alternative Splicing Enable Universal Transcription-Based Logic Computation in Mammalian Cells [J].
Doshi, Jiten ;
Willis, Katie ;
Madurga, Angela ;
Stelzer, Christoph ;
Benenson, Yaakov .
CELL REPORTS, 2020, 33 (09)
[10]   Integrating Multiple Sources of Genomic Data by Multiplex Network Reconstruction [J].
Gao, S. ;
Zou, W. Y. ;
Liu, Y. Y. ;
Wang, X. W. ;
Zhuang, Y. ;
Wei, X. W. ;
Alhajj, R. .
PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, :1672-1677