Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato

被引:21
|
作者
Clark, Sarah [1 ]
Yu, Feng [2 ]
Gu, Lianfeng [3 ]
Min, Xiang Jia [1 ]
机构
[1] Youngstown State Univ, Dept Biol Sci, Youngstown, OH 44555 USA
[2] Youngstown State Univ, Dept Comp Sci & Informat Syst, Youngstown, OH 44555 USA
[3] Fujian Agr & Forestry Univ, Coll Forestry, Basic Forestry & Prote Ctr, Fuzhou, Fujian, Peoples R China
来源
FRONTIERS IN PLANT SCIENCE | 2019年 / 10卷
关键词
alternative splicing; gene expression; tomato; mRNA; plant; Solanum lycopersicum; transcriptome; GENOME-WIDE ANALYSIS; LONG NONCODING RNAS; SOLANUM-LYCOPERSICON; PROVIDES INSIGHTS; LANDSCAPE; REVEALS; COMPLEXITY; EVENTS; GENES; PROTEOME;
D O I
10.3389/fpls.2019.00689
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Tomato (Solanum lycopersicum) is an important vegetable and fruit crop. Its genome was completely sequenced and there are also a large amount of available expressed sequence tags (ESTs) and short reads generated by RNA sequencing (RNA-seq) technologies. Mapping transcripts including mRNA sequences, ESTs, and RNA-seq reads to the genome allows identifying pre-mRNA alternative splicing (AS), a post-transcriptional process generating two or more RNA isoforms from one pre-mRNA transcript. We comprehensively analyzed the AS landscape in tomato by integrating genome mapping information of all available mRNA and ESTs with mapping information of RNA-seq reads which were collected from 27 published projects. A total of 369,911 AS events were identified from 34,419 genomic loci involving 161,913 transcripts. Within the basic AS events, intron retention is the prevalent type (18.9%), followed by alternative acceptor site (12.9%) and alternative donor site (7.3%), with exon skipping as the least type (6.0%). Complex AS types having two or more basic event accounted for 54.9% of total AS events. Within 35,768 annotated protein-coding gene models, 23,233 gene models were found having pre-mRNAs generating AS isoform transcripts. Thus the estimated AS rate was 65.0% in tomato. The list of identified AS genes with their corresponding transcript isoforms serves as a catalog for further detailed examination of gene functions in tomato biology. The post-transcriptional information is also expected to be useful in improving the predicted gene models in tomato. The sequence and annotation information can be accessed at plant alternative splicing database (http://proteomics.ysu.edu/altsplice).
引用
收藏
页数:12
相关论文
共 40 条
  • [1] Identification of alternative splicing events by RNA sequencing in early growth tomato fruits
    Sun, Yuan
    Xiao, Han
    BMC GENOMICS, 2015, 16
  • [2] Identification and genetic analysis of alternative splicing of long non-coding RNAs in tomato initial flowering stage
    Yang, Zhenchao
    Yang, Zhao
    Yang, Chengcheng
    Wang, Zhengyan
    Chen, Danyan
    Xie, Yingge
    Wu, Yongjun
    GENOMICS, 2020, 112 (01) : 897 - 907
  • [3] Identification of alternative splicing events by RNA sequencing in early growth tomato fruits
    Yuan Sun
    Han Xiao
    BMC Genomics, 16
  • [4] Identification of hepatopoietin dimerization, its interacting regions and alternative splicing of its transcription
    Li, Y
    Wei, KH
    Lu, CR
    Li, YX
    Li, M
    Xing, GC
    Wei, HD
    Wang, QM
    Chen, JZ
    Wu, CS
    Chen, HP
    Yang, SC
    He, FC
    EUROPEAN JOURNAL OF BIOCHEMISTRY, 2002, 269 (16): : 3888 - 3893
  • [5] nagnag: Identification and quantification of NAGNAG alternative splicing using RNA-Seq data
    Yan, Xiaoyan
    Sablok, Gaurav
    Feng, Gang
    Ma, Jiaxin
    Zhao, Hongwei
    Sun, Xiaoyong
    FEBS LETTERS, 2015, 589 (15) : 1766 - 1770
  • [6] Identification of cancer subtypes by integrating multiple types of transcriptomics data with deep learning in breast cancer
    Guo, Yang
    Shang, Xuequn
    Li, Zhanhuai
    NEUROCOMPUTING, 2019, 324 : 20 - 30
  • [7] The identification of switch-like alternative splicing exons among multiple samples with RNA-Seq data
    Qin, Zhiyi
    Zhang, Xuegong
    PLOS ONE, 2017, 12 (05):
  • [8] Integrating multiple evidence sources to predict transcription factor binding in the human genome
    Ernst, Jason
    Plasterer, Heather L.
    Simon, Itamar
    Bar-Joseph, Ziv
    GENOME RESEARCH, 2010, 20 (04) : 526 - 536
  • [9] Multiple Alternative Promoters and Alternative Splicing Enable Universal Transcription-Based Logic Computation in Mammalian Cells
    Doshi, Jiten
    Willis, Katie
    Madurga, Angela
    Stelzer, Christoph
    Benenson, Yaakov
    CELL REPORTS, 2020, 33 (09):
  • [10] Integrating Multiple Sources of Genomic Data by Multiplex Network Reconstruction
    Gao, S.
    Zou, W. Y.
    Liu, Y. Y.
    Wang, X. W.
    Zhuang, Y.
    Wei, X. W.
    Alhajj, R.
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1672 - 1677