Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato

Cited by: 21
Authors
Clark, Sarah [1]
Yu, Feng [2]
Gu, Lianfeng [3]
Min, Xiang Jia [1]
Affiliations
[1] Youngstown State Univ, Dept Biol Sci, Youngstown, OH 44555 USA
[2] Youngstown State Univ, Dept Comp Sci & Informat Syst, Youngstown, OH 44555 USA
[3] Fujian Agr & Forestry Univ, Coll Forestry, Basic Forestry & Prote Ctr, Fuzhou, Fujian, Peoples R China
Source
FRONTIERS IN PLANT SCIENCE | 2019 / Volume 10
Keywords
alternative splicing; gene expression; tomato; mRNA; plant; Solanum lycopersicum; transcriptome; GENOME-WIDE ANALYSIS; LONG NONCODING RNAS; SOLANUM-LYCOPERSICON; PROVIDES INSIGHTS; LANDSCAPE; REVEALS; COMPLEXITY; EVENTS; GENES; PROTEOME;
DOI
10.3389/fpls.2019.00689
Chinese Library Classification number
Q94 [Botany];
Discipline classification code
071001 ;
Abstract
Tomato (Solanum lycopersicum) is an important vegetable and fruit crop. Its genome has been completely sequenced, and a large number of expressed sequence tags (ESTs) and short reads generated by RNA sequencing (RNA-seq) technologies are also available. Mapping transcripts, including mRNA sequences, ESTs, and RNA-seq reads, to the genome allows the identification of pre-mRNA alternative splicing (AS), a post-transcriptional process that generates two or more RNA isoforms from one pre-mRNA transcript. We comprehensively analyzed the AS landscape in tomato by integrating genome mapping information for all available mRNAs and ESTs with mapping information for RNA-seq reads collected from 27 published projects. A total of 369,911 AS events were identified from 34,419 genomic loci involving 161,913 transcripts. Among the basic AS events, intron retention was the most prevalent type (18.9%), followed by alternative acceptor site (12.9%) and alternative donor site (7.3%), with exon skipping the least frequent type (6.0%). Complex AS types, which combine two or more basic events, accounted for 54.9% of total AS events. Of the 35,768 annotated protein-coding gene models, 23,233 were found to have pre-mRNAs generating AS isoform transcripts. Thus, the estimated AS rate in tomato was 65.0%. The list of identified AS genes with their corresponding transcript isoforms serves as a catalog for further detailed examination of gene functions in tomato biology. The post-transcriptional information is also expected to be useful for improving the predicted gene models in tomato. The sequence and annotation information can be accessed at the plant alternative splicing database (http://proteomics.ysu.edu/altsplice).
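The arithmetic behind the reported AS rate and the event-type breakdown can be checked with a short Python sketch. The counts and percentages are copied from the abstract; the variable and function names are illustrative assumptions and do not come from the authors' pipeline.

```python
# Counts quoted in the abstract (names are hypothetical, chosen for illustration).
annotated_gene_models = 35_768   # annotated protein-coding gene models
gene_models_with_as = 23_233     # gene models with AS isoform transcripts


def as_rate_percent(with_as: int, total: int) -> float:
    """Return the percentage of gene models that show alternative splicing."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * with_as / total


rate = as_rate_percent(gene_models_with_as, annotated_gene_models)

# Shares of AS event types as reported in the abstract (percent of all events).
event_type_shares = {
    "intron retention": 18.9,
    "alternative acceptor site": 12.9,
    "alternative donor site": 7.3,
    "exon skipping": 6.0,
    "complex (two or more basic events)": 54.9,
}
total_share = sum(event_type_shares.values())

print(f"Estimated AS rate: {rate:.1f}%")
print(f"Sum of event-type shares: {total_share:.1f}%")
```

Rounded to one decimal place, the ratio reproduces the 65.0% rate stated in the abstract, and the five event-type shares together account for all AS events.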
Pages: 12