Genome-wide analyses supported by RNA-Seq reveal non-canonical splice sites in plant genomes

Cited by: 31
Authors
Pucker, Boas [1, 2, 3]
Brockington, Samuel F. [1]
Affiliations
[1] Univ Cambridge, Dept Plant Sci, Evolut & Divers, Cambridge, England
[2] Bielefeld Univ, CeBiTec, Genet & Genom Plants, Bielefeld, Germany
[3] Bielefeld Univ, Fac Biol, Bielefeld, Germany
Source
BMC GENOMICS | 2018 / Volume 19
Keywords
Gene structure; Splicing; Annotation; Comparative genomics; Transcriptomics; Gene expression; Natural diversity; Evolution; INTRONS; GENES; SEQUENCES; MECHANISM; PROTEIN; CONSERVATION; EVOLUTION; MONOCOT;
DOI
10.1186/s12864-018-5360-z
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Most eukaryotic genes comprise exons and introns, so introns must be removed precisely from pre-mRNAs to enable protein biosynthesis. U2 and U12 spliceosomes catalyze this step by recognizing motifs on the transcript. The process depends on the precise definition of exon-intron borders by splice sites, which are consequently highly conserved across species. Only very few combinations of terminal dinucleotides are frequently observed at intron ends, dominated by the canonical GT-AG splice sites on the DNA level.
Results: Here we investigate the occurrence of diverse combinations of dinucleotides at predicted splice sites. Analyzing 121 plant genome sequences based on their annotation revealed strong splice site conservation across species, annotation errors, and true biological divergence from canonical splice sites. The frequency of non-canonical splice sites clearly correlates with their divergence from canonical ones, indicating either an accumulation of probably neutral mutations or evolution towards canonical splice sites. Strong conservation across multiple species and non-random accumulation of substitutions in splice sites indicate a functional relevance of non-canonical splice sites. The average composition of splice sites across all investigated species is 98.7% for GT-AG, 1.2% for GC-AG, 0.06% for AT-AC, and 0.09% for minor non-canonical splice sites. RNA-Seq data sets of 35 species were incorporated to validate non-canonical splice site predictions through gaps in sequencing read alignments and to demonstrate the expression of affected genes.
Conclusion: We conclude that bona fide non-canonical splice sites are present and appear to be functionally relevant in most plant genomes, although at low abundance.
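The splice site composition reported in the abstract rests on tallying the terminal dinucleotides of annotated introns. The following is a minimal sketch of that idea, not the authors' pipeline: the function names, the 0-based half-open intron coordinates, and the grouping of everything outside GT-AG, GC-AG, and AT-AC under "other" are assumptions made for illustration.

```python
from collections import Counter

# Combinations named individually in the abstract; anything else is
# grouped as "other" (an assumption made for this sketch).
NAMED_COMBINATIONS = {"GT-AG", "GC-AG", "AT-AC"}

_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def splice_site_pair(genome: str, start: int, end: int, strand: str = "+") -> str:
    """Return the terminal dinucleotides of one intron, e.g. 'GT-AG'.

    start/end are hypothetical 0-based, end-exclusive coordinates on the
    forward strand; minus-strand introns are reverse-complemented first.
    """
    intron = genome[start:end].upper()
    if len(intron) < 4:
        raise ValueError("intron too short to have two terminal dinucleotides")
    if strand == "-":
        intron = reverse_complement(intron)
    return f"{intron[:2]}-{intron[-2:]}"


def tally_splice_sites(genome: str, introns) -> Counter:
    """Count dinucleotide combinations over (start, end, strand) tuples."""
    counts = Counter()
    for start, end, strand in introns:
        pair = splice_site_pair(genome, start, end, strand)
        counts[pair if pair in NAMED_COMBINATIONS else "other"] += 1
    return counts


def as_percentages(counts: Counter) -> dict:
    """Convert raw counts to percentages of all tallied introns."""
    total = sum(counts.values())
    return {label: 100.0 * n / total for label, n in counts.items()}


if __name__ == "__main__":
    # Toy sequence with four made-up introns (illustrative data only).
    toy_genome = "AAGTCCCCAGTT" "TTCTAAAAACGG" "AAGCAAAAAGTT" "AAATCCCCACTT"
    toy_introns = [(2, 10, "+"), (14, 22, "-"), (26, 34, "+"), (38, 46, "+")]
    print(as_percentages(tally_splice_sites(toy_genome, toy_introns)))
```

With a real annotation, the intron tuples would be derived from the gaps between consecutive exons of each transcript, and coordinates from 1-based formats such as GFF would need converting first.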
Pages: 13
Related papers
50 in total
  • [1] Genome-wide analyses supported by RNA-Seq reveal non-canonical splice sites in plant genomes
    Boas Pucker
    Samuel F. Brockington
    BMC Genomics, 19
  • [2] Novel Bioinformatics Method for Identification of Genome-Wide Non-Canonical Spliced Regions Using RNA-Seq Data
    Bai, Yongsheng
    Hassler, Justin
    Ziyar, Ahdad
    Li, Philip
    Wright, Zachary
    Menon, Rajasree
    Omenn, Gilbert S.
    Cavalcoli, James D.
    Kaufman, Randal J.
    Sartor, Maureen A.
    PLOS ONE, 2014, 9 (07):
  • [3] Analysis of canonical and non-canonical splice sites in mammalian genomes
    Burset, M
    Seledtsov, IA
    Solovyev, VV
    NUCLEIC ACIDS RESEARCH, 2000, 28 (21) : 4364 - 4375
  • [4] Genome-wide association and RNA-seq analyses reveal a potential gene related to linolenic acid in soybean seeds
    Qin, Di
    Xing, Jiehua
    Cheng, Ping
    Yu, Guohui
    PEERJ, 2023, 11
  • [5] Read-Split-Run: an improved bioinformatics pipeline for identification of genome-wide non-canonical spliced regions using RNA-Seq data
    Bai, Yongsheng
    Kinne, Jeff
    Donham, Brandon
    Jiang, Feng
    Ding, Lizhong
    Hassler, Justin R.
    Kaufman, Randal J.
    BMC GENOMICS, 2016, 17
  • [6] Read-Split-Run: an improved bioinformatics pipeline for identification of genome-wide non-canonical spliced regions using RNA-Seq data
    Yongsheng Bai
    Jeff Kinne
    Brandon Donham
    Feng Jiang
    Lizhong Ding
    Justin R. Hassler
    Randal J. Kaufman
    BMC Genomics, 17
  • [7] Animal, Fungi, and Plant Genome Sequences Harbor Different Non-Canonical Splice Sites
    Frey, Katharina
    Pucker, Boas
    CELLS, 2020, 9 (02)
  • [8] Genome-Wide Association and RNA-Seq Analyses Reveal a Potential Candidate Gene Related to Oil Content in Soybean Seeds
    Jia, Hongchang
    Han, Dezhi
    Yan, Xiaofei
    Zhang, Lei
    Liang, Jili
    Lu, Wencheng
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (15)
  • [9] Riboswitch Discovery by Combining RNA-Seq and Genome-Wide Identification of Transcriptional Start Sites
    Rosinski-Chupin, Isabelle
    Soutourina, Olga
    Martin-Verstraete, Isabelle
    RIBOSWITCH DISCOVERY, STRUCTURE AND FUNCTION, 2014, 549 : 3 - 27
  • [10] RNA-Seq and Genome-Wide Association Studies Reveal Potential Genes for Rice Seed Shattering
    Wu, Linxuan
    Yue, Jicheng
    Wang, Jiafeng
    Lu, Wenyu
    Huang, Ming
    Guo, Tao
    Wang, Hui
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (23)