Genome-wide analyses supported by RNA-Seq reveal non-canonical splice sites in plant genomes

Cited by: 31
Authors
Pucker, Boas [1, 2, 3]
Brockington, Samuel F. [1]
Affiliations
[1] Univ Cambridge, Dept Plant Sci, Evolut & Divers, Cambridge, England
[2] Bielefeld Univ, CeBiTec, Genet & Genom Plants, Bielefeld, Germany
[3] Bielefeld Univ, Fac Biol, Bielefeld, Germany
Source
BMC Genomics | 2018 / Volume 19
Keywords
Gene structure; Splicing; Annotation; Comparative genomics; Transcriptomics; Gene expression; Natural diversity; Evolution; INTRONS; GENES; SEQUENCES; MECHANISM; PROTEIN; CONSERVATION; EVOLUTION; MONOCOT;
DOI
10.1186/s12864-018-5360-z
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Most eukaryotic genes comprise exons and introns, thus requiring the precise removal of introns from pre-mRNAs to enable protein biosynthesis. U2 and U12 spliceosomes catalyze this step by recognizing motifs on the transcript in order to remove the introns. This process depends on the precise definition of exon-intron borders by splice sites, which are consequently highly conserved across species. Only very few combinations of terminal dinucleotides are frequently observed at intron ends, dominated by the canonical GT-AG splice sites on the DNA level.
Results: Here we investigate the occurrence of diverse combinations of dinucleotides at predicted splice sites. Analyzing 121 plant genome sequences based on their annotation revealed strong splice site conservation across species, annotation errors, and true biological divergence from canonical splice sites. The frequency of non-canonical splice sites clearly correlates with their divergence from canonical ones, indicating either an accumulation of probably neutral mutations or evolution towards canonical splice sites. Strong conservation across multiple species and non-random accumulation of substitutions in splice sites indicate a functional relevance of non-canonical splice sites. The average composition of splice sites across all investigated species is 98.7% for GT-AG, 1.2% for GC-AG, 0.06% for AT-AC, and 0.09% for minor non-canonical splice sites. RNA-Seq data sets of 35 species were incorporated to validate non-canonical splice site predictions through gaps in sequencing read alignments and to demonstrate the expression of affected genes.
Conclusion: We conclude that bona fide non-canonical splice sites are present and appear to be functionally relevant in most plant genomes, although at low abundance.
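The splice site composition reported in the abstract rests on tallying the terminal dinucleotides of annotated introns. The following is a minimal sketch of that idea, not the authors' pipeline. It assumes exon coordinates are 1-based and inclusive (as in GFF3), that introns are the gaps between consecutive exons of one transcript, and that minus-strand introns are reverse-complemented before classification. All function names and the toy sequence are hypothetical.

```python
from collections import Counter

# Translation table for complementing DNA (assumption: only A/C/G/T occur).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def introns_from_exons(exons):
    """Derive intron coordinates (1-based, inclusive) from the exons of one transcript."""
    ordered = sorted(exons)
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
        if next_start - prev_end > 1  # skip directly adjacent exons
    ]


def splice_site_class(chrom_seq, start, end, strand):
    """Classify an intron by its terminal dinucleotides in transcript orientation."""
    intron = chrom_seq[start - 1:end].upper()
    if strand == "-":
        intron = reverse_complement(intron)
    combo = f"{intron[:2]}-{intron[-2:]}"
    return combo if combo in ("GT-AG", "GC-AG", "AT-AC") else "other"


def tally_splice_sites(chrom_seq, transcripts):
    """Count splice site classes over (exons, strand) pairs on one sequence."""
    counts = Counter()
    for exons, strand in transcripts:
        for start, end in introns_from_exons(exons):
            if end - start + 1 >= 4:  # need at least two dinucleotides
                counts[splice_site_class(chrom_seq, start, end, strand)] += 1
    return counts


# Toy plus-strand example: exon, GT...AG intron, exon, GC...AG intron, exon.
toy_seq = "AAAA" + "GTCCCCAG" + "TTTT" + "GCAAAAAG" + "CCCC"
toy_counts = tally_splice_sites(toy_seq, [([(1, 4), (13, 16), (25, 28)], "+")])
```

Dividing each count by the total number of introns would give per-genome percentages comparable in kind to those quoted in the abstract. A real analysis would also need to parse the annotation and sequence files and handle ambiguous bases.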
Pages: 13