An improved statistical method for detecting heterotachy in nucleotide sequences

Times cited: 20
Authors
Baele, Guy [1]
Raes, Jeroen
Van de Peer, Yves
Vansteelandt, Stijn
Affiliations
[1] Univ Ghent, Dept Appl Math & Comp Sci, Ghent, Belgium
[2] Univ Ghent, Dept Appl Math & Comp Sci, Ghent, Belgium
[3] Univ Ghent, Dept Plant Syst Biol, Ghent, Belgium
Keywords
heterotachy; covarion; false discovery rate; bootstrap support; ribosomal RNA; eukaryotes
DOI
10.1093/molbev/msl006
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The principle of heterotachy states that the substitution rate of sites in a gene can change through time. In this article, we propose a powerful statistical test to detect sites that evolve according to the process of heterotachy. We apply this test to an alignment of 1289 eukaryotic rRNA molecules 1) to determine how widespread the phenomenon of heterotachy is in ribosomal RNA, 2) to test whether these heterotachous sites are nonrandomly distributed, that is, linked to secondary structure features of ribosomal RNA, and 3) to determine the impact of heterotachous sites on the bootstrap support of monophyletic groupings. Our study revealed that with 21 monophyletic taxa, approximately two-thirds of the sites in the considered set of sequences are heterotachous. Although the detected heterotachous sites do not appear bound to specific structural features of the small subunit rRNA, their presence is shown to have a large beneficial influence on the bootstrap support of monophyletic groups. Using extensive testing, we show that this may not be due to heterotachy itself but merely due to the increased substitution rate at the detected heterotachous sites.
Pages: 1397-1405
Number of pages: 9
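
The abstract in this record describes testing individual sites of an alignment, and the keywords include false discovery rate, so the setting is one of many simultaneous tests. As a hedged illustration only, the sketch below applies the standard Benjamini-Hochberg step-up procedure to a list of per-site p-values. The function name `benjamini_hochberg`, the default `alpha`, and the example p-values are assumptions made for illustration; the record does not state which FDR procedure the authors used, and this is not their test for heterotachy.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Benjamini-Hochberg step-up procedure (illustrative sketch).

    Returns a list of booleans, one per input p-value, that is True
    where the corresponding null hypothesis is rejected while
    controlling the false discovery rate at level alpha.
    """
    m = len(p_values)
    if m == 0:
        return []

    # Indices of the p-values, ordered from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k such that p_(k) <= (k / m) * alpha.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank

    # Reject every hypothesis ranked at or below the cutoff.
    rejected = [False] * m
    for idx in order[:cutoff_rank]:
        rejected[idx] = True
    return rejected


# Hypothetical per-site p-values, not taken from the paper.
site_p_values = [0.001, 0.008, 0.039, 0.041, 0.042,
                 0.060, 0.074, 0.205, 0.212, 0.216]
flags = benjamini_hochberg(site_p_values, alpha=0.05)
print(sum(flags), "of", len(flags), "sites flagged")
```

Because the procedure is step-up, a p-value that misses its own threshold can still be rejected when a larger p-value further down the ranking meets its threshold; the cutoff is the largest passing rank, not the first failing one.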