On estimating evolutionary probabilities of population variants

被引:4
|
作者
Patel, Ravi [1 ,2 ]
Kumar, Sudhir [1 ,2 ,3 ]
机构
[1] Temple Univ, Inst Genom & Evolutionary Med, Philadelphia, PA 19122 USA
[2] Temple Univ, Dept Biol, Philadelphia, PA 19122 USA
[3] King Abdulaziz Univ, Ctr Excellence Genome Med & Res, Jeddah, Saudi Arabia
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Generalized method; Evolutionary probability; Forbidden alleles; Potential adaptation; DIVERGENCE TIMES; CONSERVATION; TIMETREES;
D O I
10.1186/s12862-019-1455-7
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
BackgroundThe evolutionary probability (EP) of an allele in a DNA or protein sequence predicts evolutionarily permissible (ePerm; EP0.05) and forbidden (eForb; EP<0.05) variants. EP of an allele represents an independent evolutionary expectation of observing an allele in a population based solely on the long-term substitution patterns captured in a multiple sequence alignment. In the neutral theory, EP and population frequencies can be compared to identify neutral and non-neutral alleles. This approach has been used to discover candidate adaptive polymorphisms in humans, which are eForbs segregating with high frequencies. The original method to compute EP requires the evolutionary relationships and divergence times of species in the sequence alignment (a timetree), which are not known with certainty for most datasets. This requirement impedes a general use of the original EP formulation. Here, we present an approach in which the phylogeny and times are inferred from the sequence alignment itself prior to the EP calculation. We evaluate if the modified EP approach produces results that are similar to those from the original method.ResultsWe compared EP estimates from the original and the modified approaches by using more than 18,000 protein sequence alignments containing orthologous sequences from 46 vertebrate species. For the original EP calculations, we used species relationships from UCSC and divergence times from TimeTree web resource, and the resulting EP estimates were considered to be the ground truth. We found that the modified approaches produced reasonable EP estimates for HGMD disease missense variant and 1000 Genomes Project missense variant datasets. Our results showed that reliable estimates of EP can be obtained without a priori knowledge of the sequence phylogeny and divergence times. We also found that, in order to obtain robust EP estimates, it is important to assemble a dataset with many sequences, sampling from a diversity of species groups.ConclusionWe conclude that the modified EP approach will be generally applicable for alignments and enable the detection of potentially neutral, deleterious, and adaptive alleles in populations.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] On estimating evolutionary probabilities of population variants
    Ravi Patel
    Sudhir Kumar
    BMC Evolutionary Biology, 19
  • [2] Measurement of population income: Variants of estimating biases
    Cherkashina, Tatyana Yu
    VOPROSY EKONOMIKI, 2020, (01): : 127 - 144
  • [3] ESTIMATING PROBABILITIES OF POPULATION RESPONSE RATES FROM DATA AND JUDGMENTS
    WALLSTEN, TS
    PHARMACOLOGY BIOCHEMISTRY AND BEHAVIOR, 1987, 27 (03) : 600 - 600
  • [4] ESTIMATING PROBABILITIES
    BRELSFOR.WM
    BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 1967, 48 (03) : 205 - &
  • [5] ESTIMATING PROBABILITIES
    BRELSFORD, WM
    JONES, RH
    MONTHLY WEATHER REVIEW, 1967, 95 (08) : 570 - +
  • [6] Novel approaches to probabilistic neural networks through bagging and evolutionary estimating of prior probabilities
    Georgiou, Vasileios L.
    Alevizos, Philipos D.
    Vrahatis, Michael N.
    NEURAL PROCESSING LETTERS, 2008, 27 (02) : 153 - 162
  • [7] Novel Approaches to Probabilistic Neural Networks Through Bagging and Evolutionary Estimating of Prior Probabilities
    Vasileios L. Georgiou
    Philipos D. Alevizos
    Michael N. Vrahatis
    Neural Processing Letters, 2008, 27 : 153 - 162
  • [8] Estimating subjective probabilities
    Andersen, Steffen
    Fountain, John
    Harrison, Glenn W.
    Rutstroem, E. Elisabet
    JOURNAL OF RISK AND UNCERTAINTY, 2014, 48 (03) : 207 - 229
  • [9] ESTIMATING ORDERED PROBABILITIES
    KATZ, MW
    ANNALS OF MATHEMATICAL STATISTICS, 1963, 34 (03): : 967 - &
  • [10] ESTIMATING LOSS PROBABILITIES
    BROWNING, RL
    CHEMICAL ENGINEERING, 1969, 76 (27) : 135 - &