Identifying and Classifying Shared Selective Sweeps from Multilocus Data

被引:12
作者
Harris, Alexandre M. [1 ,2 ]
DeGiorgio, Michael [3 ]
机构
[1] Penn State Univ, Huck Inst Life Sci, Dept Biol, University Pk, PA 16802 USA
[2] Penn State Univ, Huck Inst Life Sci, Mol Cellular & Integrat Biosci, University Pk, PA 16802 USA
[3] Florida Atlantic Univ, Dept Comp & Elect Engn & Comp Sci, 777 Glades Rd,EE 418, Boca Raton, FL 33431 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
expected haplotype homozygosity; multilocus genotype; ancestral sweep; convergent sweep; MOLECULAR POPULATION-GENETICS; RECENT POSITIVE SELECTION; GENOME-WIDE SIGNALS; SOFT SWEEPS; HOMOLOGOUS RECOMBINATION; ETHNIC-DIFFERENCES; NATURAL-SELECTION; MUTATION-RATE; ADAPTATION; SIGNATURES;
D O I
10.1534/genetics.120.303137
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Positive selection causes beneficial alleles to rise to high frequency, resulting in a selective sweep of the diversity surrounding the selected sites. Accordingly, the signature of a selective sweep in an ancestral population may still remain in its descendants. Identifying signatures of selection in the ancestor that are shared among its descendants is important to contextualize the timing of a sweep, but few methods exist for this purpose. We introduce the statistic SS-H12, which can identify genomic regions under shared positive selection across populations and is based on the theory of the expected haplotype homozygosity statistic H12, which detects recent hard and soft sweeps from the presence of high-frequency haplotypes. SS-H12 is distinct from comparable statistics because it requires a minimum of only two populations, and properly identifies and differentiates between independent convergent sweeps and true ancestral sweeps, with high power and robustness to a variety of demographic models. Furthermore, we can apply SS-H12 in conjunction with the ratio of statistics we term H2Tot and H1Tot to further classify identified shared sweeps as hard or soft. Finally, we identified both previously reported and novel shared sweep candidates from human whole-genome sequences. Previously reported candidates include the well-characterized ancestral sweeps at LCT and SLC24A5 in Indo-Europeans, as well as GPHN worldwide. Novel candidates include an ancestral sweep at RGS18 in sub-Saharan Africans involved in regulating the platelet response and implicated in sudden cardiac death, and a convergent sweep at C2CD5 between European and East Asian populations that may explain their different insulin responses.
引用
收藏
页码:143 / 171
页数:29
相关论文
共 32 条
  • [21] Identifying genetic determinants of complex phenotypes from whole genome sequence data
    Long, George S.
    Hussen, Mohammed
    Dench, Jonathan
    Aris-Brosou, Stephane
    BMC GENOMICS, 2019, 20 (1)
  • [22] Identifying temporal and spatial patterns of variation from multimodal data using MEFISTO
    Velten, Britta
    Braunger, Jana M.
    Argelaguet, Ricard
    Arnol, Damien
    Wirbel, Jakob
    Bredikhin, Danila
    Zeller, Georg
    Stegle, Oliver
    NATURE METHODS, 2022, 19 (02) : 179 - +
  • [23] Private Selective Sweeps Identified from Next-Generation Pool-Sequencing Reveal Convergent Pathways under Selection in Two Inbred Schistosoma mansoni Strains
    Clement, Julie A. J.
    Toulza, Eve
    Gautier, Mathieu
    Parrinello, Hugues
    Roquis, David
    Boissier, Jerome
    Rognon, Anne
    Mone, Helene
    Mouahid, Gabriel
    Buard, Jerome
    Mitta, Guillaume
    Grunau, Christoph
    PLOS NEGLECTED TROPICAL DISEASES, 2013, 7 (12):
  • [24] Identifying Potentially Beneficial Genetic Mutations Associated with Monophyletic Selective Sweep and a Proof-of-Concept Study with Viral Genetic Data
    Furuse, Yuki
    MSYSTEMS, 2021, 6 (01)
  • [25] Inference of Gorilla Demographic and Selective History from Whole-Genome Sequence Data
    McManus, Kimberly F.
    Kelley, Joanna L.
    Song, Shiya
    Veeramah, Krishna R.
    Woerner, August E.
    Stevison, Laurie S.
    Ryder, Oliver A.
    Kidd, Jeffrey M.
    Wall, Jeffrey D.
    Bustamante, Carlos D.
    Hammer, Michael F.
    MOLECULAR BIOLOGY AND EVOLUTION, 2015, 32 (03) : 600 - 612
  • [26] Identifying structural variation in haploid microbial genomes from short-read resequencing data using breseq
    Barrick, Jeffrey E.
    Colburn, Geoffrey
    Deatherage, Daniel E.
    Traverse, Charles C.
    Strand, Matthew D.
    Borges, Jordan J.
    Knoester, David B.
    Reba, Aaron
    Meyer, Austin G.
    BMC GENOMICS, 2014, 15
  • [27] A Composite-Likelihood Method for Detecting Incomplete Selective Sweep from Population Genomic Data
    Vy, Ha My T.
    Kim, Yuseob
    GENETICS, 2015, 200 (02) : 633 - +
  • [28] A three-tiered integrative analysis of transcriptional data reveals the shared pathways related to heart failure from different aetiologies
    Jiang, Zhenhong
    Guo, Ninghong
    Hong, Kui
    JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2020, 24 (16) : 9085 - 9096
  • [29] Inferring Selective Constraint from Population Genomic Data Suggests Recent Regulatory Turnover in the Human Brain
    Schrider, Daniel R.
    Kern, Andrew D.
    GENOME BIOLOGY AND EVOLUTION, 2015, 7 (12): : 3511 - 3528
  • [30] A spectral clustering-based method for identifying clones from high-throughput B cell repertoire sequencing data
    Nouri, Nima
    Kleinstein, Steven H.
    BIOINFORMATICS, 2018, 34 (13) : 341 - 349