Fast and flexible estimation of effective migration surfaces

被引:38
作者
Marcus, Joseph [1 ]
Ha, Wooseok [2 ]
Barber, Rina Foygel [3 ]
Novembre, John [1 ,4 ]
机构
[1] Univ Chicago, Dept Human Genet, Chicago, IL 60637 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[3] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
[4] Univ Chicago, Dept Ecol & Evolut, 940 E 57Th St, Chicago, IL 60637 USA
来源
ELIFE | 2021年 / 10卷
基金
美国国家科学基金会;
关键词
POPULATION-STRUCTURE; GENE FLOW; HISTORY; MODEL; DIFFERENTIATION; INFERENCE; GEOGRAPHY; SELECTION;
D O I
10.7554/eLife.61927
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Spatial population genetic data often exhibits 'isolation-by-distance,' where genetic similarity tends to decrease as individuals become more geographically distant. The rate at which genetic similarity decays with distance is often spatially heterogeneous due to variable population processes like genetic drift, gene flow, and natural selection. Petkova et al., 2016 developed a statistical method called Estimating Effective Migration Surfaces (EEMS) for visualizing spatially heterogeneous isolation-by-distance on a geographic map. While EEMS is a powerful tool for depicting spatial population structure, it can suffer from slow runtimes. Here, we develop a related method called Fast Estimation of Effective Migration Surfaces (FEEMS). FEEMS uses a Gaussian Markov Random Field model in a penalized likelihood framework that allows for efficient optimization and output of effective migration surfaces. Further, the efficient optimization facilitates the inference of migration parameters per edge in the graph, rather than per node (as in EEMS). With simulations, we show conditions under which FEEMS can accurately recover effective migration surfaces with complex gene-flow histories, including those with anisotropy. We apply FEEMS to population genetic data from North American gray wolves and show it performs favorably in comparison to EEMS, with solutions obtained orders of magnitude faster. Overall, FEEMS expands the ability of users to quickly visualize and interpret spatial structure in their data.
引用
收藏
页数:46
相关论文
共 59 条
  • [1] Estimating recent migration and population-size surfaces
    Al-Asadi, Hussein
    Petkova, Desislava
    Stephens, Matthew
    Novembre, John
    [J]. PLOS GENETICS, 2019, 15 (01):
  • [2] Fast model-based estimation of ancestry in unrelated individuals
    Alexander, David H.
    Novembre, John
    Lange, Kenneth
    [J]. GENOME RESEARCH, 2009, 19 (09) : 1655 - 1664
  • [3] Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data
    Battey, C. J.
    Ralph, Peter L.
    Kern, Andrew D.
    [J]. GENETICS, 2020, 215 (01) : 193 - 214
  • [4] Distributed optimization and statistical learning via the alternating direction method of multipliers
    Boyd S.
    Parikh N.
    Chu E.
    Peleato B.
    Eckstein J.
    [J]. Foundations and Trends in Machine Learning, 2010, 3 (01): : 1 - 122
  • [5] Spatial Population Genetics: It's About Time
    Bradburd, Gideon S.
    Ralph, Peter L.
    [J]. ANNUAL REVIEW OF ECOLOGY, EVOLUTION, AND SYSTEMATICS, VOL 50, 2019, 50 : 427 - 449
  • [6] Inferring Continuous and Discrete Population Genetic Structure Across Space
    Bradburd, Gideon S.
    Coop, Graham M.
    Ralph, Peter L.
    [J]. GENETICS, 2018, 210 (01) : 33 - 52
  • [7] A Spatial Framework for Understanding Population Structure and Admixture
    Bradburd, Gideon S.
    Ralph, Peter L.
    Coop, Graham M.
    [J]. PLOS GENETICS, 2016, 12 (01):
  • [8] Burch JW, 2005, WILDLIFE SOC B, V33, P1225, DOI 10.2193/0091-7648(2005)33[1225:EOWDEF]2.0.CO
  • [9] 2
  • [10] A LIMITED MEMORY ALGORITHM FOR BOUND CONSTRAINED OPTIMIZATION
    BYRD, RH
    LU, PH
    NOCEDAL, J
    ZHU, CY
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1995, 16 (05) : 1190 - 1208