A community-maintained standard library of population genetic models

被引:90
作者
Adrion, Jeffrey R. [1 ,2 ]
Cole, Christopher B. [3 ]
Dukler, Noah [4 ]
Galloway, Jared G. [1 ,2 ]
Gladstein, Ariella L. [5 ]
Gower, Graham [6 ]
Kyriazis, Christopher C. [7 ]
Ragsdale, Aaron P. [8 ]
Tsambos, Georgia [9 ]
Baumdicker, Franz [10 ]
Carlson, Jedidiah [11 ]
Cartwright, Reed A. [12 ,13 ]
Durvasula, Arun [14 ]
Gronau, Ilan [15 ]
Kim, Bernard Y. [16 ]
McKenzie, Patrick [17 ]
Messer, Philipp W. [18 ]
Noskova, Ekaterina [19 ]
Ortega-Del Vecchyo, Diego [20 ]
Racimo, Fernando [6 ]
Struck, Travis J. [21 ]
Gravel, Simon [8 ]
Gutenkunst, Ryan N. [21 ]
Lohmueller, Kirk E. [7 ,14 ]
Ralph, Peter L. [1 ,2 ,22 ]
Schrider, Daniel R. [5 ]
Siepel, Adam [4 ]
Kelleher, Jerome [23 ]
Kern, Andrew D. [1 ,2 ]
机构
[1] Univ Oregon, Dept Biol, Eugene, OR 97403 USA
[2] Univ Oregon, Inst Ecol & Evolut, Eugene, OR 97403 USA
[3] Univ Oxford, Weatherall Inst Mol Med, Oxford, England
[4] Cold Spring Harbor Lab, Simons Ctr Quantitat Biol, POB 100, Cold Spring Harbor, NY 11724 USA
[5] Univ North Carolina, Dept Genet, Chapel Hill, NC 27515 USA
[6] Univ Copenhagen, Lundbeck GeoGenet Ctr, Globe Inst, Copenhagen, Denmark
[7] Univ Calif Los Angeles, Dept Ecol & Evolutionary Biol, Los Angeles, CA USA
[8] McGill Univ, Dept Human Genet, Montreal, PQ, Canada
[9] Univ Melbourne, Sch Math & Stat, Melbourne Integrat Genom, Melbourne, Vic, Australia
[10] Univ Freiburg, Dept Math Stochast, Freiburg, Germany
[11] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[12] Arizona State Univ, Biodesign Inst, Tempe, AZ USA
[13] Arizona State Univ, Sch Life Sci, Tempe, AZ USA
[14] Univ Calif Los Angeles, David Geffen Sch Med, Dept Human Genet, Los Angeles, CA 90095 USA
[15] Herzliya Interdisciplinary Ctr, Efi Arazi Sch Comp Sci, Herzliyya, Israel
[16] Stanford Univ, Dept Biol, Stanford, CA 94305 USA
[17] Columbia Univ, Dept Ecol Evolut & Environm Biol, New York, NY USA
[18] Cornell Univ, Dept Computat Biol, Ithaca, NY USA
[19] ITMO Univ, Comp Technol Lab, St Petersburg, Russia
[20] Univ Nacl Autonoma Mexico, Int Lab Human Genome Res, Juriquilla, Mexico
[21] Univ Arizona, Dept Mol & Cellular Biol, Tucson, AZ 85721 USA
[22] Univ Oregon, Dept Math, Eugene, OR 97403 USA
[23] Univ Oxford, Li Ka Shing Ctr Hlth Informat & Discovery, Big Data Inst, Oxford, England
关键词
GENOME; INFERENCE; HISTORY; SIZE; MUTATIONS; LANDSCAPE; EVOLUTION; ROBUST;
D O I
10.7554/eLife.54967
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The explosion in population genomic data demands ever more complex modes of analysis, and increasingly, these analyses depend on sophisticated simulations. Recent advances in population genetic simulation have made it possible to simulate large and complex models, but specifying such models for a particular simulation engine remains a difficult and error-prone task. Computational genetics researchers currently re-implement simulation models independently, leading to inconsistency and duplication of effort. This situation presents a major barrier to empirical researchers seeking to use simulations for power analyses of upcoming studies or sanity checks on existing genomic data. Population genetics, as a field, also lacks standard benchmarks by which new tools for inference might be measured. Here, we describe a new resource, stdpopsim, that attempts to rectify this situation. Stdpopsim is a community-driven open source project, which provides easy access to a growing catalog of published simulation models from a range of organisms and supports multiple simulation engine backends. This resource is available as a well-documented python library with a simple command-line interface. We share some examples demonstrating how stdpopsim can be used to systematically compare demographic inference methods, and we encourage a broader community of developers to contribute to this growing resource.
引用
收藏
页码:1 / 39
页数:29
相关论文
共 59 条
[31]   Efficient pedigree recording for fast population genetics simulation [J].
Kelleher, Jerome ;
Thornton, Kevin R. ;
Ashander, Jaime ;
Ralph, Peter L. .
PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (11)
[32]   Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes [J].
Kelleher, Jerome ;
Etheridge, Alison M. ;
McVean, Gilean .
PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (05)
[33]  
Kemeny JG, 2012, Denumerable Markov chains, DOI [DOI 10.1007/978-1-4684-9455-6, 10.1007/978-1-4684-9455-6]
[34]   diploS/HIC: An Updated Approach to Classifying Selective Sweeps [J].
Kern, Andrew D. ;
Schrider, Daniel R. .
G3-GENES GENOMES GENETICS, 2018, 8 (06) :1959-1970
[35]   Inference of the Distribution of Selection Coefficients for New Nonsynonymous Mutations Using Large Samples [J].
Kim, Bernard Y. ;
Huber, Christian D. ;
Lohmueller, Kirk E. .
GENETICS, 2017, 206 (01) :345-361
[36]  
Kim Y, 2002, GENETICS, V160, P765
[37]   Snakemake-a scalable bioinformatics workflow engine [J].
Koester, Johannes ;
Rahmann, Sven .
BIOINFORMATICS, 2012, 28 (19) :2520-2522
[38]   Fine-scale recombination rate differences between sexes, populations and individuals [J].
Kong, Augustine ;
Thorleifsson, Gudmar ;
Gudbjartsson, Daniel F. ;
Masson, Gisli ;
Sigurdsson, Asgeir ;
Jonasdottir, Aslaug ;
Walters, G. Bragi ;
Jonasdottir, Adalbjorg ;
Gylfason, Arnaldur ;
Kristinsson, Kari Th. ;
Gudjonsson, Sigurjon A. ;
Frigge, Michael L. ;
Helgason, Agnar ;
Thorsteinsdottir, Unnur ;
Stefansson, Kari .
NATURE, 2010, 467 (7319) :1099-1103
[39]   Genomic Variation in Natural Populations of Drosophila melanogaster [J].
Langley, Charles H. ;
Stevens, Kristian ;
Cardeno, Charis ;
Lee, Yuh Chwen G. ;
Schrider, Daniel R. ;
Pool, John E. ;
Langley, Sasha A. ;
Suarez, Charlyn ;
Corbett-Detig, Russell B. ;
Kolaczkowski, Bryan ;
Fang, Shu ;
Nista, Phillip M. ;
Holloway, Alisha K. ;
Kern, Andrew D. ;
Dewey, Colin N. ;
Song, Yun S. ;
Hahn, Matthew W. ;
Begun, David J. .
GENETICS, 2012, 192 (02) :533-+
[40]   Inferring the demographic history and rate of adaptive substitution in Drosophila [J].
Li, Haipeng ;
Stephan, Wolfgang .
PLOS GENETICS, 2006, 2 (10) :1580-1589