Boosting forward-time population genetic simulators through genotype compression

被引：5

作者：

Ruths, Troy ^{[1
]}

Nakhleh, Luay ^{[1
]}

机构：

[1] Rice Univ, Dept Comp Sci, Houston, TX 77251 USA

来源：

BMC BIOINFORMATICS | 2013年 / 14卷

基金：

美国国家科学基金会;

关键词：

NETWORKS; BIOLOGY; MODELS;

D O I：

10.1186/1471-2105-14-192

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Background: Forward-time population genetic simulations play a central role in deriving and testing evolutionary hypotheses. Such simulations may be data-intensive, depending on the settings to the various parameters controlling them. In particular, for certain settings, the data footprint may quickly exceed the memory of a single compute node. Results: We develop a novel and general method for addressing the memory issue inherent in forward-time simulations by compressing and decompressing, in real-time, active and ancestral genotypes, while carefully accounting for the time overhead. We propose a general graph data structure for compressing the genotype space explored during a simulation run, along with efficient algorithms for constructing and updating compressed genotypes which support both mutation and recombination. We tested the performance of our method in very large-scale simulations. Results show that our method not only scales well, but that it also overcomes memory issues that would cripple existing tools. Conclusions: As evolutionary analyses are being increasingly performed on genomes, pathways, and networks, particularly in the era of systems biology, scaling population genetic simulators to handle large-scale simulations is crucial. We believe our method offers a significant step in that direction. Further, the techniques we provide are generic and can be integrated with existing population genetic simulators to boost their performance in terms of memory usage.

引用

页数：12

共 18 条

[1] Digital genetics: unravelling the genetic basis of evolution [J].