ParallelStructure: A R Package to Distribute Parallel Runs of the Population Genetics Program STRUCTURE on Multi-Core Computers

被引:158
作者
Besnier, Francois [1 ]
Glover, Kevin A. [1 ,2 ]
机构
[1] Inst Marine Res, Dept Populat Genet, N-5024 Bergen, Norway
[2] Univ Bergen, Fac Math & Nat Sci, Dept Informat, Bergen, Norway
关键词
MARKERS;
D O I
10.1371/journal.pone.0070651
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This software package provides an R-based framework to make use of multi-core computers when running analyses in the population genetics program STRUCTURE. It is especially addressed to those users of STRUCTURE dealing with numerous and repeated data analyses, and who could take advantage of an efficient script to automatically distribute STRUCTURE jobs among multiple processors. It also consists of additional functions to divide analyses among combinations of populations within a single data set without the need to manually produce multiple projects, as it is currently the case in STRUCTURE. The package consists of two main functions: MPI_structure() and parallel_structure() as well as an example data file. We compared the performance in computing time for this example data on two computer architectures and showed that the use of the present functions can result in several-fold improvements in terms of computation time. ParallelStructure is freely available at https://r-forge.r-project.org/projects/parallstructure/.
引用
收藏
页数:5
相关论文
共 14 条
[1]  
[Anonymous], 2013, R LANG ENV STAT COMP
[2]   Inferring the demographic history of European Ficedula flycatcher populations [J].
Backstrom, Niclas ;
Saetre, Glenn-Peter ;
Ellegren, Hans .
BMC EVOLUTIONARY BIOLOGY, 2013, 13
[3]   Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study [J].
Evanno, G ;
Regnaut, S ;
Goudet, J .
MOLECULAR ECOLOGY, 2005, 14 (08) :2611-2620
[4]   Arlequin (version 3.0): An integrated software package for population genetics data analysis [J].
Excoffier, Laurent ;
Laval, Guillaume ;
Schneider, Stefan .
EVOLUTIONARY BIOINFORMATICS, 2005, 1 :47-50
[5]   Traces of human migrations in Helicobacter pylori populations [J].
Falush, D ;
Wirth, T ;
Linz, B ;
Pritchard, JK ;
Stephens, M ;
Kidd, M ;
Blaser, MJ ;
Graham, DY ;
Vacher, S ;
Perez-Perez, GI ;
Yamaoka, Y ;
Mégraud, F ;
Otto, K ;
Reichard, U ;
Katzowitsch, E ;
Wang, XY ;
Achtman, M ;
Suerbaum, S .
SCIENCE, 2003, 299 (5612) :1582-1585
[6]   A Genome-Scan Method to Identify Selected Loci Appropriate for Both Dominant and Codominant Markers: A Bayesian Perspective [J].
Foll, Matthieu ;
Gaggiotti, Oscar .
GENETICS, 2008, 180 (02) :977-993
[7]   HIERFSTAT, a package for R to compute and test hierarchical F-statistics [J].
Goudet, J .
MOLECULAR ECOLOGY NOTES, 2005, 5 (01) :184-186
[8]   GENELAND: a computer package for landscape genetics [J].
Guillot, G ;
Mortier, F ;
Estoup, A .
MOLECULAR ECOLOGY NOTES, 2005, 5 (03) :712-715
[9]   Origin of extant domesticated sunflowers in eastern North America [J].
Harter, AV ;
Gardner, KA ;
Falush, D ;
Lentz, DL ;
Bye, RA ;
Rieseberg, LH .
NATURE, 2004, 430 (6996) :201-205
[10]   adegenet:: a R package for the multivariate analysis of genetic markers [J].
Jombart, Thibaut .
BIOINFORMATICS, 2008, 24 (11) :1403-1405