Optimization of a parallel permutation testing function for the SPRINT R package

被引:3
|
作者
Petrou, Savvas [1 ]
Sloan, Terence M. [1 ]
Mewissen, Muriel [2 ]
Forster, Thorsten [2 ]
Piotrowski, Michal [1 ]
Dobrzelecki, Bartosz [1 ]
Ghazal, Peter [2 ]
Trew, Arthur [1 ]
Hill, Jon [3 ]
机构
[1] Univ Edinburgh, Edinburgh Parallel Comp Ctr, Edinburgh EH9 3JZ, Midlothian, Scotland
[2] Univ Edinburgh, Sch Med, Div Pathway Med, Edinburgh EH16 4SB, Midlothian, Scotland
[3] Univ London Imperial Coll Sci Technol & Med, Dept Earth Sci & Engn, Appl Modeling & Computat Grp, London SW7 2AZ, England
来源
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2011年 / 23卷 / 17期
基金
英国惠康基金; 英国工程与自然科学研究理事会;
关键词
HPC; MPI; Permutation; Microarray; R; SPRINT;
D O I
10.1002/cpe.1787
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The statistical language R and its Bioconductor package are favoured by many biostatisticians for processing microarray data. The amount of data produced by some analyses has reached the limits of many common bioinformatics computing infrastructures. High Performance Computing systems offer a solution to this issue. The Simple Parallel R Interface (SPRINT) is a package that provides biostatisticians with easy access to High Performance Computing systems and allows the addition of parallelized functions to R. Previous work has established that the SPRINT implementation of an R permutation testing function has close to optimal scaling on up to 512 processors on a supercomputer. Access to supercomputers, however, is not always possible, and so the work presented here compares the performance of the SPRINT implementation on a supercomputer with benchmarks on a range of platforms including cloud resources and a common desktop machine with multiprocessing capabilities. Copyright (C) 2011 John Wiley & Sons, Ltd.
引用
收藏
页码:2258 / 2268
页数:11
相关论文
共 19 条
  • [1] bayesclust: An R Package for Testing and Searching for Significant Clusters
    Gopal, Vikneswaran
    Fuentes, Claudio
    Casella, George
    JOURNAL OF STATISTICAL SOFTWARE, 2012, 47 (14): : 1 - 21
  • [2] Genetic Optimization Using Derivatives: The rgenoud Package for R
    Mebane, Walter R., Jr.
    Sekhon, Jasjeet S.
    JOURNAL OF STATISTICAL SOFTWARE, 2011, 42 (11): : 1 - 26
  • [3] CEoptim: Cross-Entropy R Package for Optimization
    Benham, Tim
    Duan, Qibin
    Kroese, Dirk P.
    Liquet, Benoit
    JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (08): : 1 - 29
  • [4] trust Optim : An R Package for Trust Region Optimization with Sparse Hessians
    Braun, Michael
    JOURNAL OF STATISTICAL SOFTWARE, 2014, 60 (04):
  • [5] Dendroclimatic calibration in R: The bootRes package for response and correlation function analysis
    Zang, Christian
    Biondi, Franco
    DENDROCHRONOLOGIA, 2013, 31 (01) : 68 - 74
  • [6] Design Evaluation and Optimization of Population Pharmacokinetics Model Using an R Package PopED
    Ryu, Howon
    Wong, Weng Kee
    MATHEMATICS, 2023, 11 (21)
  • [7] Factor Analysis for Multiple Testing (FAMT): An R Package for Large-Scale Significance Testing under Dependence
    Causeur, David
    Friguet, Chloe
    Houee-Bigot, Magalie
    Kloareg, Maela
    JOURNAL OF STATISTICAL SOFTWARE, 2011, 40 (14): : 1 - 19
  • [8] Multivariate and Propensity Score Matching Software with Automated Balance Optimization: The Matching Package for R
    Sekhon, Jasjeet S.
    JOURNAL OF STATISTICAL SOFTWARE, 2011, 42 (07): : 1 - 52
  • [9] varTestnlme: An R Package for Variance Components Testing in Linear and Nonlinear Mixed-Effects Models
    Baey, Charlotte
    Kuhn, Estelle
    JOURNAL OF STATISTICAL SOFTWARE, 2023, 107 (06): : 1 - 32
  • [10] covidscreen: a web app and R Package for assessing asymptomatic COVID-19 testing strategies
    Jesse Smith
    Yilun Sun
    Diego R. Hijano
    James M. Hoffman
    Hana Hakim
    Richard J. Webby
    Randall T Hayden
    Aditya H. Gaur
    Gregory T. Armstrong
    Motomi Mori
    Li Tang
    BMC Public Health, 22