A High-Performance Computing Implementation of Iterative Random Forest for the Creation of Predictive Expression Networks

Cited by: 24
Authors
Cliff, Ashley [1, 2]
Romero, Jonathon [1, 2]
Kainer, David [2]
Walker, Angelica [1, 2]
Furches, Anna [1, 2]
Jacobson, Daniel [1, 2]
Affiliations
[1] Univ Tennessee, Bredesen Ctr Interdisciplinary Res & Grad Educ, Knoxville, TN 37996 USA
[2] Oak Ridge Natl Lab, POB 2009, Oak Ridge, TN 37830 USA
Keywords
Random Forest; Iterative Random Forest; Gene Expression Networks; high-performance computing; X-AI-based eQTL
DOI
10.3390/genes10120996
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
As time progresses and technology improves, biological data sets are continuously increasing in size. New methods and new implementations of existing methods are needed to keep pace with this increase. In this paper, we present a high-performance computing (HPC)-capable implementation of Iterative Random Forest (iRF). This new implementation enables the explainable-AI eQTL analysis of SNP sets with over a million SNPs. Using this implementation, we also present a new method, iRF Leave One Out Prediction (iRF-LOOP), for the creation of Predictive Expression Networks on the order of 40,000 genes or more. We compare the new implementation of iRF with the previous R version and analyze its time to completion on two of the world's fastest supercomputers, Summit and Titan. We also show iRF-LOOP's ability to capture biologically significant results when creating Predictive Expression Networks. This new implementation of iRF will enable the analysis of biological data sets at scales that were previously not possible.
Pages: 13