Prediction of high-performance computing input/output variability and its application to optimization for system configurations

被引:9
|
作者
Xu, Li [1 ]
Lux, Thomas [2 ]
Chang, Tyler [2 ]
Li, Bo [2 ]
Hong, Yili [1 ]
Watson, Layne [2 ]
Butt, Ali [2 ]
Yao, Danfeng [2 ]
Cameron, Kirk [2 ]
机构
[1] Virginia Tech, Dept Stat, Blacksburg, VA 24061 USA
[2] Virginia Tech, Dept Comp Sci, Blacksburg, VA USA
基金
美国国家科学基金会;
关键词
Approximation methods; computer experiments; design analysis; Gaussian process; reliability; system design;
D O I
10.1080/08982112.2020.1866203
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Performance variability is an important measure for a reliable high performance computing (HPC) system. Performance variability is affected by complicated interactions between numerous factors, such as CPU frequency, the number of input/output (IO) threads, and the IO scheduler. In this paper, we focus on HPC IO variability. The prediction of HPC variability is a challenging problem in the engineering of HPC systems and there is little statistical work on this problem to date. Although there are many methods available in the computer experiment literature, the applicability of existing methods to HPC performance variability needs investigation, especially, when the objective is to predict performance variability both in interpolation and extrapolation settings. A data analytic framework is developed to model data collected from large-scale experiments. Various promising methods are used to build predictive models for the variability of HPC systems. We evaluate the performance of the methods by measuring prediction accuracy at previously unseen system configurations. We also discuss a methodology for optimizing system configurations that uses the estimated variability map. The findings from method comparisons and developed tool sets in this paper yield new insights into existing statistical methods and can be beneficial for the practice of HPC variability management. This paper has .
引用
收藏
页码:318 / 334
页数:17
相关论文
共 50 条
  • [1] Prediction for distributional outcomes in high-performance computing input/output variability
    Xu, Li
    Hong, Yili
    Morris, Max D.
    Cameron, Kirk W.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2024, 73 (03) : 561 - 580
  • [2] A HIGH-PERFORMANCE ANALOG INPUT-OUTPUT SYSTEM FOR TRANSPUTER APPLICATIONS
    ELPHICK, JR
    CLARKE, T
    LAWES, ST
    MICROPROCESSORS AND MICROSYSTEMS, 1995, 19 (01) : 3 - 8
  • [3] Prediction and characterization of application power use in a high-performance computing environment
    Bugbee, Bruce
    Phillips, Caleb
    Egan, Hilary
    Elmore, Ryan
    Gruchalla, Kenny
    Purkayastha, Avi
    STATISTICAL ANALYSIS AND DATA MINING, 2017, 10 (03) : 155 - 165
  • [4] DigitalLung: Application of High-Performance Computing to Biological System Simulation
    Burgreen, Greg W.
    Hester, Robert
    Soni, Bela
    Thompson, David
    Walters, D. Keith
    Walters, Keisha
    ADVANCES IN COMPUTATIONAL BIOLOGY, 2010, 680 : 573 - 584
  • [5] IMORC: Application Mapping, Monitoring and Optimization for High-Performance Reconfigurable Computing
    Schumacher, Tobias
    Plessl, Christian
    Platzner, Marco
    PROCEEDINGS OF THE 2009 17TH IEEE SYMPOSIUM ON FIELD PROGRAMMABLE CUSTOM COMPUTING MACHINES, 2009, : 275 - 278
  • [6] High-Performance Computing for Protein Fold Prediction
    Chuang, Li-Yeh
    Lin, Yu-Da
    Yang, Cheng-Hong
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [7] Input/Output APIs and Data Organization for High Performance Scientific Computing
    Lofstead, Jay
    Zheng, Fang
    Klasky, Scott
    Schwan, Karsten
    PDSW'08: PROCEEDINGS OF THE 2008 3RD PETASCALE DATA STORAGE WORKSHOP, 2008, : 1 - +
  • [8] High performance Java']Java input/output for heterogeneous distributed computing
    Pérez, JM
    Sanchez, LM
    García, F
    Calderón, A
    Carretero, J
    10TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, PROCEEDINGS, 2005, : 969 - 974
  • [9] Industrial application areas of high-performance computing
    Strohmaier, E
    Dongarra, JJ
    Meuer, HW
    Simon, HD
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 1997, 1225 : 3 - 10
  • [10] Application of Virtualization Technology in High-Performance Computing
    Yan Junhao
    Xue Mingxia
    THIRD INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2010), 2010, : 242 - 244