Statistical and machine learning models for optimizing energy in parallel applications

被引:4
作者
Endrei, Mark [1 ,2 ]
Jin, Chao [1 ,2 ]
Minh Ngoc Dinh [1 ,2 ]
Abramson, David [1 ,2 ]
Poxon, Heidi [3 ]
DeRose, Luiz [4 ]
de Supinski, Bronis R. [5 ]
机构
[1] Univ Queensland, Res Comp Ctr, Brisbane, Qld 4072, Australia
[2] Univ Queensland, Sch ITEE, Brisbane, Qld 4072, Australia
[3] Cray Inc, Programming Environm Grp, Bloomington, MN USA
[4] Cray Inc, Bloomington, MN USA
[5] Lawrence Livermore Natl Lab, LC, Livermore, CA 94550 USA
基金
澳大利亚研究理事会;
关键词
Energy efficiency; performance; regression modeling; machine learning; high performance computing;
D O I
10.1177/1094342019842915
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Rising power costs and constraints are driving a growing focus on the energy efficiency of high performance computing systems. The unique characteristics of a particular system and workload and their effect on performance and energy efficiency are typically difficult for application users to assess and to control. Settings for optimum performance and energy efficiency can also diverge, so we need to identify trade-off options that guide a suitable balance between energy use and performance. We present statistical and machine learning models that only require a small number of runs to make accurate Pareto-optimal trade-off predictions using parameters that users can control. We study model training and validation using several parallel kernels and more complex workloads, including Algebraic Multigrid (AMG), Large-scale Atomic Molecular Massively Parallel Simulator, and Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. We demonstrate that we can train the models using as few as 12 runs, with prediction error of less than 10%. Our AMG results identify trade-off options that provide up to 45% improvement in energy efficiency for around 10% performance loss. We reduce the sample measurement time required for AMG by 90%, from 13 h to 74 min.
引用
收藏
页码:1079 / 1097
页数:19
相关论文
共 49 条
[41]  
Seabold S., 2010, 9 PYTHON SCI C, DOI [10.25080/majora-92bf1922-011, DOI 10.25080/MAJORA-92BF1922-011]
[42]   Pareto Governors for Energy-Optimal Computing [J].
Sen, Rathijit ;
Wood, David A. .
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2017, 14 (01)
[43]  
Song CH, 2011, 2011 INTERNATIONAL CONFERENCE ON ECONOMIC, EDUCATION AND MANAGEMENT (ICEEM2011), VOL I, P128
[44]   Towards Fine-grained Dynamic Tuning of HPC Applications on Modern Multi-Core Architectures [J].
Sourouri, Mohammed ;
Raknes, Espen Birger ;
Reissmann, Nico ;
Langguth, Johannes ;
Hackenberg, Daniel ;
Schoene, Robert ;
Kjeldsberg, Per Gunnar .
SC'17: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2017,
[45]  
Tiwari A, 2012, LECT NOTES COMPUT SC, V7156, P178, DOI 10.1007/978-3-642-29740-3_21
[46]  
Van der Wijngaart RF, 2014, IEEE HIGH PERF EXTR
[47]  
Venkatapathy Aswin Karthik Ramachandran, 2015, 2015 IEEE 16th International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM), P1, DOI 10.1109/WoWMoM.2015.7158206
[48]  
WILKINSON GN, 1973, ROY STAT SOC C-APP, V22, P392
[49]   Extending Amdahl's Law for Energy-Efficient Computing in the Many-Core Era [J].
Woo, Dong Hyuk ;
Lee, Hsien-Hsin S. .
COMPUTER, 2008, 41 (12) :24-+