Energy efficiency vs. performance of the numerical solution of PDEs: An application study on a low-power ARM-based cluster

被引:45
作者
Goeddeke, Dominik [1 ]
Komatitsch, Dimitri [2 ]
Geveler, Markus [1 ]
Ribbrock, Dirk [1 ]
Rajovic, Nikola [3 ,4 ]
Puzovic, Nikola [3 ]
Ramirez, Alex [3 ,4 ]
机构
[1] TU Dortmund, Fac Math, Inst Angew Math, Dortmund, Germany
[2] Univ Aix Marseille, CNRS, Lab Mech & Acoust, Marseille, France
[3] Barcelona Supercomp Ctr, Dept Comp Sci, Barcelona, Spain
[4] Univ Politecn Cataluna, BarcelonaTech, Dept Arquitectura Comp, Barcelona, Spain
关键词
High performance computing; Energy efficiency; Low-power processors; ARM processors; Parallel scalability; Finite elements; Multigrid; Wave propagation; Lattice-Boltzmann; SEISMIC-WAVE PROPAGATION; SPECTRAL-ELEMENT; PARALLEL; SIMULATIONS; COMPUTATION; PROCESSORS; FUTURE; EARTH;
D O I
10.1016/j.jcp.2012.11.031
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Power consumption and energy efficiency are becoming critical aspects in the design and operation of large scale HPC facilities, and it is unanimously recognised that future exascale supercomputers will be strongly constrained by their power requirements. At current electricity costs, operating an HPC system over its lifetime can already be on par with the initial deployment cost. These power consumption constraints, and the benefits a more energy-efficient HPC platform may have on other societal areas, have motivated the HPC research community to investigate the use of energy-efficient technologies originally developed for the embedded and especially mobile markets. However, lower power does not always mean lower energy consumption, since execution time often also increases. In order to achieve competitive performance, applications then need to efficiently exploit a larger number of processors. In this article, we discuss how applications can efficiently exploit this new class of low-power architectures to achieve competitive performance. We evaluate if they can benefit from the increased energy efficiency that the architecture is supposed to achieve. The applications that we consider cover three different classes of numerical solution methods for partial differential equations, namely a low-order finite element multigrid solver for huge sparse linear systems of equations, a Lattice-Boltzmann code for fluid simulation, and a high-order spectral element method for acoustic or seismic wave propagation modelling. We evaluate weak and strong scalability on a cluster of 96 ARM Cortex-A9 dual-core processors and demonstrate that the ARM-based cluster can be more efficient in terms of energy to solution when executing the three applications compared to an x86-based reference machine. (c) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:132 / 150
页数:19
相关论文
共 70 条
[1]  
Akcelik V., 2003, Supercomputing, 2003 ACM/IEEE Conference, P52, DOI DOI 10.1109/SC.2003.10056
[2]  
[Anonymous], COMPUTER SCI RES DEV
[3]  
[Anonymous], 2011, Growth in data center electricity use 2005 to 2010
[4]  
Anzt H., 2011, 2011 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, P829, DOI 10.1109/IPDPS.2011.226
[5]  
ARM holdings plc, 2011, ANN REP ACC 2011
[6]  
ARM Ltd, 2011, CORT A9 REV R3P0 TEC
[7]  
ARM Ltd, 2011, CORT A9 PROC
[8]  
ARM Ltd, 2011, CORT SER PROGR GUID
[9]  
ARM Ltd, 2011, CORT A9 FLOAT POINT
[10]  
Augustin W., 2011, NEW FRONT HIGH PERF, P1