Footprint-Aware Power Capping for Hybrid Memory Based Systems

被引:6
作者
Arima, Eishi [1 ,2 ]
Hanawa, Toshihiro [1 ]
Trinitis, Carsten [2 ]
Schulz, Martin [2 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] Tech Univ Munich, Munich, Germany
来源
HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2020 | 2020年 / 12151卷
关键词
D O I
10.1007/978-3-030-50743-5_18
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High Performance Computing (HPC) systems are facing severe limitations in both power and memory bandwidth/capacity. By now, these limitations have been addressed individually: to improve performance under a strict power constraint, power capping, which sets power limits to components/nodes/jobs, is an indispensable feature; and for memory bandwidth/capacity increase, the industry has begun to support hybrid main memory designs that comprise multiple different technologies including emerging memories (e.g., 3D stacked DRAM or Non-Volatile RAM) in one compute node. However, few works look at the combination of both trends. This paper explicitly targets power managements on hybrid memory based HPC systems and is based on the following observation: in spite of the system software's efforts to optimize data allocations on such a system, the effective memory bandwidth can decrease considerably when we scale the problem size of applications. As a result, the performance bottleneck component changes in accordance with the footprint (or data) size, which then also changes the optimal power cap settings in a node. Motivated by this observation, we propose a power management concept called footprint-aware power capping (FPCAP) and a profile-driven software framework to realize it. Our experimental result on a real system using HPC benchmarks shows that our approach is successful in correctly setting power caps depending on the footprint size while keeping around 93/96% of performance/power-efficiency compared to the best settings.
引用
收藏
页码:347 / 369
页数:23
相关论文
共 39 条
[1]   Resistive Random Access Memory (ReRAM) Based on Metal Oxides [J].
Akinaga, Hiroyuki ;
Shima, Hisashi .
PROCEEDINGS OF THE IEEE, 2010, 98 (12) :2237-2251
[2]  
Arima E, 2015, 2015 33RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), P149, DOI 10.1109/ICCD.2015.7357096
[3]   Adaptive Configuration Selection for Power-Constrained Heterogeneous Systems [J].
Bailey, Peter E. ;
Lowenthal, David K. ;
Ravi, Vignesh ;
Rountree, Barry ;
Schulz, Martin ;
de Supinski, Bronis R. .
2014 43RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2014, :371-380
[4]   Demand-Aware Power Management for Power-Constrained HPC Systems [J].
Cao, Thang ;
He, Yuan ;
Kondo, Masaaki .
2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2016, :21-31
[5]  
Consortium H.M.C., 2015, Last Revision
[6]   CoScale: Coordinating CPU and Memory System DVFS in Server Systems [J].
Deng, Qingyuan ;
Meisner, David ;
Bhattacharjee, Abhishek ;
Wenisch, Thomas F. ;
Bianchini, Ricardo .
2012 IEEE/ACM 45TH INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-45), 2012, :143-154
[7]  
Dhiman G, 2009, DES AUT CON, P664
[8]   Dynamic Power Sharing for Higher Job Throughput [J].
Ellsworth, Daniel A. ;
Malony, Allen D. ;
Rountree, Barry ;
Schulz, Martin .
PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2015,
[9]  
Felter W., 2005, Proceedings of the 19th Annual International Conference on Supercomputing, P293, DOI DOI 10.1145/1088149.1088188
[10]   Application-Aware Power Coordination on Power Bounded NUMA Multicore Systems [J].
Ge, Rong ;
Zou, Pengfei ;
Feng, Xizhou .
2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, :591-600