Reducing Static Energy in Supercomputer Interconnection Networks Using Topology-Aware Partitioning

被引:6
作者
Chen, Juan [1 ]
Tang, Yuhua [1 ]
Dong, Yong [1 ]
Xue, Jingling [2 ]
Wang, Zhiyuan [1 ]
Zhou, Wenhao [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
[2] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
Supercomputer interconnection networks; topology-aware partitioning; static energy management; switching off unused routers; Tianhe-2; POWER; CHIP;
D O I
10.1109/TC.2015.2493523
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The key to reducing static energy in supercomputers is switching off their unused components. Routers are the major components of a supercomputer. Whether routers can be effectively switched off or not has become the key to static energy management for supercomputers. For many typical applications, the routers in a supercomputer exhibit low utilization. However, there is no effective method to switch the routers off when they are idle. By analyzing the router occupancy in time and space, for the first time, we present a routing-policy guided topology partitioning methodology to solve this problem. We propose topology partitioning methods for three kinds of commonly used topologies (mesh, torus and fat-tree) equipped with the three most popular routing policies (deterministic routing, directionally adaptive routing and fully adaptive routing). Based on the above methods, we propose the key techniques required in this topology partitioning based static energy management in supercomputer interconnection networks to switch off unused routers in both time and space dimensions. Three topology-aware resource allocation algorithms have been developed to handle effectively different job-mixes running on a supercomputer. We validate the effectiveness of our methodology by using Tianhe-2 and a simulator for the aforementioned topologies and routing policies. The energy savings achieved on a subsystem of Tianhe-2 range from 3.8 to 79.7 percent. This translates into a yearly energy cost reduction of up to half a million US dollars for Tianhe-2.
引用
收藏
页码:2588 / 2602
页数:15
相关论文
共 35 条
  • [1] Abts D, 2010, CONF PROC INT SYMP C, P338, DOI 10.1145/1816038.1816004
  • [2] TOFU: A 6D MESH/TORUS INTERCONNECT FOR EXASCALE COMPUTERS
    Ajima, Yuichiro
    Sumimoto, Shinji
    Shimizu, Toshiyuki
    [J]. COMPUTER, 2009, 42 (11) : 36 - 40
  • [3] [Anonymous], 1975, Queueing Systems
  • [4] [Anonymous], 2011, ICS 11, DOI [10.1145/1995896.1995909, DOI 10.1145/1995896.1995909]
  • [5] [Anonymous], 2014, O SOURCE LATTICE BOL
  • [6] NoRD: Node-Router Decoupling for Effective Power-gating of On-Chip Routers
    Chen, Lizhong
    Pinkston, Timothy M.
    [J]. 2012 IEEE/ACM 45TH INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-45), 2012, : 270 - 281
  • [7] Energy- and performance-aware incremental mapping for networks on chip with multiple voltage levels
    Chou, Chen-Ling
    Ogras, Umit Y.
    Marculescu, Radu
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2008, 27 (10) : 1866 - 1879
  • [8] Dally W., 2003, Principles and Practices of Interconnection Networks
  • [9] Analysis of Linpack and power efficiencies of the world's TOP500 supercomputers
    Deng, Yuefan
    Zhang, Peng
    Marques, Carlos
    Powell, Reid
    Zhang, Li
    [J]. PARALLEL COMPUTING, 2013, 39 (6-7) : 271 - 279
  • [10] Exploiting Geometric Partitioning in Task Mapping for Parallel Computers
    Deveci, Mehmet
    Rajamanickam, Sivasankaran
    Leung, Vitus J.
    Pedretti, Kevin
    Olivier, Stephen L.
    Bunde, David P.
    Catalyfirek, Emit V.
    Devine, Karen
    [J]. 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,