An embedded sectioning scheme for multiprocessor topology-aware mapping of irregular applications

被引:3
|
作者
Kirmani, Shad [1 ]
Park, Jeonghyung [1 ]
Raghavan, Padma [1 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
来源
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS | 2017年 / 31卷 / 01期
基金
美国国家科学基金会;
关键词
Topology; mapping; embedding;
D O I
10.1177/1094342015597082
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of mapping irregular applications to multiprocessor architectures whose interconnect topologies affect the latencies of data movement across processor nodes. The starting point for solutions to this problem concerns suitable weighted graph representations of an irregular application and a processor topology. Prior results for this problem have demonstrated that graph partitioning approaches can provide high-quality solutions. Additionally, when coordinate information is available for the weighted graph of the application, the geometric mapping schemes can also provide high-quality solutions. We develop and present a scheme that we call embedded sectioning' that directly computes a locality enhancing embedding of the weighted graph representation which is then mapped to the processor topology using recursive coordinate bisection. Our scheme is specifically directed at gaining high-quality mappings for highly irregular applications where the amount of communication can vary greatly. We evaluate the quality of mappings produced by embedded sectioning for mesh-based processor topologies using well-accepted measures including congestion, dilation and their product, referred to as the communication volume. For a test suite of unit-weight graphs mapped to a 32x32 mesh of processors, our method improves congestion by 26%, dilation by 52% and communication volume by 64% relative to the best values of these measures from nine other schemes. Additionally, we observe that these improvements increase with an increase in the skewness of communication in applications. For a test suite with a skewness of two the corresponding improvements for congestion, dilation and communication volume are 72%, 52% and 87%, respectively.
引用
收藏
页码:91 / 103
页数:13
相关论文
共 50 条
  • [1] Topology-Aware Reliability Optimization for Multiprocessor Systems
    Meng, Jie
    Kaplan, Fulya
    Hsieh, Mingyu
    Coskun, Ayse K.
    2012 IEEE/IFIP 20TH INTERNATIONAL CONFERENCE ON VLSI AND SYSTEM-ON-CHIP (VLSI-SOC), 2012, : 243 - U348
  • [2] Topology-aware job mapping
    Georgiou, Yiannis
    Jeannot, Emmanuel
    Mercier, Guillaume
    Villiermet, Adele
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2018, 32 (01): : 14 - 27
  • [3] Topology-aware application layer multicast scheme
    Zhang X.-C.
    Wang Z.
    Luo W.-M.
    Yan B.-P.
    Ruan Jian Xue Bao/Journal of Software, 2010, 21 (08): : 2010 - 2022
  • [4] Netloc: A Tool for Topology-Aware Process Mapping
    Bordage, Cyril
    Foyer, Clement
    Goglin, Brice
    EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 157 - 166
  • [5] Topology-aware resource management for HPC applications
    Georgiou, Yiannis
    Jeannot, Emmanuel
    Mercier, Guillaume
    Villiermet, Adele
    18TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING (ICDCN 2017), 2017,
  • [6] Topology-aware handoff scheme for surveillance patrol robot
    Tseng, Chien-Chao
    Lin, Chia-Liang
    Shih, Bih-Yaw
    Chen, Chia-Yen
    Chen, Chen-Yuan
    NONLINEAR DYNAMICS, 2013, 73 (03) : 2073 - 2081
  • [7] Topology-aware handoff scheme for surveillance patrol robot
    Chien-Chao Tseng
    Chia-Liang Lin
    Bih-Yaw Shih
    Chia-Yen Chen
    Chen-Yuan Chen
    Nonlinear Dynamics, 2013, 73 : 2073 - 2081
  • [8] Fast and high quality topology-aware task mapping
    Deveci, Mehmet
    Kaya, Kamer
    Ucar, Bora
    Catalyurek, Umit V.
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 197 - 206
  • [9] Communication Characterization and Optimization of Applications Using Topology-Aware Task Mapping on Large Supercomputers
    Sreepathi, Sarat
    D'Azevedo, Ed
    Philip, Bobby
    Worley, Patrick
    PROCEEDINGS OF THE 2016 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE'16), 2016, : 225 - 236
  • [10] A Topology-Aware Adaptive Deployment Framework for Elastic Applications
    Keller, Matthias
    Peuster, Manuel
    Robbert, Christoph
    Karl, Holger
    2013 17TH INTERNATIONAL CONFERENCE ON INTELLIGENCE IN NEXT GENERATION NETWORKS (ICIN), 2013, : 61 - 69