PIPES: A Language and Compiler for Task-based Programming on Distributed-Memory Clusters

被引:0
作者
Kong, Martin [1 ]
Pouchet, Louis-Noel [2 ]
Sadayappan, P. [2 ]
Sarkar, Vivek [1 ]
机构
[1] Rice Univ, Houston, TX 77251 USA
[2] Ohio State Univ, Columbus, OH 43210 USA
来源
SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS | 2016年
基金
美国国家科学基金会;
关键词
Distributed computing; Concurrent Collections; task parallelism; macro-dataflow; polyhedral compilation;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Applications running on clusters of shared-memory computers are often implemented using OpenMP+MPI. Productivity can be vastly improved using task-based programming, a paradigm where the user expresses the data and control-flow relations between tasks, offering the runtime maximal freedom to place and schedule tasks. While productivity is increased, high-performance execution remains challenging: the implementation of parallel algorithms typically requires specific task placement and communication strategies to reduce inter-node communications and exploit data locality. In this work, we present a new macro-dataflow programming environment for distributed-memory clusters, based on the Intel Concurrent Collections (CnC) runtime. Our language extensions let the user define virtual topologies, task mappings, task-centric data placement, task and communication scheduling, etc. We introduce a compiler to automatically generate Intel CnC C++ run-time, with key automatic optimizations including task coarsening and coalescing. We experimentally validate our approach on a variety of scientific computations, demonstrating both productivity and performance.
引用
收藏
页码:456 / 467
页数:12
相关论文
共 32 条
[11]  
Dathathri R., 2016, TOPC
[12]   DATA-FLOW ANALYSIS OF ARRAY AND SCALAR REFERENCES [J].
FEAUTRIER, P .
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1991, 20 (01) :23-53
[13]   SOME EFFICIENT SOLUTIONS TO THE AFFINE SCHEDULING PROBLEM .2. MULTIDIMENSIONAL TIME [J].
FEAUTRIER, P .
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1992, 21 (06) :389-420
[14]  
Feautrier P., 1988, RAIRO RECHERCHE OPER
[15]   FLAME: Formal linear algebra methods environment [J].
Gunnels, JA ;
Gustavson, FG ;
Henry, GM ;
Van de Geijn, RA .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2001, 27 (04) :422-455
[16]  
Intel, 2010, INT CONC COLL C C RE
[17]  
Intel, INT CONC COLL
[18]  
Intel, INT CONC COLL C
[19]  
Knobe K., 2012, CPC
[20]  
Kong M., 2015, IWAPT