X10 and APGAS at Petascale

被引:0
|
作者
Tardieu, Olivier [1 ]
Herta, Benjamin [1 ]
Cunningham, David [2 ]
Grove, David [1 ]
Kambadur, Prabhanjan [1 ]
Saraswat, Vijay [1 ]
Shinnar, Avraham [1 ]
Takeuchi, Mikio [3 ]
Vaziri, Mandana [1 ]
机构
[1] IBM Corp, TJ Watson Res Ctr, Armonk, NY 10504 USA
[2] Google Inc, Mountain View, CA USA
[3] IBM Res Tokyo, Tokyo, Japan
关键词
X10; APGAS; scalability; performance;
D O I
10.1145/2692916.2555245
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
X10 is a high-performance, high-productivity programming language aimed at large-scale distributed and shared-memory parallel applications. It is based on the Asynchronous Partitioned Global Address Space (APGAS) programming model, supporting the same fine-grained concurrency mechanisms within and across shared-memory nodes. We demonstrate that X10 delivers solid performance at petascale by running (weak scaling) eight application kernels on an IBM Power 775 supercomputer utilizing up to 55,680 Power7 cores (for 1.7 Pflop/s of theoretical peak performance). We detail our advances in distributed termination detection, distributed load balancing, and use of high-performance interconnects that enable X10 to scale out to tens of thousands of cores. For the four HPC Class 2 Challenge benchmarks, X10 achieves 41% to 87% of the system's potential at scale (as measured by IBM's HPCC Class 1 optimized runs). We also implement K-Means, Smith-Waterman, Betweenness Centrality, and Unbalanced Tree Search (UTS) for geometric trees. Our UTS implementation is the first to scale to petaflop systems.
引用
收藏
页码:53 / 66
页数:14
相关论文
共 50 条
  • [31] Improving the Performance of X10 Programs by Clock Removal
    Feautrier, Paul
    Violard, Eric
    Ketterlin, Alain
    COMPILER CONSTRUCTION, 2014, 8409 : 113 - 132
  • [33] Experiences with Implementing Task Pools in Chapel and X10
    Fohry, Claudia
    Breitbart, Jens
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT II, 2014, 8385 : 75 - 85
  • [34] Array Dataflow Analysis for Polyhedral X10 Programs
    Yuki, Tomofumi
    Feautrier, Paul
    Rajopadhye, Sanjay
    Saraswat, Vijay
    ACM SIGPLAN NOTICES, 2013, 48 (08) : 23 - 33
  • [35] 《北京亦庄X10号地》简介
    文耀
    考古, 2011, (03) : 90 - 90
  • [36] Resilient X10 Efficient failure-aware programming
    Cunningham, David
    Grove, David
    Herta, Benjamin
    Iyengar, Arun
    Kawachiya, Kiyokuni
    Murata, Hiroki
    Saraswat, Vijay
    Takeuchi, Mikio
    Tardieu, Olivier
    ACM SIGPLAN NOTICES, 2014, 49 (08) : 67 - 80
  • [37] X10-FT: Transparent fault tolerance for APGAS language and runtime
    Hao, Zhijun
    Xie, Chenning
    Chen, Haibo
    Zang, Binyu
    PARALLEL COMPUTING, 2014, 40 (02) : 136 - 156
  • [38] X10-PM: Transparent and Efficient Place Migration for APGAS Language
    Hao, Zhijun
    Zang, Binyu
    Xie, Chenning
    Chen, Haibo
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1337 - 1344
  • [39] CAN THE HADRON EXPERIMENT RESULTS AT 2 X10(3)-2 X10(4) TEV BE EXPLAINED IN THE QUARK-GLUON STRING MODEL
    DUNAEVSKY, AM
    KRUTIKOVA, NP
    JOURNAL OF PHYSICS G-NUCLEAR AND PARTICLE PHYSICS, 1993, 19 (12) : 2167 - 2176