Advanced Computing and Optimization Infrastructure for Extremely Large-Scale Graphs on Post Peta-Scale Supercomputers

被引:1
作者
Fujisawa, Katsuki [1 ,2 ]
Endo, Toshio [3 ,4 ]
Yasui, Yuichiro [5 ,6 ]
机构
[1] Kyushu Univ, Inst Math Ind, Fukuoka, Japan
[2] JST CREST, Fukuoka, Japan
[3] Tokyo Inst Technol, Global Sci Informat & Comp Ctr, Tokyo, Japan
[4] JST CREST, Tokyo, Japan
[5] Kyushu Univ, Ctr Coevolut Social Syst, Fukuoka, Japan
[6] JST COI, Fukuoka, Japan
来源
MATHEMATICAL SOFTWARE, ICMS 2016 | 2016年 / 9725卷
关键词
Graph analysis; Breadth-first search; Optimization problem; High performance computing; Supercomputer; Big data;
D O I
10.1007/978-3-319-42432-3_33
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this talk, we present our ongoing research project. The objective of this project is to develop advanced computing and optimization infrastructures for extremely large-scale graphs on post peta-scale supercomputers. We explain our challenge to Graph 500 and Green Graph 500 benchmarks that are designed to measure the performance of a computer system for applications that require irregular memory and network access patterns. The 1st Graph500 list was released in November 2010. The Graph500 benchmark measures the performance of any supercomputer performing a BFS (Breadth-First Search) in terms of traversed edges per second (TEPS). In 2014 and 2015, our project team was a winner of the 8th, 10th, and 11th Graph500 and the 3rd to 6th Green Graph500 benchmarks, respectively. We also present our parallel implementation for large-scale SDP (SemiDefinite Programming) problem. The semidefinite programming (SDP) problem is a predominant problem in mathematical optimization. The primal-dual interior-point method (PDIPM) is one of the most powerful algorithms for solving SDP problems, and many research groups have employed it for developing software packages. We solved the largest SDP problem (which has over 2.33 million constraints), thereby creating a new world record. Our implementation also achieved 1.774 PFlops in double precision for large-scale Cholesky factorization using 2,720 CPUs and 4,080 GPUs on the TSUBAME 2.5 supercomputer.
引用
收藏
页码:265 / 274
页数:10
相关论文
共 25 条
  • [11] Iwabuchi K., 2013, INT C HIGH PERF COMP
  • [12] Iwabuchi K., 2014, P IEEE BIG DATA2014
  • [13] Nakata M, 2013, LECT NOTES SER INST, V24, P163
  • [14] Suzumura T., 2011, P 2011 IEEE INT S WO
  • [15] Tsujita Y., 2015, 1 INT WORKSH EXTR SC, DOI [10.1145/2832241.2832245, DOI 10.1145/2832241.2832245]
  • [16] Ueno K., 2012, 21 INT ACM S HIGH PE
  • [17] SDPARA: SemiDefinite Programming Algorithm paRAllel version
    Yamashita, M
    Fujisawa, K
    Kojima, M
    [J]. PARALLEL COMPUTING, 2003, 29 (08) : 1053 - 1067
  • [18] Yamashita M., 2011, INT SERIES OPERATION
  • [19] Algorithm 925: Parallel Solver for Semidefinite Programming Problem having Sparse Schur Complement Matrix
    Yamashita, Makoto
    Fujisawa, Katsuki
    Fukuda, Mituhiro
    Nakata, Kazuhide
    Nakata, Maho
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2012, 39 (01):
  • [20] Yasui Yuichiro, 2014, Supercomputing. 29th International Conference, ISC 2014. Proceedings: LNCS 8488, P365, DOI 10.1007/978-3-319-07518-1_23