Advanced Computing and Optimization Infrastructure for Extremely Large-Scale Graphs on Post Peta-Scale Supercomputers

被引：1

作者：

Fujisawa, Katsuki ^{[1
,2
]}

Endo, Toshio ^{[3
,4
]}

Yasui, Yuichiro ^{[5
,6
]}

机构：

[1] Kyushu Univ, Inst Math Ind, Fukuoka, Japan

[2] JST CREST, Fukuoka, Japan

[3] Tokyo Inst Technol, Global Sci Informat & Comp Ctr, Tokyo, Japan

[4] JST CREST, Tokyo, Japan

[5] Kyushu Univ, Ctr Coevolut Social Syst, Fukuoka, Japan

[6] JST COI, Fukuoka, Japan

来源：

MATHEMATICAL SOFTWARE, ICMS 2016 | 2016年 / 9725卷

关键词：

Graph analysis; Breadth-first search; Optimization problem; High performance computing; Supercomputer; Big data;

D O I：

10.1007/978-3-319-42432-3_33

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this talk, we present our ongoing research project. The objective of this project is to develop advanced computing and optimization infrastructures for extremely large-scale graphs on post peta-scale supercomputers. We explain our challenge to Graph 500 and Green Graph 500 benchmarks that are designed to measure the performance of a computer system for applications that require irregular memory and network access patterns. The 1st Graph500 list was released in November 2010. The Graph500 benchmark measures the performance of any supercomputer performing a BFS (Breadth-First Search) in terms of traversed edges per second (TEPS). In 2014 and 2015, our project team was a winner of the 8th, 10th, and 11th Graph500 and the 3rd to 6th Green Graph500 benchmarks, respectively. We also present our parallel implementation for large-scale SDP (SemiDefinite Programming) problem. The semidefinite programming (SDP) problem is a predominant problem in mathematical optimization. The primal-dual interior-point method (PDIPM) is one of the most powerful algorithms for solving SDP problems, and many research groups have employed it for developing software packages. We solved the largest SDP problem (which has over 2.33 million constraints), thereby creating a new world record. Our implementation also achieved 1.774 PFlops in double precision for large-scale Cholesky factorization using 2,720 CPUs and 4,080 GPUs on the TSUBAME 2.5 supercomputer.

引用

页码：265 / 274

页数：10

共 25 条

[11] Iwabuchi K., 2013, INT C HIGH PERF COMP
[12] Iwabuchi K., 2014, P IEEE BIG DATA2014
[13] Nakata M, 2013, LECT NOTES SER INST, V24, P163
[14] Suzumura T., 2011, P 2011 IEEE INT S WO
[15] Tsujita Y., 2015, 1 INT WORKSH EXTR SC, DOI [10.1145/2832241.2832245, DOI 10.1145/2832241.2832245]
[16] Ueno K., 2012, 21 INT ACM S HIGH PE
[17] SDPARA: SemiDefinite Programming Algorithm paRAllel version
Yamashita, M
Fujisawa, K
Kojima, M
[J]. PARALLEL COMPUTING, 2003, 29 (08) : 1053 - 1067
[18] Yamashita M., 2011, INT SERIES OPERATION
[19] Algorithm 925: Parallel Solver for Semidefinite Programming Problem having Sparse Schur Complement Matrix
Yamashita, Makoto
Fujisawa, Katsuki
Fukuda, Mituhiro
Nakata, Kazuhide
Nakata, Maho
[J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2012, 39 (01):
[20] Yasui Yuichiro, 2014, Supercomputing. 29th International Conference, ISC 2014. Proceedings: LNCS 8488, P365, DOI 10.1007/978-3-319-07518-1_23

← 1 2 3 →