Advanced Computing and Optimization Infrastructure for Extremely Large-Scale Graphs on Post Peta-Scale Supercomputers

Cited by: 1
Authors
Fujisawa, Katsuki [1, 2]
Endo, Toshio [3, 4]
Yasui, Yuichiro [5, 6]
Affiliations
[1] Kyushu Univ, Inst Math Ind, Fukuoka, Japan
[2] JST CREST, Fukuoka, Japan
[3] Tokyo Inst Technol, Global Sci Informat & Comp Ctr, Tokyo, Japan
[4] JST CREST, Tokyo, Japan
[5] Kyushu Univ, Ctr Coevolut Social Syst, Fukuoka, Japan
[6] JST COI, Fukuoka, Japan
Source
MATHEMATICAL SOFTWARE, ICMS 2016 | 2016 / Vol. 9725
Keywords
Graph analysis; Breadth-first search; Optimization problem; High performance computing; Supercomputer; Big data;
DOI
10.1007/978-3-319-42432-3_33
Chinese Library Classification
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
In this talk, we present our ongoing research project, whose objective is to develop advanced computing and optimization infrastructures for extremely large-scale graphs on post peta-scale supercomputers. We describe our work on the Graph500 and Green Graph500 benchmarks, which are designed to measure the performance of a computer system for applications that require irregular memory and network access patterns. The first Graph500 list was released in November 2010. The Graph500 benchmark measures the performance of a supercomputer performing a breadth-first search (BFS) in terms of traversed edges per second (TEPS). In 2014 and 2015, our project team won the 8th, 10th, and 11th Graph500 benchmarks and the 3rd to 6th Green Graph500 benchmarks. We also present our parallel implementation for large-scale semidefinite programming (SDP) problems. SDP is a predominant problem in mathematical optimization. The primal-dual interior-point method (PDIPM) is one of the most powerful algorithms for solving SDP problems, and many research groups have employed it in developing software packages. We solved the largest SDP problem (which has over 2.33 million constraints), thereby setting a new world record. Our implementation also achieved 1.774 PFlops in double precision for large-scale Cholesky factorization using 2,720 CPUs and 4,080 GPUs on the TSUBAME 2.5 supercomputer.
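The TEPS metric mentioned in the abstract can be illustrated with a minimal, single-threaded sketch. It is not the authors' implementation and does not reproduce the official benchmark's graph generator or validation steps. The toy graph `ADJ` and the helper names `bfs_parents` and `measure_teps` are assumptions introduced only for illustration, as is the choice to count each undirected edge in the reached component once.

```python
import time
from collections import deque

# Hypothetical toy graph (undirected, stored as adjacency lists).
# Vertices 4 and 5 form a separate component unreachable from vertex 0.
ADJ = {
    0: [1, 2],
    1: [0, 3],
    2: [0, 3],
    3: [1, 2],
    4: [5],
    5: [4],
}

def bfs_parents(adj, root):
    """Run a BFS from root; return the parent map and the number of
    adjacency entries scanned during the search."""
    parent = {root: root}          # the root is its own parent
    queue = deque([root])
    scanned = 0
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            scanned += 1
            if v not in parent:    # first visit: record the BFS tree edge
                parent[v] = u
                queue.append(v)
    return parent, scanned

def measure_teps(adj, root):
    """Time one BFS and return (teps, parent, traversed_edges).

    Assumption: every undirected edge in the reached component appears
    twice in the adjacency lists, so scanned // 2 counts each edge once.
    """
    start = time.perf_counter()
    parent, scanned = bfs_parents(adj, root)
    elapsed = time.perf_counter() - start
    traversed = scanned // 2
    teps = traversed / elapsed if elapsed > 0 else float("inf")
    return teps, parent, traversed

if __name__ == "__main__":
    rate, parent, traversed = measure_teps(ADJ, 0)
    print(f"reached {len(parent)} vertices, {traversed} edges, {rate:.0f} TEPS")
```

On a graph this small the measured rate is dominated by timer resolution; the sketch only shows how the quantity is formed from an edge count and an elapsed time.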
Pages: 265-274
Number of pages: 10