REDUCING COMMUNICATION COSTS FOR SPARSE MATRIX MULTIPLICATION WITHIN ALGEBRAIC MULTIGRID

被引:21
作者
Rallard, Grey [1 ]
Siefer, Christopher [2 ]
Hu, Jonathan [1 ]
机构
[1] Sandia Natl Labs, Livermore, CA 94550 USA
[2] Sandia Natl Labs, POB 5800, Albuquerque, NM 87185 USA
关键词
parallel spar matrix-matrix multiplication; algebraic multigrid setup phase; outer-product algorithm;
D O I
10.1137/15M1028807
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We consider the sequence of sparse matrix-matrix multiplications performed during the setup phase of algebraic multigrid. In particular, we show that the most commonly used parallel algorithm is often not the TIC Pit communication-efficient one for all of the matrix-Matrix multiplications involved. By using an alternative algorithm, we show that the communication costs are reduced (in theory and practice), and we demonstrate the performance benefit for both model (structured) and more realistic unstructured problems on large-scale distributed-memory parallel systems. Our theoretical analysis shows that we can reduce communication by a factor of up to 5.4 for a model problem, and we observe in our empirical evaluation communication reductions of factors up to 4.7 for structured problems and 3.7 for unstructured problems. These reductions in communication translate to run-time speedups of factors up to 2.8 and 2.5, respectively.
引用
收藏
页码:C203 / C231
页数:29
相关论文
共 27 条
  • [1] SIMULTANEOUS INPUT AND OUTPUT MATRIX PARTITIONING FOR OUTER-PRODUCT-PARALLEL SPARSE MATRIX-MATRIX MULTIPLICATION
    Akbudak, Kadir
    Aykanat, Cevdet
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2014, 36 (05) : C568 - C590
  • [2] [Anonymous], 2000, P SUP ACM IEEE 2000
  • [3] [Anonymous], 1969, THESIS
  • [4] [Anonymous], 2006, Technical report SAND2006-2649
  • [5] Baker CG, 2012, SCI PROGRAMMING-NETH, V20, P115, DOI [10.1155/2012/693861, 10.3233/SPR-2012-0349]
  • [6] Ballard G., 2013, Proceedings of the 25th ACM Symposium on Parallelism in Algorithms and Architectures. SPAA '13, P222
  • [7] Brief Announcement: Hypergraph Partitioning for Parallel Sparse Matrix-Matrix Multiplication
    Ballard, Grey
    Druinsky, Alex
    Knight, Nicholas
    Schwartz, Oded
    [J]. SPAA'15: PROCEEDINGS OF THE 27TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2015, : 86 - 88
  • [8] EXPOSING FINE-GRAINED PARALLELISM IN ALGEBRAIC MULTIGRID METHODS
    Bell, Nathan
    Dalton, Steven
    Olson, Luke N.
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2012, 34 (04) : C123 - C152
  • [9] Sparse matrix multiplication: The distributed block-compressed sparse row library
    Borstnik, Urban
    VandeVondele, Joost
    Weber, Valery
    Hutter, Juerg
    [J]. PARALLEL COMPUTING, 2014, 40 (5-6) : 47 - 58
  • [10] Brandt A., 2011, MULTIGRID TECHNIQUES