Communication Lower Bounds for Matricized Tensor Times Khatri-Rao Product

被引:22
作者
Ballard, Grey [1 ]
Rouse, Kathryn [1 ]
Knight, Nicholas [2 ]
机构
[1] Wake Forest Univ, Dept Comp Sci, Winston Salem, NC 27109 USA
[2] NYU, Dept Math, New York, NY USA
来源
2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS) | 2018年
关键词
COLLECTIVE COMMUNICATION; DECOMPOSITIONS;
D O I
10.1109/IPDPS.2018.00065
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The matricized-tensor times Khatri-Rao product (MTTKRP) computation is the typical bottleneck in algorithms for computing a CP decomposition of a tensor. In order to develop high performance sequential and parallel algorithms, we establish communication lower bounds that identify how much data movement is required for this computation in the case of dense tensors. We also present sequential and parallel algorithms that attain the lower bounds and are therefore communication optimal. In particular, we show that the structure of the computation allows for less communication than the straightforward approach of casting the computation as a matrix multiplication operation.
引用
收藏
页码:557 / 567
页数:11
相关论文
共 27 条
  • [1] Aggour K. S., 2016, 1602 RENSS POL I
  • [2] Anandkumar A, 2014, J MACH LEARN RES, V15, P2773
  • [3] [Anonymous], 1981, P 13 ACM S THEOR COM, DOI DOI 10.1145/800076.802486
  • [4] [Anonymous], 2013, UCBEECS201361
  • [5] Parallel Tensor Compression for Large-Scale Scientific Data
    Austin, Woody
    Ballard, Grey
    Kolda, Tamara G.
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 912 - 922
  • [6] Efficient MATLAB computations with sparse and factored tensors
    Bader, Brett W.
    Kolda, Tamara G.
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2007, 30 (01) : 205 - 231
  • [7] Communication lower bounds and optimal algorithms for numerical linear algebra
    Ballard, G.
    Carson, E.
    Demmel, J.
    Hoemmen, M.
    Knight, N.
    Schwartz, O.
    [J]. ACTA NUMERICA, 2014, 23 : 1 - 155
  • [8] Ballard G., 2012, Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA '12, P77, DOI DOI 10.1145/2312005.2312021
  • [9] Ballard Grey., 2017, COMMUNICATION LOWER
  • [10] Ballard Grey, 2016, ACM T PARALLEL COMPU, V3