Using sampled information: is it enough for the sparse matrix-vector product locality optimization?

被引:2
作者
Pichel, Juan C. [1 ]
Lorenzo, Juan A. [1 ]
Rivera, Francisco F. [1 ]
Heras, Dora B. [1 ]
Pena, Tomas F. [1 ]
机构
[1] Univ Santiago de Compostela, Ctr Invest Tecnoloxias Informac CITIUS, Santiago De Compostela 15782, Spain
关键词
sparse matrix; locality; hardware counters; sampling; performance; PERFORMANCE OPTIMIZATION; MULTIPLICATION;
D O I
10.1002/cpe.2949
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
One of the main factors that affect the performance of the sparse matrix-vector product (SpMV) is the low data reuse caused by the irregular and indirect memory access patterns. Different strategies to deal with this problem such as data reordering techniques have been proposed. The computational cost of these techniques is typically high because they consider all the nonzeros of the sparse matrix in order to find an appropriate permutation of rows and columns that improves the SpMV performance. In this paper, we analyze the possibility of increasing the locality of the SpMV using incomplete information in the reordering process. This partial information comes as a consequence of considering only a subset of the nonzero elements of the matrix. These nonzeros are obtained from the original matrix through a sampling process. In particular, two different sampling methods have been considered: a random sampling and an event-based sampling using hardware counters. We have detected that a small number of samples is enough to obtain quality reorderings. As a consequence, using sampling-based reorderings leads to noticeable performance improvements with respect to the non-reordered matrices, reaching speedup values up to 2.1x. In addition, an important reduction in the computational time required by the reordering technique has been observed. Copyright (c) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:98 / 117
页数:20
相关论文
共 32 条
  • [1] An approximate minimum degree ordering algorithm
    Amestoy, PR
    Davis, TA
    Duff, IS
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1996, 17 (04) : 886 - 905
  • [2] [Anonymous], SEVERAL STRATEGIES R
  • [3] Applegate David L, 2006, TRAVELING SALESMAN P
  • [4] Pattern-based Sparse Matrix Representation for Memory-Efficient SMVM Kernels
    Belgin, Mehmet
    Back, Godmar
    Ribbens, Calvin J.
    [J]. ICS'09: PROCEEDINGS OF THE 2009 ACM SIGARCH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, 2009, : 100 - 109
  • [5] Buck B. R., 2004, SC 04, P58
  • [6] Performance optimization and modeling of blocked sparse kernels
    Buttari, Alfredo
    Eijkhout, Victor
    Langou, Julien
    Filippone, Salvatore
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2007, 21 (04) : 467 - 484
  • [7] Cavazos J, 2007, INT SYM CODE GENER, P185
  • [8] Choi Y, 2002, P EPIC 2 WORKSH IST, P48
  • [9] Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations
    Coutinho, ALGA
    Martins, MAD
    Sydenstricker, RM
    Elias, RN
    [J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2006, 66 (03) : 431 - 460
  • [10] Davis T., 1997, NA Digest, V97