Improving memory hierarchy performance for irregular applications using data and computation reorderings

被引：65

作者：

Mellor-Crummey, J

Whalley, D

Kennedy, K

机构：

[1] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA

[2] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA

来源：

INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING | 2001年 / 29卷 / 03期

关键词：

memory hierarchy optimization; data reordering; computation reordering; space-filling curves; multi-level blocking;

D O I：

10.1023/A:1011119519789

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The performance of irregular applications on modern computer systems is hurt by the wide gap between CPU and memory speeds because these applications typically under-utilize multi-level memory hierarchies, which help hide this gap. This paper investigates using data and computation reorderings to improve memory hierarchy utilization for irregular applications. We evaluate the impact of reordering on data reuse at different levels in the memory hierarchy. We focus on coordinated data and computation reordering based on space-filling curves and we introduce a new architecture-independent multi-level blocking strategy for irregular applications. For two particle codes we studied, the most effective reorderings reduced overall execution time by a factor of two and four, respectively. Preliminary experience with a scatter benchmark derived from a large unstructured mesh application showed that careful data and computation ordering reduced primary cache misses by a factor of two compared to a random ordering.

引用

页码：217 / 247

页数：31

共 37 条

[1]

ABUSUFAH W, 1979, P 1979 NAT COMP C, P969

[2]

ALFURAIH I, 1998, P INT PAR P S MARCH

[3]

ALLEN JR, 1984, SIGPLAN NOTICES, V19, P233, DOI 10.1145/502949.502897

[4] CHARMM - A PROGRAM FOR MACROMOLECULAR ENERGY, MINIMIZATION, AND DYNAMICS CALCULATIONS [J].