Analyzing Memory Access Intensity in Parallel Programs on Multicore

被引:0
作者
Liu, Lixia [1 ]
Li, Zhiyuan [1 ]
Sameh, Ahmed H. [1 ]
机构
[1] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47906 USA
来源
ICS'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING | 2008年
关键词
Memory Bandwidth; Intensity; Multicore; Parallel; Linear System;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As the shared memory bus becomes a major performance bottleneck for many numerical applications on multicore chips, understanding how the increased parallelism on chip strains the memory bandwidth and hence affects the efficiency of parallel codes becomes a critical issue. This paper introduces the notion of memory access intensity to facilitate quantitative analysis of program's memory behavior on multicores which employ state-of-the-art prefetching hardware. Three numerical solvers for large scale sparse linear systems are used to demonstrate the estimation of memory access intensity and its effect on program performance.
引用
收藏
页码:359 / 367
页数:9
相关论文
共 16 条
[1]  
ALAM SR, 2006, INT S WORKL CHAR
[2]  
Asanovic K, 2006, UCBEECS2006183
[3]  
Blackford L. S., 1997, ScaLAPACK user's guide
[4]   IMPROVING THE RATIO OF MEMORY OPERATIONS TO FLOATING-POINT OPERATIONS IN LOOPS [J].
CARR, S ;
KENNEDY, K .
ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1994, 16 (06) :1768-1810
[5]  
CHAI L, 2007, CLUSTER COMPUTING GR
[6]  
CLEARY A, 1997, IMPLEMENTATION SCALA
[7]  
Hennessy J. L., 2007, COMPUTER ARCHITECTUR
[8]  
Intel, INT MATH KERN LIB
[9]  
Lam M. S., 1991, P 4 INT C ARCH SUPP
[10]   Automatic tiling of iterative stencil loops [J].
Li, ZY ;
Song, YH .
ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2004, 26 (06) :975-1028