Analyzing Memory Access Intensity in Parallel Programs on Multicore

被引：0

作者：

Liu, Lixia ^{[1
]}

Li, Zhiyuan ^{[1
]}

Sameh, Ahmed H. ^{[1
]}

机构：

[1] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47906 USA

来源：

ICS'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING | 2008年

关键词：

Memory Bandwidth; Intensity; Multicore; Parallel; Linear System;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

As the shared memory bus becomes a major performance bottleneck for many numerical applications on multicore chips, understanding how the increased parallelism on chip strains the memory bandwidth and hence affects the efficiency of parallel codes becomes a critical issue. This paper introduces the notion of memory access intensity to facilitate quantitative analysis of program's memory behavior on multicores which employ state-of-the-art prefetching hardware. Three numerical solvers for large scale sparse linear systems are used to demonstrate the estimation of memory access intensity and its effect on program performance.

引用

页码：359 / 367

页数：9

共 16 条

[1]

ALAM SR, 2006, INT S WORKL CHAR

[2]

Asanovic K, 2006, UCBEECS2006183

[3]

Blackford L. S., 1997, ScaLAPACK user's guide

[4] IMPROVING THE RATIO OF MEMORY OPERATIONS TO FLOATING-POINT OPERATIONS IN LOOPS [J].

CARR, S ;

KENNEDY, K .

ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1994, 16 (06) :1768-1810

[5]

CHAI L, 2007, CLUSTER COMPUTING GR

[6]

CLEARY A, 1997, IMPLEMENTATION SCALA

[7]

Hennessy J. L., 2007, COMPUTER ARCHITECTUR

[8]

Intel, INT MATH KERN LIB

[9]

Lam M. S., 1991, P 4 INT C ARCH SUPP

[10] Automatic tiling of iterative stencil loops [J].

Li, ZY ;

Song, YH .

ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2004, 26 (06) :975-1028

← 1 2 →