Compiler-directed parallelization of loops in Scale for shared-memory multiprocessors

Cited by: 0
Authors
Johnson, GS [1]
Sethumadhavan, S
Affiliations
[1] Univ Texas, Dept Comp Sci, Austin, TX 78712 USA
[2] Univ Texas, Texas Adv Comp Ctr, Austin, TX 78712 USA
Source
COMPUTATIONAL SCIENCE - ICCS 2003, PT III, PROCEEDINGS | 2003, Vol. 2659
DOI
Not available
Chinese Library Classification (CLC)
TP39 [Applications of computers]
Subject classification codes
081203; 0835
Abstract
Effective utilization of symmetric shared-memory multiprocessors (SMPs) is predicated on the development of efficient parallel code. Unfortunately, efficient parallelism is not always easy for the programmer to identify. Worse, exploiting such parallelism may directly conflict with optimizations that affect per-processor utilization (e.g., loop reordering to improve data locality). Here, we present our experience with a loop-level parallelizing compiler optimization for SMPs proposed by McKinley [6]. The algorithm uses dependence analysis and a simple model of the target machine to transform nested loops. The goal of the approach is to promote efficient execution of parallel loops by exposing sources of large-grain parallel work while maintaining per-processor locality. We implement the optimization within the Scale compiler framework and analyze the performance of the multiprocessor code produced for three microbenchmarks.
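The kind of transformation the abstract describes can be illustrated with a small sketch. The following C/OpenMP fragment is illustrative only and is not taken from the paper or the Scale framework; the array size N, the doubling kernel, and the use of OpenMP (rather than Scale's own code generation) are assumptions. It shows loop interchange placing the dependence-free loop outermost, so each processor receives large-grain work while the innermost loop keeps stride-1 locality.

    /* Minimal sketch (assumed example, not the paper's implementation).
     * Compile with: cc -fopenmp example.c                              */
    #include <stdio.h>
    #define N 1024

    static double a[N][N], b[N][N];

    int main(void)
    {
        /* Original order, j outermost:
         *   for (j = 0; j < N; j++)
         *       for (i = 0; i < N; i++)
         *           a[i][j] = b[i][j] * 2.0;
         * Parallelizing j gives coarse-grain work, but the inner i loop
         * walks a and b column-wise (stride N in row-major C), so each
         * processor's cache behavior is poor.                          */

        /* After interchange, i is outermost: it carries no dependence,
         * so it is parallelized for large-grain work, and the inner
         * j loop is now stride-1, preserving per-processor locality.  */
        #pragma omp parallel for
        for (int i = 0; i < N; i++)
            for (int j = 0; j < N; j++)
                a[i][j] = b[i][j] * 2.0;

        printf("a[%d][%d] = %f\n", N / 2, N / 2, a[N / 2][N / 2]);
        return 0;
    }

In this sketch the interchange satisfies both of the abstract's goals at once: granularity (whole rows per thread) and locality (unit-stride inner accesses).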
Pages: 946-955 (10 pages)