Automatic Computer System Characterization for a Parallelizing Compiler

被引:1
作者
Sussman, Alan [1 ]
Lo, Norman [1 ]
Anderson, Timothy [2 ]
机构
[1] Univ Maryland, UMIACS, College Pk, MD 20742 USA
[2] BAE Syst, Electron Solutions, Burlington, MA 01803 USA
来源
2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) | 2011年
关键词
D O I
10.1109/CLUSTER.2011.32
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Effectively utilizing the compute power of modern multi-core machines is a challenging task for a programmer. Automated extraction of shared memory parallelism via powerful compiler transformations and optimizations is one means to such a goal. However, the effectiveness of such transformations is tied to detailed characteristics of the target computer system. In this paper, we describe an automated system for capturing such computer system characteristics that is based on prior work on various parts of the overall problem. The system characteristics measured include the number of available compute elements available to run threads, multiple memory hierarchy parameters, and functional unit latencies and bandwidths. We show experimental results on a wide range of compute platforms that validate the effectiveness of the overall approach.
引用
收藏
页码:216 / 224
页数:9
相关论文
共 11 条
[1]  
[Anonymous], 2006, NIAGARA2 HIGHLY THRE
[2]  
[Anonymous], LLVM COMP INFR
[3]  
[Anonymous], 2011, PORTABLE HARDWARE LO
[4]  
Broquedis F, 2010, P 18 EUR C PAR DISTR
[5]  
Gonzalez-Dominguez J, 2010, P 24 IEEE INT PAR DI
[6]  
Huang Jialu, 2010, P 2010 INT S COD GEN
[7]   ALGORITHMS FOR SCALABLE SYNCHRONIZATION ON SHARED-MEMORY MULTIPROCESSORS [J].
MELLORCRUMMEY, JM ;
SCOTT, ML .
ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1991, 9 (01) :21-65
[8]  
Raman E., 2008, P 2008 INT S COD GEN
[9]  
Wolfe M. J., 1995, HIGH PERFORMANCE COM
[10]  
YOTOV K, 2005, P 2 INT C QUANT EV S