Understanding the impact of multi-core architecture in cluster computing: A case study with intel dual-core system

被引:0
作者
Chai, Lei [1 ]
Gao, Qi [1 ]
Panda, Dhabaleswar K. [1 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
来源
CCGRID 2007: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID | 2007年
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Multi-core processors are growing as a new industry trend as single core processors rapidly reach the physical limits of possible complexity and speed. In the new Top500 supercomputer list, more than 20% processors belong to the multi-core processor family However without an in-depth study on application behaviors and trends on multi-core clusters, we might not be able to understand the characteristics of multi-core cluster in a comprehensive manner and hence not be able to get optimal performance. In this paper, we take on these challenges and design a set of experiments to study the impact of multi-core architecture on cluster computing. We choose to use one of the most advanced multi-core servers, Intel Bensley system with Wood-crest processors, as our evaluation Platform, and use benchmarks including HPL, NAMD, and NAS as the applications to study From our message distribution experiments, we find that on an average about 50% messages are transferred through intra-Node communication, which is much higher than intuition. This trend indicates that optimizing intra-node communication is as important as optimizing internode communication in a multi-core cluster We also observe that cache and memory, contention may be a potential bottleneck in multi-core clusters, and communication middleware and applications should be multi-core aware to alleviate this problem. We demonstrate that multi-core aware algorithm, e.g. data tiling, improves benchmark execution time by up to 70%. We also compare the scalability of a multi-core cluster with that of a single-core cluster and find that the scalability of the multi-core cluster is promising.
引用
收藏
页码:471 / +
页数:2
相关论文
共 17 条
[1]  
ALAM SR, 2006, INT S WORKL CHAR
[2]   THE NAS PARALLEL BENCHMARKS [J].
BAILEY, DH ;
BARSZCZ, E ;
BARTON, JT ;
BROWNING, DS ;
CARTER, RL ;
DAGUM, L ;
FATOOHI, RA ;
FREDERICKSON, PO ;
LASINSKI, TA ;
SCHREIBER, RS ;
SIMON, HD ;
VENKATAKRISHNAN, V ;
WEERATUNGA, SK .
INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1991, 5 (03) :63-73
[3]  
BUNTINAS D, 2006, INT S CLUST COMP GRI
[4]  
BUNTINAS D, 2006, INT C PAR PROC
[5]  
BURGER TW, INTEL MULTICORE PROC
[6]  
CHAI L, 2006, IEEE INT C CLUST COM
[7]  
CURTISMAURY M, 2005, IWOMP
[8]  
DOMEIKA M, OPTIMIZATION TECHNIQ
[9]  
GANESH K, OPTIMIZATION TECHNIQ
[10]  
HE Y, HYBRID OPENMP MPI PR