Performance Evaluation of Multi-Core Intel Xeon Processors on Basic Linear Algebra Subprograms

Cited by: 0
Authors
Soliman, Mostafa I. [1]
Affiliations
[1] S Valley Univ, Aswan Fac Engn, Dept Elect Engn, Comp & Syst Sect, Kena, Egypt
Source
ICCES: 2008 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS | 2007
Keywords
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Multi-core technology is a natural next step in delivering the benefits of Moore's law to computing platforms. On multi-core processors, the performance of many applications would be improved by processing threads of code in parallel using multi-threading techniques. This paper evaluates the performance of multi-core Intel Xeon processors on the widely used basic linear algebra subprograms (BLAS). On two dual-core Intel Xeon processors with Hyper-Threading (HT) technology, our results show that a performance of around 20 GFLOPS is achieved on Level-3 (matrix-matrix operations) BLAS using multi-threading, SIMD, matrix blocking, and loop unrolling techniques. However, for small problem sizes of Level-2 (matrix-vector operations) and Level-1 (vector operations) BLAS, multi-threading slows down execution because of thread creation overheads. Thus, using the Intel SIMD instruction set is the way to improve the performance of single-threaded Level-2 (6 GFLOPS) and Level-1 BLAS (3 GFLOPS). When the problem size becomes large (cannot fit in the L2 cache), the performance of the four Xeon cores with HT is less than 2 and 1 GFLOPS on Level-2 and Level-1 BLAS, respectively, even though eight threads are executed in parallel.
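The matrix blocking technique named in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation, which is not shown in this record. The function names (matmul_naive, matmul_blocked, gemm_flops) and the block size are assumptions chosen for illustration, and pure Python is used only to show the loop structure, not to reach the reported performance.

```python
def matmul_naive(a, b):
    """Reference triple loop: C = A * B for lists of lists."""
    n, k, m = len(a), len(b), len(b[0])
    c = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            s = 0.0
            for p in range(k):
                s += a[i][p] * b[p][j]
            c[i][j] = s
    return c


def matmul_blocked(a, b, block=2):
    """Same product, computed tile by tile.

    Each (i0, p0, j0) step touches only a block-by-block tile of A, B
    and C, so on real hardware the tiles can stay in cache while they
    are reused. The block size here is a hypothetical parameter.
    """
    n, k, m = len(a), len(b), len(b[0])
    c = [[0.0] * m for _ in range(n)]
    for i0 in range(0, n, block):
        for p0 in range(0, k, block):
            for j0 in range(0, m, block):
                # Multiply one tile of A by one tile of B, accumulating into C.
                for i in range(i0, min(i0 + block, n)):
                    row_a, row_c = a[i], c[i]
                    for p in range(p0, min(p0 + block, k)):
                        a_ip, row_b = row_a[p], b[p]
                        for j in range(j0, min(j0 + block, m)):
                            row_c[j] += a_ip * row_b[j]
    return c


def gemm_flops(n, k, m):
    """Floating-point operations in an n-by-k times k-by-m product
    (one multiply and one add per inner-loop step)."""
    return 2 * n * k * m
```

Dividing gemm_flops(n, k, m) by the measured run time in seconds, then by 1e9, gives a GFLOPS figure of the kind quoted in the abstract. The multi-threading, SIMD, and loop unrolling steps are left out of this sketch.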
Pages: 3-9
Page count: 7