Performance Evaluation of NAS Parallel Benchmarks on Intel® Xeon Phi™

被引:20
|
作者
Ramachandran, Arunmoezhi [1 ]
Vienne, Jerome [2 ]
Van der Wijngaart, Rob [3 ]
Koesterke, Lars [2 ]
Sharapov, Ilya [3 ]
机构
[1] Univ Texas Dallas, Richardson, TX 75083 USA
[2] Univ Texas Austin, Texas Adv Comp Ctr, Austin, TX USA
[3] Intel Corp, Santa Clara, CA USA
来源
2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP) | 2013年
基金
美国国家科学基金会;
关键词
Parallel programming; Multicore processing; Performance analysis;
D O I
10.1109/ICPP.2013.87
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
NAS parallel benchmarks (NPB) are a set of applications commonly used to evaluate parallel systems. We use the NPB-OpenMP version to examine the performance of the Intel's new Xeon Phi co-processor and focus specially on the many-core aspect of the Xeon Phi architecture. A first analysis studies the scalability up to 244 threads on 61 cores, the impact of affinity settings on scaling and compare performance characteristics of Xeon Phi and traditional Xeon CPUs. The application of several well-established optimization techniques allows us to identify common bottlenecks that can specifically impede performance on the Xeon Phi but are not as severe on multi-core CPUs. We also find that many of the OpenMP-parallel loops are too short (in terms of the number of loop iterations) for a balanced execution by 244 threads. New, or redesigned benchmarks will be needed to accommodate the greatly increased number of cores and threads. At the end, we summarize our findings in a set recommendations for performance optimization for Xeon Phi.
引用
收藏
页码:736 / 743
页数:8
相关论文
共 50 条
  • [1] Parallel application benchmarks and performance evaluation of the Intel Xeon 7500 family processors
    Kopta, Piotr
    Kulczewski, Michal
    Kurowski, Krzysztof
    Piontek, Tomasz
    Gepner, Pawel
    Puchalski, Mariusz
    Komasa, Jacek
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 372 - 381
  • [2] Performance Evaluation of R with Intel Xeon Phi Coprocessor
    El-Khamra, Yaakoub
    Gaffney, Niall
    Walling, David
    Wernert, Eric
    Xu, Weijia
    Zhang, Hui
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [3] An Early Performance Evaluation of OpenCL on Intel Xeon Phi
    Gao, Xiang
    Xu, Chuanfu
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL AND INFORMATION SCIENCES (ICCIS 2014), 2014, : 561 - 566
  • [4] Benchmarking Parallel Chess Search in Stockfish on Intel Xeon and Intel Xeon Phi Processors
    Czarnul, Pawel
    COMPUTATIONAL SCIENCE - ICCS 2018, PT III, 2018, 10862 : 457 - 464
  • [5] Performance and Energy Evaluation of Data Prefetching on Intel Xeon Phi
    Guttman, Diana
    Kandemir, Mahmut Taylan
    Arunachalam, Meenakshi
    Calina, Vlad
    2015 IEEE International Symposium on Performance Analysis and Software (ISPASS), 2015, : 288 - 297
  • [6] Evaluation of Rodinia Codes on Intel Xeon Phi
    Misra, Goldi
    Kurkure, Nisha
    Das, Abhishek
    Valmiki, Manjunatha
    Das, Shweta
    Gupta, Abhinav
    FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION (ISMS 2013), 2013, : 415 - 419
  • [7] Optimizing Performance of ROMS on Intel Xeon Phi
    Bhaskaran, Gopal
    Gaurav, Pratyush
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 2854 - 2858
  • [8] Graph Problems Performance Comparison Using Intel Xeon and Intel Xeon-Phi
    Hanzelka, Jiri
    Skopal, Robert
    Slaninova, Katerina
    Martinovic, Jan
    Dvorsky, Jiri
    ADVANCED COMPUTING AND SYSTEMS FOR SECURITY, VOL 3, 2017, 567 : 73 - 83
  • [9] Performance Evaluation of Sparse Matrix Multiplication Kernels on Intel Xeon Phi
    Saule, Erik
    Kaya, Kamer
    Catalyuerek, Uemit V.
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I, 2014, 8384 : 559 - 570
  • [10] Efficient Parallel Multigrid Method on Intel Xeon Phi Clusters
    Nakajima, Kengo
    Gerofi, Balazs
    Ishikawa, Yutaka
    Horikoshi, Masashi
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION WORKSHOPS (HPC ASIA 2021 WORKSHOPS), 2020, : 46 - 49