Performance Evaluation of an OpenCL Implementation of the Lattice Boltzmann Method on the Intel Xeon Phi

被引:2
|
作者
Obrecht, Christian [1 ]
Tourancheau, Bernard [2 ]
Kuznik, Frederic [1 ]
机构
[1] INSA Lyon, CETHIL, UMR5008, F-69621 Villeurbanne, France
[2] UJF Grenoble, LIG, UMR5217, F-38041 Grenoble 9, France
关键词
Intel Xeon Phi; OpenCL; computational fluid dynamics; lattice Boltzmann method;
D O I
10.1142/S0129626415410017
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A portable OpenCL implementation of the lattice Boltzmann method targeting emerging many-core architectures is described. The main purpose of this work is to evaluate and compare the performance of this code on three mainstream hardware architectures available today, namely an Intel CPU, an Nvidia GPU, and the Intel Xeon Phi. Because of the similarities between OpenCL and CUDA, we chose to follow some of the strategies devised to implement efficient lattice Boltzmann solvers on Nvidia GPU, while remaining as generic as possible. Being fairly configurable, this program makes possible to ascertain the best options for each hardware platforms. The achieved performance is quite satisfactory for both the CPU and the GPU. For the Xeon Phi however, the results are below expectations. Nevertheless, comparison with data from the literature shows that on this architecture the code seems memory-bound.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Lattice Boltzmann Method Implementation on Multiple Devices using OpenCL
    Tekic, Jelena B.
    Tekic, Predrag M.
    Rackovic, Milos
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2018, 18 (03) : 3 - 8
  • [22] Implementation of short read alignment algorithm in OpenCL on Xeon Phi coprocessor
    Zhao, Xiquan
    Liu, Chuang
    Tan, Guangming
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1633 - 1636
  • [23] Performance benchmarking of deep learning framework on Intel Xeon Phi
    Chao-Tung Yang
    Jung-Chun Liu
    Yu-Wei Chan
    Endah Kristiani
    Chan-Fu Kuo
    The Journal of Supercomputing, 2021, 77 : 2486 - 2510
  • [24] Understanding the Performance of Stencil Computations on Intel's Xeon Phi
    Peraza, Joshua
    Tiwari, Ananta
    Laurenzano, Michael
    Carrington, Laura
    Ward, William A.
    Campbell, Roy
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [25] Performance Evaluation and Scalability Analysis of NPB-MZ on Intel Xeon Phi Coprocessor
    Li, Yuqian
    Che, Yonggang
    Wang, Zhenghua
    COMPUTER ENGINEERING AND TECHNOLOGY, NCCET 2013, 2013, 396 : 143 - 152
  • [26] Performance benchmarking of deep learning framework on Intel Xeon Phi
    Yang, Chao-Tung
    Liu, Jung-Chun
    Chan, Yu-Wei
    Kristiani, Endah
    Kuo, Chan-Fu
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (03): : 2486 - 2510
  • [27] Performance Optimization of OpenFOAM* on Clusters of Intel® Xeon Phi™ Processors
    Ojha, Ravi
    Pawar, Prasad
    Gupta, Sonia
    Klemm, Michael
    Nambiar, Manoj
    2017 IEEE 24TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING WORKSHOPS (HIPCW), 2017, : 51 - 59
  • [28] Implementation of the Lattice Boltzmann Method on Heterogeneous Hardware and Platforms using OpenCL
    Tekic, Predrag M.
    Radjenovic, Jelena B.
    Rackovic, Milos
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2012, 12 (01) : 51 - 56
  • [29] Modeling Performance and Energy for Applications Offloaded to Intel Xeon Phi
    Lawson, Gary
    Sundriyal, Vaibhav
    Sosonkina, Masha
    Shen, Yuzhong
    PROCEEDINGS OF CO-HPC 2015: 2ND INTERNATIONAL WORKSHOP ON HARDWARE-SOFTWARE CO-DESIGN FOR HIGH PERFORMANCE COMPUTING, 2015,
  • [30] High Performance Stencil Computations for Intel® Xeon Phi™ Coprocessor
    Feng, Luxia
    Dong, Yushan
    Li, Chunjiang
    Jiang, Hao
    ADVANCED COMPUTER ARCHITECTURE, ACA 2016, 2016, 626 : 108 - 117