Performance Evaluation of an OpenCL Implementation of the Lattice Boltzmann Method on the Intel Xeon Phi

被引：2

作者：

Obrecht, Christian ^{[1
]}

Tourancheau, Bernard ^{[2
]}

Kuznik, Frederic ^{[1
]}

机构：

[1] INSA Lyon, CETHIL, UMR5008, F-69621 Villeurbanne, France

[2] UJF Grenoble, LIG, UMR5217, F-38041 Grenoble 9, France

来源：

PARALLEL PROCESSING LETTERS | 2015年 / 25卷 / 03期

关键词：

Intel Xeon Phi; OpenCL; computational fluid dynamics; lattice Boltzmann method;

D O I：

10.1142/S0129626415410017

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

A portable OpenCL implementation of the lattice Boltzmann method targeting emerging many-core architectures is described. The main purpose of this work is to evaluate and compare the performance of this code on three mainstream hardware architectures available today, namely an Intel CPU, an Nvidia GPU, and the Intel Xeon Phi. Because of the similarities between OpenCL and CUDA, we chose to follow some of the strategies devised to implement efficient lattice Boltzmann solvers on Nvidia GPU, while remaining as generic as possible. Being fairly configurable, this program makes possible to ascertain the best options for each hardware platforms. The achieved performance is quite satisfactory for both the CPU and the GPU. For the Xeon Phi however, the results are below expectations. Nevertheless, comparison with data from the literature shows that on this architecture the code seems memory-bound.

引用

页数：13

共 50 条

[21] Lattice Boltzmann Method Implementation on Multiple Devices using OpenCL
Tekic, Jelena B.
Tekic, Predrag M.
Rackovic, Milos
ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2018, 18 (03) : 3 - 8
[22] Implementation of short read alignment algorithm in OpenCL on Xeon Phi coprocessor
Zhao, Xiquan
Liu, Chuang
Tan, Guangming
2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1633 - 1636
[23] Performance benchmarking of deep learning framework on Intel Xeon Phi
Chao-Tung Yang
Jung-Chun Liu
Yu-Wei Chan
Endah Kristiani
Chan-Fu Kuo
The Journal of Supercomputing, 2021, 77 : 2486 - 2510
[24] Understanding the Performance of Stencil Computations on Intel's Xeon Phi
Peraza, Joshua
Tiwari, Ananta
Laurenzano, Michael
Carrington, Laura
Ward, William A.
Campbell, Roy
2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
[25] Performance Evaluation and Scalability Analysis of NPB-MZ on Intel Xeon Phi Coprocessor
Li, Yuqian
Che, Yonggang
Wang, Zhenghua
COMPUTER ENGINEERING AND TECHNOLOGY, NCCET 2013, 2013, 396 : 143 - 152
[26] Performance benchmarking of deep learning framework on Intel Xeon Phi
Yang, Chao-Tung
Liu, Jung-Chun
Chan, Yu-Wei
Kristiani, Endah
Kuo, Chan-Fu
JOURNAL OF SUPERCOMPUTING, 2021, 77 (03): : 2486 - 2510
[27] Performance Optimization of OpenFOAM* on Clusters of Intel® Xeon Phi™ Processors
Ojha, Ravi
Pawar, Prasad
Gupta, Sonia
Klemm, Michael
Nambiar, Manoj
2017 IEEE 24TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING WORKSHOPS (HIPCW), 2017, : 51 - 59
[28] Implementation of the Lattice Boltzmann Method on Heterogeneous Hardware and Platforms using OpenCL
Tekic, Predrag M.
Radjenovic, Jelena B.
Rackovic, Milos
ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2012, 12 (01) : 51 - 56
[29] Modeling Performance and Energy for Applications Offloaded to Intel Xeon Phi
Lawson, Gary
Sundriyal, Vaibhav
Sosonkina, Masha
Shen, Yuzhong
PROCEEDINGS OF CO-HPC 2015: 2ND INTERNATIONAL WORKSHOP ON HARDWARE-SOFTWARE CO-DESIGN FOR HIGH PERFORMANCE COMPUTING, 2015,
[30] High Performance Stencil Computations for Intel® Xeon Phi™ Coprocessor
Feng, Luxia
Dong, Yushan
Li, Chunjiang
Jiang, Hao
ADVANCED COMPUTER ARCHITECTURE, ACA 2016, 2016, 626 : 108 - 117

← 1 2 3 4 5 →