SIMD Monte-Carlo Numerical Simulations Accelerated on GPU and Xeon Phi

Cited by: 0
Authors
Bastien Plazolles
Didier El Baz
Martin Spel
Vincent Rivola
Pascal Gegout
Affiliations
[1] Université de Toulouse, LAAS
[2] CNRS
[3] R.Tech
[4] Géosciences Environnement Toulouse (CNRS UMR5563)
[5] Université de Toulouse
Source
International Journal of Parallel Programming | 2018 / Vol. 46
Keywords
Parallel computing; Multi-core CPU; Xeon Phi; OpenMP; GPU; CUDA; Numerical integrator; Monte-Carlo simulations;
DOI
Not available
Abstract
The efficiency of a pleasingly parallel application is studied on several computing platforms. A real-world problem is considered: Monte-Carlo numerical simulations of the drift descent of stratospheric balloon envelopes. We detail the optimization of the SIMD parallel codes on the K40 and K80 GPUs as well as on the Intel Xeon Phi. We emphasize loop and task parallelism, multi-threading, and vectorization, respectively. The experiments show that GPU and MIC reduce computing time by non-negligible factors compared to a parallel code implemented on a two-socket CPU (E5-2680-v2). This reduction finally allows us to use these devices in operational conditions.
Pages: 584-606
Page count: 22