The advent of multicore processors during the past decade and especially the recent introduction of many-core Graphics Processing Units (GPUs) open new horizons to large-scale, high-resolution simulations for a broad range of scientific fields. Residing at the forefront of advancements in multiprocessor technology, GPUs are often chosen as co-processors when intensive parts of applications need to be computed. Among the various domains, the scientific area of Computational Fluid Dynamics (CFD) is a potential candidate that could significantly benefit from the utilization of many-core GPUs. In order to investigate this possibility, we herein evaluate the performance of a high order accurate method for the simulation of compressible flows. Targeting computer systems with multiple GPUs, the current implementation and the respective performance evaluation are taking place on a GPU cluster. With respect to using these GPUs, this paper offers an alternative to the mainstream approach of message passing by considering shared memory abstraction. In the implementations presented in this paper, the updates on shared data are not explicitly coded by the programmer across the simulation phases, but are propagated through Software Distributed Shared Memory (SDSM). This way, we intend to preserve a unified memory view that extends the memory hierarchy from the node level to the cluster level. Such an extension could significantly facilitate the porting of multithreaded codes at GPU clusters. Our results indicate that the presented approach is competitive with the message passing paradigm and they lay grounds for further research on the use of shared memory abstraction for future GPU clusters. (C) 2014 Elsevier Ltd. All rights reserved.
机构:
Beijing Normal Univ, Sch Math Sci, Lab Math & Complex Syst, Beijing, Peoples R ChinaBeijing Normal Univ, Sch Math Sci, Lab Math & Complex Syst, Beijing, Peoples R China
Wang, Yuhang
Cao, Guiyu
论文数: 0引用数: 0
h-index: 0
机构:
Southern Univ Sci & Technol, Acad Adv Interdisciplinary Studies, Shenzhen, Peoples R ChinaBeijing Normal Univ, Sch Math Sci, Lab Math & Complex Syst, Beijing, Peoples R China
Cao, Guiyu
Pan, Liang
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Normal Univ, Sch Math Sci, Lab Math & Complex Syst, Beijing, Peoples R ChinaBeijing Normal Univ, Sch Math Sci, Lab Math & Complex Syst, Beijing, Peoples R China
机构:
China Ship Sci Res Ctr, Wuxi 214082, Peoples R China
Taihu Lab Deep Sea Technol Sci, Wuxi 214082, Peoples R ChinaChina Ship Sci Res Ctr, Wuxi 214082, Peoples R China
Yu, Jun
Zhang, Xian-pi
论文数: 0引用数: 0
h-index: 0
机构:
China Ship Sci Res Ctr, Wuxi 214082, Peoples R China
Taihu Lab Deep Sea Technol Sci, Wuxi 214082, Peoples R ChinaChina Ship Sci Res Ctr, Wuxi 214082, Peoples R China
Zhang, Xian-pi
Chen, Ji-Ping
论文数: 0引用数: 0
h-index: 0
机构:
China Ship Sci Res Ctr, Wuxi 214082, Peoples R China
Univ Sci & Technol China, Dept Modern Mech, Hefei 230026, Peoples R ChinaChina Ship Sci Res Ctr, Wuxi 214082, Peoples R China
Chen, Ji-Ping
Xu, Yuan-Qing
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Inst Technol, Sch Med Technol, Beijing 100081, Peoples R ChinaChina Ship Sci Res Ctr, Wuxi 214082, Peoples R China