Parallel Sn Sweep Scheduling Algorithm on Unstructured Grids for Multigroup Time-Dependent Particle Transport Equations

被引:6
作者
Liu, Jie [1 ]
Chi, Lihua [1 ]
Wang QingLin [1 ]
Gong Chunye [1 ]
Jie, Jiang [1 ]
Gan Xinbiao [1 ]
Li Shengguo [1 ]
Hu, Qingfeng [1 ]
Masterson, Tom [2 ]
机构
[1] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Hunan, Peoples R China
[2] Univ Colorado, Dept Phys, Boulder, CO 80309 USA
基金
中国国家自然科学基金; 中国博士后科学基金; 国家高技术研究发展计划(863计划);
关键词
Transport equation; sweep scheduling method; discrete ordinates method; ACCELERATION; SIMULATIONS; GPU;
D O I
10.13182/NSE15-53
中图分类号
TL [原子能技术]; O571 [原子核物理学];
学科分类号
0827 ; 082701 ;
摘要
Sweep scheduling methods used in particle transport problems belong to the class of precedence-constrained scheduling problems that are NP-complete. It is difficult to schedule local tasks for this type of transport problem and simultaneously optimize computational performance and parallel processor communication. In this paper, we present a parallel spatial-domain-decomposition algorithm to divide the tasks among the available processors. We also present a new algorithm for scheduling tasks within each processor. The scheduling algorithm has the required data and does not need to communicate with any other processor. This algorithm optimizes and assigns task priorities within the processor. Computational tasks whose results are required by another processor receive the highest priority. We combined these two algorithms to solve two-dimensional particle transport equations on unstructured grids. Our results show good performance and scalability up to 16 384 processors on the TianHe-2 supercomputer.
引用
收藏
页码:527 / 536
页数:10
相关论文
共 30 条
[1]  
ADAMS M. P., 2015, P JOINT INT C MATH C
[2]  
ADAMS M. P., 2013, P INT C MATH COMP ME
[3]  
BAILEY T. S., 2009, P INT C MATH COMP ME
[4]  
BAKER R. S., 2000, P SUPERCOMPUTING 200
[5]   An Sn algorithm for the massively parallel CM-200 computer [J].
Baker, RS ;
Koch, KR .
NUCLEAR SCIENCE AND ENGINEERING, 1998, 128 (03) :312-320
[6]   Microarchitectural performance comparison of Intel Knights Corner and Intel Sandy Bridge with CFD applications [J].
Che, Yonggang ;
Zhang, Lilun ;
Wang, Yongxian ;
Xu, Chuanfu ;
Liu, Wei ;
Wang, Zhenghua .
JOURNAL OF SUPERCOMPUTING, 2014, 70 (01) :321-348
[7]   Parallel algorithms for Sn transport sweeps on unstructured meshes [J].
Colomer, G. ;
Borrell, R. ;
Trias, F. X. ;
Rodriguez, I. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2013, 232 (01) :118-135
[8]   Comparison via parallel performance models of angular and spatial domain decompositions for solving neutral particle transport problems [J].
Fischer, James W. ;
Azmy, Y. Y. .
PROGRESS IN NUCLEAR ENERGY, 2007, 49 (01) :37-60
[9]  
Forum M., 2012, MPI: A Message-Passing Interface Standard Version 3.0
[10]   Introduction of Parallel GPGPU Acceleration Algorithms for the Solution of Radiative Transfer [J].
Godoy, William F. ;
Liu, Xu .
NUMERICAL HEAT TRANSFER PART B-FUNDAMENTALS, 2011, 59 (01) :1-25