Parallel Performance Optimizations on Unstructured Mesh-Based Simulations

Cited by: 7
Authors
Sarje, Abhinav [1]
Song, Sukhyun [2]
Jacobsen, Douglas [3]
Huck, Kevin [4]
Hollingsworth, Jeffrey [2]
Malony, Allen [4]
Williams, Samuel [1]
Oliker, Leonid [1]
Affiliations
[1] Lawrence Berkeley Natl Lab, Berkeley, CA USA
[2] Univ Maryland, College Pk, MD USA
[3] Los Alamos Natl Lab, Los Alamos, NM 87545 USA
[4] Univ Oregon, Eugene, OR 97403 USA
Source
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE | 2015 / Vol. 51
Keywords
Unstructured Mesh; Ocean Modeling; Graph Partitioning; Performance Optimization;
DOI
10.1016/j.procs.2015.05.466
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
This paper addresses two key parallelization challenges in the unstructured mesh-based ocean modeling code MPAS-Ocean, which uses a mesh based on Voronoi tessellations: (1) load imbalance across processes, and (2) unstructured data access patterns that inhibit intra- and inter-node performance. Our work analyzes the load imbalance due to naive partitioning of the mesh, and develops methods to generate mesh partitioning with better load balance and reduced communication. Furthermore, we present methods that minimize both inter- and intra-node data movement and maximize data reuse. Our techniques include predictive ordering of data elements for higher cache efficiency, as well as communication reduction approaches. We present detailed performance data when running on thousands of cores using the Cray XC30 supercomputer and show that our optimization strategies can exceed the original performance by over 2x. Additionally, many of these solutions can be broadly applied to a wide variety of unstructured grid-based computations.
Pages: 2016-2025
Page count: 10