Optimization and performance of a Fortran 90 MPI-based unstructured code on large scale parallel systems

被引:0
|
作者
Shires, D [1 ]
Mohan, R [1 ]
Mark, A [1 ]
机构
[1] USA, Res Lab, HPC Div, Aberdeen Proving Ground, MD 21010 USA
来源
PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS | 2001年
关键词
MPI; unstructured grids; code optimization; massively parallel;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Message Passing Interface (MPI) has become the standard in achieving effective results when using the message passing paradigm of parallelization. Codes written using MPI are extremely portable, and are applicable to both clusters and massively parallel computing platforms. Since MPI uses the single program, multiple data (SPMD) approach to parallelism, good performance requires careful tuning of the serial code, as well as careful data and control flow analysis to limit communication. We discuss optimization strategies used and their degree of success to increase performance of an MPI-based unstructured finite element simulations code written in Fortran 90. We discuss performance results based on implementations using several modern massively parallel computing platforms including the SGI Origin 3800, IBM Nighthawk 2 SMP, and Cray T3E-1200.
引用
收藏
页码:34 / 40
页数:3
相关论文
共 50 条
  • [21] A parallel parameterized level set topology optimization framework for large-scale structures with unstructured meshes
    Lin, Haoju
    Liu, Hui
    Wei, Peng
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2022, 397
  • [22] Using MPI File Caching to Improve Parallel Write Performance for Large-Scale Scientific Applications
    Liao, Wei-keng
    Ching, Avery
    Coloma, Kenin
    Nisar, Arifa
    Choudhary, Alok
    Chen, Jacqueline
    Sankaran, Ramanan
    Klasky, Scott
    2007 ACM/IEEE SC07 CONFERENCE, 2010, : 661 - +
  • [23] Improvement of the computational performance of a parallel unstructured WENO finite volume CFD code for Implicit Large Eddy Simulation
    Tsoutsanis, Panagiotis
    Antoniadis, Antonis F.
    Jenkins, Karl W.
    COMPUTERS & FLUIDS, 2018, 173 : 157 - 170
  • [24] Optimization of multigrid based elliptic solver for large scale simulations in the FLASH code
    Daley, Christopher
    Vanella, Marcos
    Dubey, Anshu
    Weide, Klaus
    Balaras, Elias
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2012, 24 (18): : 2346 - 2361
  • [25] Performance Optimization of HPC Applications in Large-Scale Cluster Systems
    Li, Longxiang
    PROCEEDINGS OF THE 2022 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE '22), 2022, : 3 - 3
  • [26] Adaptive Performance Optimization for Large-Scale Traffic Control Systems
    Kouvelas, Anastasios
    Aboudolas, Konstantinos
    Kosmatopoulos, Elias B.
    Papageorgiou, Markos
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (04) : 1434 - 1445
  • [27] Performance Optimization of Load Imbalanced Workloads in Large Scale Dragonfly Systems
    Prisacari, Bogdan
    Rodriguez, German
    Minkenberg, Cyriel
    Garcia, Marina
    Vallejo, Enrique
    Beivide, Ramon
    2015 IEEE 16TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (HPSR), 2015, : 202 - 207
  • [28] Performance Characteristics of Hybrid MPI/OpenMP Implementations of NAS Parallel Benchmarks SP and BT on Large-Scale Multicore Clusters
    Wu, Xingfu
    Taylor, Valerie
    COMPUTER JOURNAL, 2012, 55 (02): : 154 - 167
  • [29] Performance prediction of large-scale parallel discrete event models of physical systems
    Perumalla, KS
    Fujimoto, RM
    Thakare, PJ
    Pande, S
    Karimabadi, H
    Omelchenko, Y
    Driscoll, J
    Proceedings of the 2005 Winter Simulation Conference, Vols 1-4, 2005, : 356 - 364
  • [30] Performance measurement and analysis of large-scale parallel applications on leadership computing systems
    Wylie, Brian J. N.
    Geimer, Markus
    Wolf, Felix
    SCIENTIFIC PROGRAMMING, 2008, 16 (2-3) : 167 - 181