GPU accelerated computational homogenization based on a variational approach in a reduced basis framework

被引:59
作者
Fritzen, Felix [1 ]
Hodapp, Max [2 ]
Leuschner, Matthias [1 ]
机构
[1] Karlsruhe Inst Technol KIT, Inst Engn Mech, Young Investigator Grp Comp Aided Mat Modeling, D-76131 Karlsruhe, Germany
[2] Ecole Polytech Fed Lausanne, Lab Multiscale Mech Modeling LAMMM, STI IGM LAMMM, CH-1015 Lausanne, Switzerland
关键词
Nvidia CUDA; Graphics processing unit (GPU); GPU accelerated batched BLAS; Reduced basis model order reduction; Generalized Standard Material (GSM); Mixed incremental variational approach; TRANSFORMATION FIELD ANALYSIS; HYPER-REDUCTION; IMPLEMENTATION;
D O I
10.1016/j.cma.2014.05.006
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Computational multiscale methods such as the FE2 technique (Feyel, 1999) come along with large demands in both CPU time and memory. In order to significantly reduce the computational cost of multiscale methods the authors recently proposed a hybrid computational homogenization method for visco-plastic materials using a reduced basis approach in a mixed variational formulation (Fritzen and Leuschner, 2013). In the present contribution two extensions of the method are introduced: First, the previous proposal is extended by allowing for heterogeneous hardening variables instead of piecewise constant fields. This leads to an improved accuracy of the method. Second, a massively parallel GPU implementation of the algorithm using Nvidia's CUDA framework is presented. The GPU subroutines for the batched linear algebraic operations are integrated into a specialized library in order to facilitate its use. The impact of the heterogeneous hardening states on the accuracy and the performance gains obtained from the dedicated GPU implementation are illustrated by means of numerical examples. An overall speedup in the order of 10(4) with respect to a high performance finite element implementation is achieved while preserving good accuracy of the predicted nonlinear material response. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:186 / 217
页数:32
相关论文
共 36 条
  • [1] General purpose molecular dynamics simulations fully implemented on graphics processing units
    Anderson, Joshua A.
    Lorenz, Chris D.
    Travesset, A.
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2008, 227 (10) : 5342 - 5359
  • [2] [Anonymous], P 2009 IEEE INT PAR
  • [3] [Anonymous], 2012, Cuda c best practices guide
  • [4] [Anonymous], INT C HIGH PERF COMP
  • [5] Bailey David H, 1991, Supercomputing Review
  • [6] Bathe K.-J., 2006, FINITE ELEMENT PROCE
  • [7] Recent Advances and New Challenges in the Use of the Proper Generalized Decomposition for Solving Multidimensional Models
    Chinesta, Francisco
    Ammar, Amine
    Cueto, Elias
    [J]. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2010, 17 (04) : 327 - 350
  • [8] ON TRANSFORMATION STRAINS AND UNIFORM-FIELDS IN MULTIPHASE ELASTIC MEDIA
    DVORAK, GJ
    BENVENISTE, Y
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1992, 437 (1900): : 291 - 310
  • [9] IMPLEMENTATION OF THE TRANSFORMATION FIELD ANALYSIS FOR INELASTIC COMPOSITE-MATERIALS
    DVORAK, GJ
    BAHEIELDIN, YA
    WAFA, AM
    [J]. COMPUTATIONAL MECHANICS, 1994, 14 (03) : 201 - 228
  • [10] Large calculation of the flow over a hypersonic vehicle using a GPU
    Elsen, Erich
    LeGresley, Patrick
    Darve, Eric
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2008, 227 (24) : 10148 - 10161