103 entries in total
[1]
Ahamed AKC (2017) Conjugate gradient method with graphics processing unit acceleration: CUDA vs OpenCL. Adv Eng Softw 111 32-42
[2]
Magoulès F (2016) Large scale three-dimensional topology optimisation of heat sinks cooled by natural convection. Int J Heat Mass Transf 100 876-891
[3]
Alexandersen J (2017) An efficient sparse matrix-vector multiplication on CUDA-enabled graphic processing units for finite element method simulations. Int J Numer Methods Eng 110 57-78
[4]
Sigmund O (2017) Preconditioned Krylov solvers on GPUs. Parallel Comput 68 32-44
[5]
Aage N (2018) A stencil scaling approach for accelerating matrix-free finite element implementations. SIAM J Sci Comput 40 C748-C778
[6]
Altinkaynak A (2013) A parallel node-based solution scheme for implicit finite element method using GPU. Proc Eng 61 318-324
[7]
Anzt H (1986) Element-by-element linear and nonlinear solution schemes. Int J Numer Methods Biomed Eng 2 145-153
[8]
Gates M (2011) Assembly of finite element methods on graphics processors. Int J Numer Methods Eng 85 640-669
[9]
Dongarra J (2019) Batched triangular dense linear algebra kernels for very small matrix sizes on GPUs. ACM Trans Math Softw 45 15:1-15:28
[10]
Kreutzer M (2019) A matrix-free high-order discontinuous Galerkin compressible Navier–Stokes solver: a performance comparison of compressible and incompressible formulations for turbulent incompressible flows. Int J Numer Methods Fluids 89 71-102