A multi-GPU and CUDA-aware MPI-based spectral element formulation for ultrasonic wave propagation in solid media
被引:1
作者:
Li, Feilong
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hung Hom, Kowloon, Hong Kong, Peoples R ChinaHong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hung Hom, Kowloon, Hong Kong, Peoples R China
Li, Feilong
[1
]
Zou, Fangxin
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hung Hom, Kowloon, Hong Kong, Peoples R ChinaHong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hung Hom, Kowloon, Hong Kong, Peoples R China
Zou, Fangxin
[1
]
Rao, Jing
论文数: 0引用数: 0
h-index: 0
机构:
Beihang Univ, Sch Instrumentat & Optoelect Engn, Beijing 100191, Peoples R China
Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT 2600, AustraliaHong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hung Hom, Kowloon, Hong Kong, Peoples R China
Rao, Jing
[2
,3
]
机构:
[1] Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hung Hom, Kowloon, Hong Kong, Peoples R China
[2] Beihang Univ, Sch Instrumentat & Optoelect Engn, Beijing 100191, Peoples R China
[3] Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT 2600, Australia
In this paper, we introduce a new multi-GPU-based spectral element (SE) formulation for simulating ultrasonic wave propagation in solids. To maximize communication efficiency, we purposely developed, based on CUDA-aware MPI, two novel message exchange strategies which allow the common nodal forces of different sub-domains to be shared between different GPUs in a direct manner, as opposed to via CPU hosts, during central difference-based time integration steps. The new multi-GPU and CUDA-aware MPI-based formulation is benchmarked against a multi-CPU core and classical MPI-based counterpart, demonstrating a remarkable ac-celeration in each and every stage of the computation of ultrasonic wave propagation, namely matrix assembly, time integration and message exchange. More importantly, both the computational efficiency and the degree-of-freedom limit of the new formulation are actually scalable with the number of GPUs used, potentially allowing larger structures to be computed and higher computational speeds to be realized. Finally, the new formulation was used to simulate the interaction between Lamb waves and randomly shaped thickness loss defects on plates, showing its potential to become an efficient, accurate and robust technique for addressing the propagation of ultrasonic waves in realistic engineering structures.
机构:
Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USAStanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
Cecka, Cris
Lew, Adrian J.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USAStanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
Lew, Adrian J.
Darve, E.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USAStanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
机构:
Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USAStanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
Cecka, Cris
Lew, Adrian J.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USAStanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
Lew, Adrian J.
Darve, E.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USAStanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA