Time-domain seismic modeling in viscoelastic media for full waveform inversion on heterogeneous computing platforms with OpenCL

被引:29
作者
Fabien-Ouellet, Gabriel [1 ]
Gloaguen, Erwan [1 ]
Giroux, Bernard [1 ]
机构
[1] INRS ETE, 490 Rue Couronne, Quebec City, PQ G1K 9A9, Canada
关键词
OpenCL; GPU; Seismic; Viscoelasticity; Full waveform Inversion; Adjoint state method; FREE-SURFACE; PROPAGATION; EFFICIENT; VELOCITY; REVERSE; SIMULATION; IMPLEMENTATION; ALGORITHM; RTM;
D O I
10.1016/j.cageo.2016.12.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Full Waveform Inversion (FWI) aims at recovering the elastic parameters of the Earth by matching recordings of the ground motion with the direct solution of the wave equation. Modeling the wave propagation for realistic scenarios is computationally intensive, which limits the applicability of FWI. The current hardware evolution brings increasing parallel computing power that can speed up the computations in FWI. However, to take advantage of the diversity of parallel architectures presently available, new programming approaches are required. In this work, we explore the use of OpenCL to develop a portable code that can take advantage of the many parallel processor architectures now available. We present a program called SeisCL for 2D and 3D viscoelastic FWI in the time domain. The code computes the forward and adjoint wavefields using finite difference and outputs the gradient of the misfit function given by the adjoint state method. To demonstrate the code portability on different architectures, the performance of SeisCL is tested on three different devices: Intel CPUs, NVidia GPUs and Intel Xeon PHI. Results show that the use of GPUs with OpenCL can speed up the computations by nearly two orders of magnitudes over a single threaded application on the CPU. Although OpenCL allows code portability, we show that some device-specific optimization is still required to get the best performance out of a specific architecture. Using OpenCL in conjunction with MPI allows the domain decomposition of large models on several devices located on different nodes of a cluster. For large enough models, the speedup of the domain decomposition varies quasi-linearly with the number of devices. Finally, we investigate two different approaches to compute the gradient by the adjoint state method and show the significant advantages of using OpenCL for FWI.
引用
收藏
页码:142 / 155
页数:14
相关论文
共 67 条
[1]  
Abdelkhalek R., 2009, FAST SEISMIC MODELIN, P36, DOI [10.1109/hpcsim.2009.5192786, DOI 10.1109/HPCSIM.2009.5192786]
[2]  
Abdullah D., 2013, DEV PARALLEL APPL MU, V4
[3]  
Anderson JE, 2012, GEOPHYSICS, V77, pS93, DOI [10.1190/GEO2011-0114.1, 10.1190/geo2011-0114.1]
[4]  
[Anonymous], 2007, Compute unified device architecture programming guide
[5]  
[Anonymous], 1992, Optimization Methods and Software, DOI DOI 10.1080/10556789208805505
[6]   Full waveform inversion for seismic velocity and anelastic losses in heterogeneous structures [J].
Askan, Aysegul ;
Akcelik, Volkan ;
Bielak, Jacobo ;
Ghattas, Omar .
BULLETIN OF THE SEISMOLOGICAL SOCIETY OF AMERICA, 2007, 97 (06) :1990-2008
[7]   Viscoacoustic waveform inversion of velocity structures in the time domain [J].
Bai, Jianyong ;
Yingst, David ;
Bloor, Robert ;
Leveille, Jacques .
GEOPHYSICS, 2014, 79 (03) :R103-R119
[8]   MODELING OF A CONSTANT-Q - METHODOLOGY AND ALGORITHM FOR AN EFFICIENT AND OPTIMALLY INEXPENSIVE VISCOELASTIC TECHNIQUE [J].
BLANCH, JO ;
ROBERTSSON, JOA ;
SYMES, WW .
GEOPHYSICS, 1995, 60 (01) :176-184
[9]   Wavefield compression for adjoint methods in full-waveform inversion [J].
Boehm, Christian ;
Hanzich, Mauricio ;
de la Puente, Josep ;
Fichtner, Andreas .
GEOPHYSICS, 2016, 81 (06) :R385-R397
[10]   Parallel 3-D viscoelastic finite difference seismic modelling [J].
Bohlen, T .
COMPUTERS & GEOSCIENCES, 2002, 28 (08) :887-899