Exploiting temporal data reuse and asynchrony in the reverse time migration

Cited by: 1
Authors
Qu, Long [1]
Abdelkhalak, Rached [2]
Ltaief, Hatem [1]
Said, Issam [2]
Keyes, David [1, 3]
Affiliations
[1] King Abdullah Univ Sci & Technol, Extreme Comp Res Ctr, 4700 King Abdullah Univ Sci & Technol, Jeddah 23955, Saudi Arabia
[2] NVIDIA, Jeddah, Saudi Arabia
[3] King Abdullah Univ Sci & Technol, Appl Math & Computat Sci, Jeddah, Saudi Arabia
Keywords
reverse time migration; asynchronous imaging condition; data reuse; temporal blocking; high performance computing; oil and gas exploration; stencil computations; optimization; blocking
DOI
10.1177/10943420221128529
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Reverse Time Migration (RTM) is a state-of-the-art algorithm used in seismic depth imaging in complex geological environments for the oil and gas exploration industry. It calculates high-resolution images by solving the three-dimensional acoustic wave equation using seismic datasets recorded at various receiver locations. RTM's computational phases are predominantly composed of stencil computational kernels for the finite-difference time-domain scheme, the application of absorbing boundary conditions, and the I/O operations needed for the imaging condition. In this paper, we integrate the asynchronous Multicore Wavefront Diamond (MWD) tiling approach into the full RTM workflow. MWD further increases data reuse by combining spatial blocking with Temporal Blocking (TB) during the stencil computations. This integration raises new challenges with a snowball effect on the legacy synchronous RTM workflow, as it requires rethinking how the absorbing boundary conditions, the I/O operations, and the imaging condition operate. These disruptive changes are necessary to maintain the performance superiority of asynchronous stencil execution throughout the time integration while ensuring that the quality of the subsurface image does not deteriorate. We assess the overall performance of the new MWD-based RTM and compare it against the traditional Spatial Blocking (SB)-based RTM on various shared-memory systems using the SEG Salt3D model. The MWD-based RTM achieves a speedup of up to 70% over the SB-based RTM. To our knowledge, this paper demonstrates for the first time the applicability of asynchronous execution with temporal blocking throughout the whole RTM. This may eventually create new research opportunities in improving hydrocarbon extraction for the petroleum industry.
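To make the idea of temporal blocking in the abstract concrete, the following is a minimal sketch and not the paper's method. It uses a 1D three-point averaging stencil instead of the 3D acoustic wave equation, and a simple overlapped (redundant-halo) tiling instead of MWD diamond tiling. All function names and parameters are hypothetical, chosen for illustration. Each tile is loaded once with a halo of `t` cells per side and advanced `t` time steps before moving on, which is the data-reuse effect temporal blocking aims for.

```python
def step(u):
    """One sweep of a 3-point averaging stencil; the two end points are held fixed."""
    v = u[:]
    for i in range(1, len(u) - 1):
        v[i] = (u[i - 1] + u[i] + u[i + 1]) / 3.0
    return v


def naive(u, steps):
    """Reference: sweep the whole grid once per time step (no temporal reuse)."""
    for _ in range(steps):
        u = step(u)
    return u


def overlapped_temporal_blocking(u, steps, block, tb):
    """Advance each spatial tile up to `tb` time steps at once.

    Assumption: overlapped tiling with redundant halo work, a simpler
    stand-in for diamond tiling. A tile of `block` cells is extended by
    `t` halo cells per side; after `t` local steps only the halo cells
    are contaminated by the artificial tile edges, so the interior
    `block` cells match the naive result.
    """
    n = len(u)
    u = u[:]
    done = 0
    while done < steps:
        t = min(tb, steps - done)          # time steps fused in this pass
        out = u[:]
        for start in range(0, n, block):
            stop = min(start + block, n)
            lo = max(start - t, 0)         # halo clipped at the true boundary
            hi = min(stop + t, n)
            tile = u[lo:hi]
            for _ in range(t):
                tile = step(tile)
            out[start:stop] = tile[start - lo:stop - lo]
        u = out
        done += t
    return u


if __name__ == "__main__":
    u0 = [float(i % 7) for i in range(50)]
    ref = naive(u0, 10)
    blk = overlapped_temporal_blocking(u0, 10, block=8, tb=3)
    print(max(abs(a - b) for a, b in zip(ref, blk)))
```

The overlapped variant trades redundant halo computation for fewer passes over the full grid. Diamond tiling, as named in the abstract, avoids that redundancy by shaping tiles so that dependencies between neighbouring tiles are respected without recomputation.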
Pages: 132-150
Number of pages: 19