Kronecker Neural Networks Overcome Spectral Bias for PINN-Based Wavefield Computation

被引:13
作者
bin Waheed, Umair [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Dept Geosci, Dhahran 31261, Saudi Arabia
关键词
Mathematical models; Computational modeling; Training; Standards; Biological neural networks; Solid modeling; Computer architecture; Helmholtz equation; physics-informed neural networks (PINNs); seismic modeling; wavefield modeling;
D O I
10.1109/LGRS.2022.3209901
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Wavefield computation constitutes the majority of the computational cost for seismic applications, including reverse-time migration and full waveform inversion (FWI). One of the popular approaches is to solve for the wavefields in the frequency domain using the Helmholtz equation. However, the Helmholtz solvers require inversion of a large stiffness matrix that can become computationally intractable for large 3-D models or in the case of modeling high frequencies. Recently, researchers have explored the potential of physics-informed neural networks (PINNs) in solving the Helmholtz equation with limited success. While a number of attractive features have been demonstrated for the PINN-based Helmholtz solvers, their large training cost has been the main impediment in their widespread adoption for wavefield computations. The large training cost is mainly due to the spectral bias of neural networks, which poses difficulty in training the PINN model for high-frequency wavefields. In this work, the author uses Kronecker neural networks (KNNs) that form a general framework for neural networks with adaptive activation functions. He, specifically, implements it using a standard feed-forward neural network using a composite activation function formed using the inverse tangent (atan), exponential linear unit (elu), locally adaptive sine (l-sin), and locally adaptive cosine (l-cos) activation functions. Thanks to the oscillatory noise added by the sine and cosine terms, the network is able to explore more and learn faster. This allows the network to get rid of saturation regions from the output of each layer and overcome slow convergence. Through numerical tests, the efficacy of the proposed approach in fast and accurate wavefield modeling in the frequency domain is demonstrated.
引用
收藏
页数:5
相关论文
共 16 条
[1]  
Al-Safwan A, 2021, Arxiv, DOI arXiv:2110.07721
[2]  
Alkhalifah T., 2021, PROC 1 INT M APPL GE, P2440
[3]   Wavefield solutions from machine learned functions constrained by the Helmholtz equation [J].
Alkhalifah, Tariq ;
Song, Chao ;
bin Waheed, Umair ;
Hao, Qi .
ARTIFICIAL INTELLIGENCE IN GEOSCIENCES, 2021, 2 :11-19
[4]   PINNeik: Eikonal solution using physics-informed neural networks [J].
bin Waheed, Umair ;
Haghighat, Ehsan ;
Alkhalifah, Tariq ;
Song, Chao ;
Hao, Qi .
COMPUTERS & GEOSCIENCES, 2021, 155
[5]   Parallel L-BFGS-B algorithm on GPU [J].
Fei, Yun ;
Rong, Guodong ;
Wang, Bin ;
Wang, Wenping .
COMPUTERS & GRAPHICS-UK, 2014, 40 :1-9
[6]   SciANN: A Keras/TensorFlow wrapper for scientific computations and physics-informed deep learning using artificial neural networks [J].
Haghighat, Ehsan ;
Juanes, Ruben .
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2021, 373
[7]  
Huang XQ, 2021, Arxiv, DOI arXiv:2109.14536
[8]   Deep Kronecker neural networks: A general framework for neural networks with adaptive activation functions [J].
Jagtap, Ameya D. ;
Shin, Yeonjong ;
Kawaguchi, Kenji ;
Karniadakis, George Em .
NEUROCOMPUTING, 2022, 468 (165-180) :165-180
[9]  
Kingma DP, 2014, ADV NEUR IN, V27
[10]   Seismic waveform inversion in the frequency domain, Part 1: Theory and verification in a physical scale model [J].
Pratt, RG .
GEOPHYSICS, 1999, 64 (03) :888-901