ML-descent: An optimization algorithm for full-waveform inversion using machine learning

Cited by: 0
Authors
Sun, Bingbing [1]
Alkhalifah, Tariq [1]
Affiliations
[1] King Abdullah Univ Sci & Technol Phys Sci & Engn, Thuwal 23955, Saudi Arabia
Keywords
VELOCITY INVERSION; MODEL;
DOI
10.1190/GEO2019-0641.1
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification codes
0708 ; 070902 ;
Abstract
Full-waveform inversion (FWI) is a nonlinear optimization problem, and a typical optimization algorithm such as the nonlinear conjugate gradient or limited-memory Broyden-Fletcher-Goldfarb-Shanno (LBFGS) iteratively updates the model mainly along the gradient-descent direction of the misfit function or a slight modification of it. Based on the concept of meta-learning, rather than using a hand-designed optimization algorithm, we have trained the machine (represented by a neural network) to learn an optimization algorithm, called "ML-descent," and applied it to FWI. Using a recurrent neural network (RNN), we use the gradient of the misfit function as the input, and the hidden states in the RNN incorporate the history information of the gradient, similar to an LBFGS algorithm. However, unlike the fixed form of the LBFGS algorithm, the machine-learning (ML) version evolves in response to the gradient. The loss function for training is formulated as a weighted summation of the L2 norm of the data residuals in the original inverse problem. As with any well-defined nonlinear inverse problem, the optimization can be locally approximated by a linear convex problem; thus, to accelerate the training, we train the neural network by minimizing randomly generated quadratic functions instead of performing time-consuming FWIs. To further improve the accuracy and robustness, we use a variational autoencoder that projects and represents the model in latent space. We use the Marmousi and the overthrust examples to demonstrate that the ML-descent method converges faster than conventional optimization algorithms and outperforms them. The energy in the deeper part of the models can be recovered by ML-descent even when the pseudoinverse of the Hessian is not incorporated in the FWI update.
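The training idea in the abstract (a recurrent update rule fed with gradients, scored by a weighted sum of misfits over an unrolled run on random quadratic functions) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and not the authors' implementation: the three-parameter coordinatewise cell, the names `random_quadratic`, `recurrent_step`, `meta_loss`, and `train`, the uniform weights, and the use of random-search hill climbing as a stand-in for gradient-based training of an RNN. The variational autoencoder is omitted.

```python
import numpy as np

def random_quadratic(rng, dim):
    """Sample a random problem f(x) = 0.5 * ||A x - b||^2 (hypothetical setup)."""
    A = rng.standard_normal((dim, dim)) / np.sqrt(dim)
    b = rng.standard_normal(dim)
    return A, b

def loss_and_grad(A, b, x):
    """Return the misfit value and its gradient A^T (A x - b)."""
    r = A @ x - b
    return 0.5 * float(r @ r), A.T @ r

def recurrent_step(params, grad, hidden):
    """Toy coordinatewise recurrent cell.

    The hidden state blends the previous hidden state with the new gradient,
    so the proposed update depends on gradient history.
    """
    w_h, w_g, w_out = params
    hidden = np.tanh(w_h * hidden + w_g * grad)
    update = -w_out * hidden
    return update, hidden

def meta_loss(params, A, b, n_steps=20, weights=None):
    """Weighted sum of misfit values along the unrolled optimization path."""
    x = np.zeros(A.shape[1])
    hidden = np.zeros_like(x)
    weights = np.ones(n_steps) if weights is None else weights
    total = 0.0
    for t in range(n_steps):
        _, g = loss_and_grad(A, b, x)
        update, hidden = recurrent_step(params, g, hidden)
        x = x + update
        f, _ = loss_and_grad(A, b, x)
        total += weights[t] * f
    return total

def train(n_iters=100, dim=10, batch=8, seed=0):
    """Tune the cell parameters on a fixed batch of random quadratics.

    Random-search hill climbing is used here only to keep the sketch
    dependency-free; it replaces backpropagation through the unrolled steps.
    """
    rng = np.random.default_rng(seed)
    problems = [random_quadratic(rng, dim) for _ in range(batch)]

    def batch_loss(p):
        return float(np.mean([meta_loss(p, A, b) for A, b in problems]))

    params = np.array([0.5, 0.5, 0.1])
    initial = best = batch_loss(params)
    for _ in range(n_iters):
        candidate = params + 0.05 * rng.standard_normal(3)
        cand_loss = batch_loss(candidate)
        if cand_loss < best:  # keep a candidate only if it lowers the meta-loss
            params, best = candidate, cand_loss
    return params, initial, best
```

Calling `train()` returns the tuned parameters together with the meta-loss before and after tuning. Because a candidate is accepted only when it lowers the batch meta-loss, the final value is never larger than the initial one on the training batch; how well such a rule transfers to an actual FWI misfit is not something this sketch addresses.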
Pages: R477-R492
Page count: 16