On computing the hyperparameter of extreme learning machines: Algorithm and application to computational PDEs, and comparison with classical and high-order finite elements

被引:31
作者
Dong, Suchuan [1 ]
Yang, Jielin [1 ]
机构
[1] Purdue Univ, Ctr Computat & Appl Math, Dept Math, W Lafayette, IN 47907 USA
关键词
Extreme learning machine; Local extreme learning machine; Neural network; Least squares; Nonlinear least squares; Differential evolution; ORDINARY DIFFERENTIAL-EQUATIONS; ADAPTIVE FUNCTION APPROXIMATION; NEURAL-NETWORKS; UNIVERSAL APPROXIMATION; FEEDFORWARD NETWORKS; NUMERICAL-SOLUTION; STOCHASTIC CHOICE; SIMULATIONS; DERIVATIVES; FRAMEWORK;
D O I
10.1016/j.jcp.2022.111290
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We consider the use of extreme learning machines (ELM) for computational partial differential equations (PDE). In ELM the hidden-layer coefficients in the neural network are assigned to random values generated on [-R-m, R-m] and fixed, where R-m is a user-provided constant, and the output-layer coefficients are trained by a linear or nonlinear least squares computation. We present a method for computing the optimal or near-optimal value of R-m based on the differential evolution algorithm. The presented method enables us to illuminate the characteristics of the optimal R-m for two types of ELM configurations: (i) Single-R-m-ELM, corresponding to the conventional ELM method in which a single R-m is used for generating the random coefficients in all the hidden layers, and (ii) Multi-R-m-ELM, corresponding to a modified ELM method in which multiple R-m constants are involved with each used for generating the random coefficients of a different hidden layer. We adopt the optimal R-m from this method and also incorporate other improvements into the ELM implementation. In particular, here we compute all the differential operators involving the output fields of the last hidden layer by a forward-mode auto-differentiation, as opposed to the reverse-mode auto-differentiation in a previous work. These improvements significantly reduce the network training time and enhance the ELM performance. We systematically compare the computational performance of the current improved ELM with that of the finite element method (FEM), both the classical second-order FEM and the high-order FEM with Lagrange elements of higher degrees, for solving a number of linear and nonlinear PDEs. It is shown that the current improved ELM far outperforms the classical FEM. Its computational performance is comparable to that of the high-order FEM for smaller problem sizes, and for larger problem sizes the ELM markedly outperforms the high-order FEM. (c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页数:41
相关论文
共 69 条
[11]   A convective-like energy-stable open boundary condition for simulations of incompressible flows [J].
Dong, S. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2015, 302 :300-328
[12]   A time-stepping scheme involving constant coefficient matrices for phase-field simulations of two-phase incompressible flows with large density ratios [J].
Dong, S. ;
Shen, J. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2012, 231 (17) :5788-5804
[13]  
DONG S, J COMPUT PHYS, V435
[14]   P-refinement and P-threads [J].
Dong, SC ;
Karniadakis, GE .
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2003, 192 (19) :2191-2201
[15]   A modified batch intrinsic plasticity method for pre-training the random coefficients of extreme learning machines [J].
Dong, Suchuan ;
Li, Zongwei .
JOURNAL OF COMPUTATIONAL PHYSICS, 2021, 445
[16]   Local extreme learning machines and domain decomposition for solving linear and nonlinear partial differential equations [J].
Dong, Suchuan ;
Li, Zongwei .
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2021, 387
[17]   Physics Informed Extreme Learning Machine (PIELM)–A rapid method for the numerical solution of partial differential equations [J].
Dwivedi V. ;
Srinivasan B. .
Neurocomputing, 2022, 391 :96-118
[18]   A simplicial homology algorithm for Lipschitz optimisation [J].
Endres, Stefan C. ;
Sandrock, Carl ;
Focke, Walter W. .
JOURNAL OF GLOBAL OPTIMIZATION, 2018, 72 (02) :181-217
[19]   Numerical solution and bifurcation analysis of nonlinear partial differential equations with extreme learning machines [J].
Fabiani, Gianluca ;
Calabro, Francesco ;
Russo, Lucia ;
Siettos, Constantinos .
JOURNAL OF SCIENTIFIC COMPUTING, 2021, 89 (02)
[20]   On robust randomized neural networks for regression: a comprehensive review and evaluation [J].
Freire, Ananda L. ;
Rocha-Neto, Ajalmar R. ;
Barreto, Guilherme A. .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (22) :16931-16950