On computing the hyperparameter of extreme learning machines: Algorithm and application to computational PDEs, and comparison with classical and high-order finite elements

Cited by: 31

Authors:

Dong, Suchuan [1]

Yang, Jielin [1]

Affiliations:

[1] Purdue Univ, Ctr Computat & Appl Math, Dept Math, W Lafayette, IN 47907 USA

Source:

Journal of Computational Physics | 2022 / Volume 463

Keywords:

Extreme learning machine; Local extreme learning machine; Neural network; Least squares; Nonlinear least squares; Differential evolution; Ordinary differential equations; Adaptive function approximation; Neural networks; Universal approximation; Feedforward networks; Numerical solution; Stochastic choice; Simulations; Derivatives; Framework

DOI:

10.1016/j.jcp.2022.111290

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

We consider the use of extreme learning machines (ELM) for computational partial differential equations (PDE). In ELM the hidden-layer coefficients in the neural network are assigned to random values generated on [-R_m, R_m] and fixed, where R_m is a user-provided constant, and the output-layer coefficients are trained by a linear or nonlinear least squares computation. We present a method for computing the optimal or near-optimal value of R_m based on the differential evolution algorithm. The presented method enables us to illuminate the characteristics of the optimal R_m for two types of ELM configurations: (i) Single-R_m-ELM, corresponding to the conventional ELM method in which a single R_m is used for generating the random coefficients in all the hidden layers, and (ii) Multi-R_m-ELM, corresponding to a modified ELM method in which multiple R_m constants are involved with each used for generating the random coefficients of a different hidden layer. We adopt the optimal R_m from this method and also incorporate other improvements into the ELM implementation. In particular, here we compute all the differential operators involving the output fields of the last hidden layer by a forward-mode auto-differentiation, as opposed to the reverse-mode auto-differentiation in a previous work. These improvements significantly reduce the network training time and enhance the ELM performance. We systematically compare the computational performance of the current improved ELM with that of the finite element method (FEM), both the classical second-order FEM and the high-order FEM with Lagrange elements of higher degrees, for solving a number of linear and nonlinear PDEs. It is shown that the current improved ELM far outperforms the classical FEM. Its computational performance is comparable to that of the high-order FEM for smaller problem sizes, and for larger problem sizes the ELM markedly outperforms the high-order FEM. © 2022 Elsevier Inc. All rights reserved.
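The idea in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes a one-hidden-layer (Single-R_m) ELM with tanh activation applied to the model problem u'' = -pi^2 sin(pi x) on [0, 1] with u(0) = u(1) = 0, whose exact solution is sin(pi x). The hidden coefficients are drawn on [-R_m, R_m] with a fixed seed, the output coefficients come from a linear least squares solve, and R_m is chosen by a small hand-written differential evolution loop (DE/rand/1 style) that minimizes the least-squares residual norm. The test problem, the cost function, the names `elm_solve` and `differential_evolution_1d`, and all parameter values are assumptions made for illustration. Derivatives are written analytically here rather than by auto-differentiation.

```python
import numpy as np

def elm_solve(R_m, n_hidden=40, n_colloc=100, seed=0):
    """One-hidden-layer ELM for u'' = f on [0, 1] with u(0) = u(1) = 0."""
    rng = np.random.default_rng(seed)  # fixed seed: the cost is deterministic in R_m
    # Hidden-layer weights and biases: random on [-R_m, R_m], then kept fixed.
    w = rng.uniform(-R_m, R_m, n_hidden)
    b = rng.uniform(-R_m, R_m, n_hidden)

    x = np.linspace(0.0, 1.0, n_colloc)
    t = np.tanh(np.outer(x, w) + b)        # hidden outputs, shape (n_colloc, n_hidden)
    d2 = -2.0 * t * (1.0 - t**2) * w**2    # second x-derivative of each hidden output

    xb = np.array([0.0, 1.0])              # boundary points
    tb = np.tanh(np.outer(xb, w) + b)

    # Rows: PDE residual at collocation points, then the two boundary conditions.
    A = np.vstack([d2, tb])
    rhs = np.concatenate([-np.pi**2 * np.sin(np.pi * x), [0.0, 0.0]])

    # Output-layer coefficients from a linear least squares solve.
    beta, *_ = np.linalg.lstsq(A, rhs, rcond=None)
    residual = np.linalg.norm(A @ beta - rhs)
    max_err = np.max(np.abs(t @ beta - np.sin(np.pi * x)))
    return residual, max_err

def differential_evolution_1d(cost, lo, hi, pop_size=8, n_gen=15, F=0.7, seed=1):
    """Minimal DE/rand/1 for a scalar parameter (illustrative, not tuned)."""
    rng = np.random.default_rng(seed)
    pop = rng.uniform(lo, hi, pop_size)
    fit = np.array([cost(p) for p in pop])
    for _ in range(n_gen):
        for i in range(pop_size):
            others = [j for j in range(pop_size) if j != i]
            a, b, c = pop[rng.choice(others, 3, replace=False)]
            # Mutation; with a single parameter, crossover always takes the mutant.
            trial = np.clip(a + F * (b - c), lo, hi)
            f_trial = cost(trial)
            if f_trial <= fit[i]:          # greedy selection
                pop[i], fit[i] = trial, f_trial
    best = np.argmin(fit)
    return pop[best], fit[best]

best_R, best_cost = differential_evolution_1d(lambda R: elm_solve(R)[0], 0.1, 8.0)
residual, max_err = elm_solve(best_R)
print(f"R_m ~ {best_R:.3f}, residual {best_cost:.2e}, max error {max_err:.2e}")
```

A Multi-R_m variant would search over one constant per hidden layer instead of a single scalar; the same loop applies once the population entries become vectors and a crossover step is added.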

Pages: 41
