A modified batch intrinsic plasticity method for pre-training the random coefficients of extreme learning machines

被引:17
作者
Dong, Suchuan [1 ]
Li, Zongwei [2 ]
机构
[1] Purdue Univ, Dept Math, Ctr Computat & Appl Math, W Lafayette, IN 47907 USA
[2] Purdue Univ, Dept Math, Ft Wayne, IN USA
关键词
Batch intrinsic plasticity; Extreme learning machine; Neural network; Scientific machine learning; Least squares; Differential equation; ADAPTIVE FUNCTION APPROXIMATION; PARTIAL-DIFFERENTIAL-EQUATIONS; ARTIFICIAL NEURAL-NETWORKS; BOUNDARY-CONDITION; STOCHASTIC CHOICE; ALGORITHM; FRAMEWORK; FLOWS;
D O I
10.1016/j.jcp.2021.110585
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In extreme learning machines (ELM) the hidden-layer coefficients are randomly set and fixed, while the output-layer coefficients of the neural network are computed by a least squares method. The randomly-assigned coefficients in ELM are known to influence its performance and accuracy significantly. In this paper we present a modified batch intrinsic plasticity (modBIP) method for pre-training the random coefficients in the ELM neural networks. The current method is devised based on the same principle as the batch intrinsic plasticity (BIP) method, namely, by enhancing the information transmission in every node of the neural network. It differs from BIP in two prominent aspects. First, modBIP does not involve the activation function in its algorithm, and it can be applied with any activation function in the neural network. In contrast, BIP employs the inverse of the activation function in its construction, and requires the activation function to be invertible (or monotonic). The modBIP method can work with the often-used non-monotonic activation functions (e.g. Gaussian, swish, Gaussian error linear unit, and radial-basis type functions), with which BIP breaks down. Second, modBIP generates target samples on random intervals with a minimum size, which leads to highly accurate computation results when combined with ELM. The combined ELM/modBIP method is markedly more accurate than ELM/BIP in numerical simulations. Ample numerical experiments are presented with shallow and deep neural networks for function approximation and boundary/initial value problems with partial differential equations. They demonstrate that the combined ELM/modBIP method produces highly accurate simulation results, and that its accuracy is insensitive to the random-coefficient initializations in the neural network. This is in sharp contrast with the ELM results without pre-training of the random coefficients. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页数:31
相关论文
共 53 条
[1]   A study on the relationship between the rank of input data and the performance of random weight neural network [J].
Cao, Weipeng ;
Hu, Lei ;
Gao, Jinzhu ;
Wang, Xizhao ;
Ming, Zhong .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16) :12685-12696
[2]   Multiphase flows of N immiscible incompressible fluids: Areduction-consistent and thermodynamically-consistent formulation and associated algorithm [J].
Dong, S. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2018, 361 :1-49
[3]   Wall-bounded multiphase flows of N immiscible incompressible fluids: Consistency and contact-angle boundary condition [J].
Dong, S. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2017, 338 :21-67
[4]   A convective-like energy-stable open boundary condition for simulations of incompressible flows [J].
Dong, S. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2015, 302 :300-328
[5]   A pressure correction scheme for generalized form of energy-stable open boundary conditions for incompressible flows [J].
Dong, S. ;
Shen, J. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2015, 291 :254-278
[6]   An outflow boundary condition and algorithm for incompressible two-phase flows with phase field approach [J].
Dong, S. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2014, 266 :47-73
[7]  
Dong S, 2020, ARXIV201202895
[8]   A method for representing periodic functions and enforcing exactly periodic boundary conditions with deep neural networks [J].
Dong, Suchuan ;
Ni, Naxian .
JOURNAL OF COMPUTATIONAL PHYSICS, 2021, 435
[10]  
Dwivedi V, 2020, NEUROCOMPUTING, V391, P96