SelectNet: Self-paced learning for high-dimensional partial differential equations

被引:41
作者
Gu, Yiqi [1 ]
Yang, Haizhao [2 ]
Zhou, Chao [3 ]
机构
[1] Natl Univ Singapore, Dept Math, 10 Lower Kent Ridge Rd, Singapore 119076, Singapore
[2] Purdue Univ, Dept Math, W Lafayette, IN 47907 USA
[3] City Univ Hong Kong, Dept Math, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
High-dimensional PDEs; Deep neural networks; Self-paced learning; Selected sampling; Least square method; Convergence; NEURAL-NETWORKS; NUMERICAL-SOLUTION; ERROR-BOUNDS; APPROXIMATION; ALGORITHM;
D O I
10.1016/j.jcp.2021.110444
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The least squares method with deep neural networks as function parametrization has been applied to solve certain high-dimensional partial differential equations (PDEs) successfully; however, its convergence is slow and might not be guaranteed even within a simple class of PDEs. To improve the convergence of the network-based least squares model, we introduce a novel self-paced learning framework, SelectNet, which quantifies the difficulty of training samples, treats samples equally in the early stage of training, and slowly explores more challenging samples, e.g., samples with larger residual errors, mimicking the human cognitive process for more efficient learning. In particular, a selection network and the PDE solution network are trained simultaneously; the selection network adaptively weighting the training samples of the solution network achieving the goal of self-paced learning. Numerical examples indicate that the proposed SelectNet model outperforms existing models on the convergence speed and the convergence robustness, especially for low-regularity solutions. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页数:18
相关论文
共 65 条
[1]  
[Anonymous], 2018, PR MACH LEARN RES
[2]  
[Anonymous], 2017, P INT C LEARN REPR I
[3]  
[Anonymous], 2018, PR MACH LEARN RES
[4]   UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[5]  
Beck C., 2019, arXiv
[6]   A unified deep artificial neural network approach to partial differential equations in complex geometries [J].
Berg, Jens ;
Nystrom, Kaj .
NEUROCOMPUTING, 2018, 317 :28-41
[7]   On a Constructive Proof of Kolmogorov's Superposition Theorem [J].
Braun, Juergen ;
Griebel, Michael .
CONSTRUCTIVE APPROXIMATION, 2009, 30 (03) :653-675
[8]  
Cai JF, 2019, Arxiv, DOI arXiv:1811.09054
[9]  
Cai W, 2019, Arxiv, DOI arXiv:1910.11710
[10]  
Chui C. K., 2018, Frontiers in Applied Mathematics and Statistics, V4