Multi-stage neural networks: Function approximator of machine precision

被引:9
|
作者
Wang, Yongji [1 ,2 ]
Lai, Ching-Yao [2 ]
机构
[1] NYU, Dept Math, New York, NY 10012 USA
[2] Stanford Univ, Dept Geophys, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Scientific machine learning; Neural networks; Physics-informed neural networks; Multi-stage training;
D O I
10.1016/j.jcp.2024.112865
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Deep learning techniques are increasingly applied to scientific problems, where the precision of networks is crucial. Despite being deemed as universal function approximators, neural networks, in practice, struggle to reduce the prediction errors below O(10-5) even with large network size and extended training iterations. To address this issue, we developed the multi -stage neural networks that divides the training process into different stages, with each stage using a new network that is optimized to fit the residue from the previous stage. Across successive stages, the residue magnitudes decreases substantially and follows an inverse power -law relationship with the residue frequencies. The multi -stage neural networks effectively mitigate the spectral biases associated with regular neural networks, enabling them to capture the high frequency feature of target functions. We demonstrate that the prediction error from the multi -stage training for both regression problems and physics -informed neural networks can nearly reach the machineprecision O(10-16) of double -floating point within a finite number of iterations. Such levels of accuracy are rarely attainable using single neural networks alone.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Automated Multi-Stage Compression of Neural Networks
    Gusak, Julia
    Kholiavchenko, Maksym
    Ponomarev, Evgeny
    Markeeva, Larisa
    Blagoveschensky, Philip
    Cichocki, Andrzej
    Oseledets, Ivan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2501 - 2508
  • [2] Convolutional Neural Networks for Multi-Stage Semiconductor Processes
    Wu, Xiaofei
    Chen, Junghui
    Xie, Lei
    Lee, Yishan
    Chen, Chun-, I
    JOURNAL OF CHEMICAL ENGINEERING OF JAPAN, 2021, 54 (08) : 449 - 455
  • [3] Microcalcification detection using multi-stage of neural networks
    Shin, JW
    Lee, SS
    Yoon, S
    Park, DS
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING, 2003, : 229 - 234
  • [4] Multi-stage neural networks for channel assignment in cellular radio networks
    Lee, HS
    Lee, DW
    Lee, J
    ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 2, 2004, 3174 : 287 - 292
  • [5] Reconfigurable multi-stage neural networks in monitoring industrial machines
    Marzi, H
    SMCia/05: Proceedings of the 2005 IEEE Mid-Summer Workshop on Soft Computing in Industrial Applications, 2005, : 142 - 147
  • [6] Portfolio optimization for multi-stage capital investment with neural networks
    Zhang, YL
    Hua, Y
    ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 2, 2004, 3174 : 982 - 987
  • [7] A Multi-Stage Memory Augmented Neural Network for Machine Reading Comprehension
    Yu, Seunghak
    Indurthi, Sathish
    Back, Seohyun
    Lee, Haejun
    MACHINE READING FOR QUESTION ANSWERING, 2018, : 21 - 30
  • [8] Exploring Multi-Stage Information Interactions for Multi-Source Neural Machine Translation
    Lu, Ziyao
    Li, Xiang
    Liu, Yang
    Zhou, Chulun
    Cui, Jianwei
    Wang, Bin
    Zhang, Min
    Su, Jinsong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 562 - 570
  • [9] Multi-Stage Influence Function
    Chen, Hongge
    Si, Si
    Li, Yang
    Chelba, Ciprian
    Kumar, Sanjiv
    Boning, Duane
    Hsieh, Cho-Jui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [10] Image Classification Using Convolutional Neural Networks With Multi-stage Feature
    Yim, Junho
    Ju, Jeongwoo
    Jung, Heechul
    Kim, Junmo
    ROBOT INTELLIGENCE TECHNOLOGY ANDAPPLICATIONS 3, 2015, 345 : 587 - 594