A split preconditioning scheme for nonlinear underdetermined least squares problems

被引:1
作者
Vater, Nadja [1 ]
Borzi, Alfio [1 ]
机构
[1] Univ Wurzburg, Inst Math, Emil Fischer Str 30, D-97074 Wurzburg, Germany
关键词
coarse-level correction; gradient method; nonlinear least squares problems; randomized preconditioning; OPTIMIZATION;
D O I
10.1002/nla.2558
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The convergence of preconditioned gradient methods for nonlinear underdetermined least squares problems arising in, for example, supervised learning of overparameterized neural networks is investigated. In this general setting, conditions are given that guarantee the existence of global minimizers that correspond to zero residuals and a proof of the convergence of a gradient method to these global minima is presented. In order to accelerate convergence of the gradient method, different preconditioning strategies are developed and analyzed. In particular, a left randomized preconditioner and a right coarse-level correction preconditioner are combined and investigated. It is demonstrated that the resulting split preconditioned two-level gradient method incorporates the advantages of both approaches and performs very efficiently.
引用
收藏
页数:17
相关论文
共 23 条
  • [1] Database-friendly random projections: Johnson-Lindenstrauss with binary coins
    Achlioptas, D
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) : 671 - 687
  • [2] A State-of-the-Art Survey on Deep Learning Theory and Architectures
    Alom, Md Zahangir
    Taha, Tarek M.
    Yakopcic, Chris
    Westberg, Stefan
    Sidike, Paheding
    Nasrin, Mst Shamima
    Hasan, Mahmudul
    Van Essen, Brian C.
    Awwal, Abdul A. S.
    Asari, Vijayan K.
    [J]. ELECTRONICS, 2019, 8 (03)
  • [3] [Anonymous], 2013, Least squares data fitting with applications
  • [4] BLENDENPIK: SUPERCHARGING LAPACK'S LEAST-SQUARES SOLVER
    Avron, Haim
    Maymounkov, Petar
    Toledo, Sivan
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2010, 32 (03) : 1217 - 1236
  • [5] Axelsson O., 1994, ITERATIVE SOLUTION M
  • [6] Global Minima of Overparameterized Neural Networks
    Cooper, Yaim
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (02): : 676 - 691
  • [7] Garipov T, 2018, ADV NEUR IN, V31
  • [8] Loss landscapes and optimization in over-parameterized non-linear systems and neural networks
    Liu, Chaoyue
    Zhu, Libin
    Belkin, Mikhail
    [J]. APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2022, 59 : 85 - 116
  • [9] Randomized numerical linear algebra: Foundations and algorithms
    Martinsson, Per-Gunnar
    Tropp, Joel A.
    [J]. ACTA NUMERICA, 2020, 29 : 403 - 572
  • [10] Meng X., 2014, RANDOMIZED ALGORITHM