A split preconditioning scheme for nonlinear underdetermined least squares problems

被引：1

作者：

Vater, Nadja ^{[1
]}

Borzi, Alfio ^{[1
]}

机构：

[1] Univ Wurzburg, Inst Math, Emil Fischer Str 30, D-97074 Wurzburg, Germany

来源：

NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS | 2024年 / 31卷 / 05期

关键词：

coarse-level correction; gradient method; nonlinear least squares problems; randomized preconditioning; OPTIMIZATION;

D O I：

10.1002/nla.2558

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

The convergence of preconditioned gradient methods for nonlinear underdetermined least squares problems arising in, for example, supervised learning of overparameterized neural networks is investigated. In this general setting, conditions are given that guarantee the existence of global minimizers that correspond to zero residuals and a proof of the convergence of a gradient method to these global minima is presented. In order to accelerate convergence of the gradient method, different preconditioning strategies are developed and analyzed. In particular, a left randomized preconditioner and a right coarse-level correction preconditioner are combined and investigated. It is demonstrated that the resulting split preconditioned two-level gradient method incorporates the advantages of both approaches and performs very efficiently.

引用

页数：17

共 23 条

[1] Database-friendly random projections: Johnson-Lindenstrauss with binary coins
Achlioptas, D
[J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) : 671 - 687
[2] A State-of-the-Art Survey on Deep Learning Theory and Architectures
Alom, Md Zahangir
Taha, Tarek M.
Yakopcic, Chris
Westberg, Stefan
Sidike, Paheding
Nasrin, Mst Shamima
Hasan, Mahmudul
Van Essen, Brian C.
Awwal, Abdul A. S.
Asari, Vijayan K.
[J]. ELECTRONICS, 2019, 8 (03)
[3] [Anonymous], 2013, Least squares data fitting with applications
[4] BLENDENPIK: SUPERCHARGING LAPACK'S LEAST-SQUARES SOLVER
Avron, Haim
Maymounkov, Petar
Toledo, Sivan
[J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2010, 32 (03) : 1217 - 1236
[5] Axelsson O., 1994, ITERATIVE SOLUTION M
[6] Global Minima of Overparameterized Neural Networks
Cooper, Yaim
[J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (02): : 676 - 691
[7] Garipov T, 2018, ADV NEUR IN, V31
[8] Loss landscapes and optimization in over-parameterized non-linear systems and neural networks
Liu, Chaoyue
Zhu, Libin
Belkin, Mikhail
[J]. APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2022, 59 : 85 - 116
[9] Randomized numerical linear algebra: Foundations and algorithms
Martinsson, Per-Gunnar
Tropp, Joel A.
[J]. ACTA NUMERICA, 2020, 29 : 403 - 572
[10] Meng X., 2014, RANDOMIZED ALGORITHM

← 1 2 3 →