Mini-batch descent in semiflows

Cited by: 0
Authors
Corella, Alberto Dominguez
Hernandez, Martin [1]
Affiliations
[1] Friedrich Alexander Univ Erlangen Nurnberg, Dynam Control Machine Learning & Numer, D-91058 Erlangen, Germany
Keywords
Gradient flow; mini-batch; stochastic gradient descent; domain decomposition; obstacle; optimization
DOI
10.1051/cocv/2025018
CLC number
TP [automation technology, computer technology]
Discipline code
0812
Abstract
This paper investigates the application of mini-batch gradient descent to semiflows (gradient flows). Given a loss function (potential), we introduce a continuous version of mini-batch gradient descent by randomly selecting sub-loss functions over time, defining a piecewise flow. We prove that, under suitable assumptions on the potential generating the semiflow, the mini-batch descent flow trajectory closely approximates the original semiflow trajectory on average. In addition, we study a randomized minimizing movement scheme that also approximates the semiflow of the full loss function. We illustrate the versatility of this approach across various problems, including constrained optimization, sparse inversion, and domain decomposition. Finally, we validate our results with several numerical examples.
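The mini-batch descent flow described in the abstract can be illustrated numerically: decompose a loss into sub-losses, follow the gradient flow of a randomly chosen sub-loss on each short time interval, and compare the result with the flow of the full loss. The sketch below is a minimal toy, not the paper's construction: the quadratic sub-potentials, the forward-Euler time discretization, and the resampling interval `switch` are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy decomposition of the full potential F(x) = sum_i f_i(x) into quadratic
# pieces f_i(x) = 0.5 * (x - a_i)^2, so grad f_i(x) = x - a_i.
anchors = np.array([-1.0, 0.5, 2.0])

def grad_full(x):
    # Gradient of the full loss F.
    return np.sum(x - anchors)

def grad_batch(x, idx):
    # Sub-loss gradient, rescaled so its expectation over a uniformly
    # random batch equals grad_full(x).
    return len(anchors) / len(idx) * np.sum(x - anchors[idx])

dt, T = 1e-3, 5.0
steps = int(T / dt)

# Reference trajectory: explicit Euler discretization of x' = -grad F(x).
x_full = 1.0
for _ in range(steps):
    x_full = x_full - dt * grad_full(x_full)

# Mini-batch flow: resample a random sub-loss every `switch` steps and
# follow its (rescaled) gradient flow in between, giving a piecewise flow.
switch = 10
x = 1.0
for k in range(steps):
    if k % switch == 0:
        idx = rng.choice(len(anchors), size=1, replace=False)
    x = x - dt * grad_batch(x, idx)

# Both trajectories approach the minimizer of F, here mean(anchors) = 0.5;
# the mini-batch trajectory fluctuates around it.
print(x_full, x)
```

With a short resampling interval the piecewise trajectory stays close to the full-flow trajectory on average, which is the behaviour the paper's approximation results quantify.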
Pages: 31