Continuous-time Models for Stochastic Optimization Algorithms

被引:0
|
作者
Orvieto, Antonio [1 ]
Lucchi, Aurelien [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose new continuous-time formulations for first-order stochastic optimization algorithms such as mini-batch gradient descent and variance-reduced methods. We exploit these continuous-time models, together with simple Lyapunov analysis as well as tools from stochastic calculus, in order to derive convergence bounds for various types of non-convex functions. Guided by such analysis, we show that the same Lyapunov arguments hold in discrete-time, leading to matching rates. In addition, we use these models and Ito calculus to infer novel insights on the dynamics of SGD, proving that a decreasing learning rate acts as time warping or, equivalently, as landscape stretching.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Learning interpretable continuous-time models of latent stochastic dynamical systems
    Duncker, Lea
    Bohner, Gergo
    Boussard, Julien
    Sahani, Maneesh
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [32] Applications of continuous-time stochastic methods to models of endogenous economic growth
    Univ of Washington, Seattle, United States
    Annu Rev Control, (155-166):
  • [33] Continuous-Time Policy Optimization
    Zhan, Guojian
    Jiang, Yuxuan
    Duan, Jingliang
    Li, Shengbo Eben
    Cheng, Bo
    Li, Keqiang
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 3382 - 3388
  • [34] Stochastic ordering for continuous-time processes
    Irle, A
    JOURNAL OF APPLIED PROBABILITY, 2003, 40 (02) : 361 - 375
  • [35] An Introduction to Continuous-Time Stochastic Processes
    Pascu, Mihai
    REVUE ROUMAINE DE MATHEMATIQUES PURES ET APPLIQUEES, 2007, 52 (05): : 597 - 598
  • [36] Accelerated Stochastic Mirror Descent: From Continuous-time Dynamics to Discrete-time Algorithms
    Xu, Pan
    Wang, Tianhao
    Gu, Quanquan
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [37] IDENTIFICATION OF CONTINUOUS-TIME MODELS
    JOHANSSON, R
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1994, 42 (04) : 887 - 897
  • [38] Optimization-Based Control of Constrained Nonlinear Systems with Continuous-Time Models: Adaptive Time-Grid Refinement Algorithms
    Fontes, Fernando A. C. C.
    Paiva, Luis T.
    NUMERICAL COMPUTATIONS: THEORY AND ALGORITHMS (NUMTA-2016), 2016, 1776
  • [39] Output Feedback-Based Continuous-Time Distributed PID Optimization Algorithms
    Liu, Jiaxu
    Chen, Song
    Wang, Pengkai
    Cai, Shengze
    Xu, Chao
    Chu, Jian
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2025, 12 (02): : 955 - 969
  • [40] PRINCIPLES OF STOCHASTIC DYNAMIC OPTIMIZATION IN RESOURCE-MANAGEMENT - THE CONTINUOUS-TIME CASE
    LARSON, BA
    AGRICULTURAL ECONOMICS, 1992, 7 (02) : 91 - 107