Convergence of Momentum-based Distributed Stochastic Approximation with RL Applications

Authors
Naskar, Ankur [1]
Thoppe, Gugan [1]
Affiliations
[1] Indian Inst Sci, Dept Comp Sci & Automat, Bengaluru 560012, India
DOI
10.1109/ICC61519.2023.10442992
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We develop a novel proof strategy for deriving almost sure convergence of momentum-based distributed stochastic approximation (DSA) schemes. Popular momentum-based schemes such as Polyak's heavy-ball and Nesterov's Accelerated SGD can be analyzed using our template. Our technique enables us to do away with three restrictive assumptions of existing approaches. One, we do not need the communication matrix to be doubly stochastic. Two, we do not need the noise to be uniformly bounded. Lastly, our approach can handle cases where there are multiple or non-point attractors. As an application, we use our technique to derive convergence for momentum-based extensions of the multi-agent TD(0) algorithm, where the above restrictive assumptions do not hold.
Pages: 178-179
Number of pages: 2