Compressed Gradient Methods With Hessian-Aided Error Compensation

被引:11
|
作者
Khirirat, Sarit [1 ,2 ]
Magnusson, Sindri [3 ]
Johansson, Mikael [1 ,2 ]
机构
[1] Royal Inst Technol KTH, Dept Automat Control, Sch Elect Engn, S-11428 Stockholm, Sweden
[2] Royal Inst Technol KTH, ACCESS Linnaeus Ctr, S-11428 Stockholm, Sweden
[3] Stockholm Univ, Dept Comp & Syst Sci DSV, S-11419 Stockholm, Sweden
基金
瑞典研究理事会;
关键词
Signal processing algorithms; Optimization; Error compensation; Compressors; Approximation algorithms; Machine learning algorithms; Convergence; Quantization; gradient methods; stochastic gradient descent; DECOMPOSITION;
D O I
10.1109/TSP.2020.3048229
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The emergence of big data has caused a dramatic shift in the operating regime for optimization algorithms. The performance bottleneck, which used to be computations, is now often communications. Several gradient compression techniques have been proposed to reduce the communication load at the price of a loss in solution accuracy. Recently, it has been shown how compression errors can be compensated for in the optimization algorithm to improve the solution accuracy. Even though convergence guarantees for error-compensated algorithms have been established, there is very limited theoretical support for quantifying the observed improvements in solution accuracy. In this paper, we show that Hessian-aided error compensation, unlike other existing schemes, avoids accumulation of compression errors on quadratic problems. We also present strong convergence guarantees of Hessian-based error compensation for stochastic gradient descent. Our numerical experiments highlight the benefits of Hessian-based error compensation, and demonstrate that similar convergence improvements are attained when only a diagonal Hessian approximation is used.
引用
收藏
页码:998 / 1011
页数:14
相关论文
共 50 条
  • [1] Global Convergence of Natural Policy Gradient with Hessian-Aided Momentum Variance Reduction
    Feng, Jie
    Wei, Ke
    Chen, Jinchi
    JOURNAL OF SCIENTIFIC COMPUTING, 2024, 101 (02)
  • [2] CONVERGENCE BOUNDS FOR COMPRESSED GRADIENT METHODS WITH MEMORY BASED ERROR COMPENSATION
    Khirirat, Sarit
    Magnusson, Sindri
    Johansson, Mikael
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2857 - 2861
  • [3] Hessian Aided Policy Gradient
    Shen, Zebang
    Hassani, Hamed
    Mi, Chao
    Qian, Hui
    Ribeiro, Alejandro
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [4] Hessian-Aided Random Perturbation (HARP) Using Noisy Zeroth-Order Oracles
    Zhu, Jingyi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3717 - 3726
  • [5] Hessian-Aided Random Perturbation (HARP) Using Noisy Zeroth-Order Queries
    Zhu, Jingyi
    MATHEMATICAL AND SCIENTIFIC MACHINE LEARNING, VOL 145, 2021, 145 : 1137 - 1160
  • [6] Gradient Error Compensation in SC-MDACs
    Mohapatra, Satyajit
    Mohapatra, Nihar Ranjan
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (12) : 4359 - 4374
  • [7] Combining Natural Gradient with Hessian Free Methods for Sequence Training
    Haider, Adnan
    Woodland, Philip C.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2918 - 2922
  • [9] Compressed Sensing Radar Imaging With Compensation of Observation Position Error
    Yang, Jungang
    Huang, Xiaotao
    Thompson, John
    Jin, Tian
    Zhou, Zhimin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (08): : 4608 - 4620
  • [10] Phase Error Compensation in Imaging Systems Using Compressed Sensing Techniques
    Alvarez, Yuri
    Rodriguez-Vaqueiro, Yolanda
    Gonzalez-Valdes, Borja
    Martinez-Lorenzo, Jose Angel
    Las-Heras, Fernando
    Rappaport, Carey M.
    IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2013, 12 : 1574 - 1577