Distributed Stochastic Optimization Under a General Variance Condition
Cited by: 1
Authors:
Huang, Kun [1]
Li, Xiao [1]
Pu, Shi [1]
Affiliation:
[1] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen CUHK Shenzhen, Shenzhen 518172, Peoples R China
Distributed stochastic optimization has drawn great attention recently due to its effectiveness in solving large-scale machine learning problems. Although numerous algorithms have been proposed and successfully applied to general practical problems, their theoretical guarantees mainly rely on certain boundedness conditions on the stochastic gradients, varying from uniform boundedness to the relaxed growth condition. In addition, how to characterize the data heterogeneity among the agents and its impacts on the algorithmic performance remains challenging. In light of such motivations, we revisit the classical federated averaging algorithm (McMahan et al., 2017) as well as the more recent SCAFFOLD method (Karimireddy et al., 2020) for solving the distributed stochastic optimization problem and establish the convergence results under only a mild variance condition on the stochastic gradients for smooth nonconvex objective functions. Almost sure convergence to a stationary point is also established under the condition. Moreover, we discuss a more informative measurement for data heterogeneity as well as its implications.
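As a rough illustration of the federated averaging scheme the abstract refers to, the sketch below runs local stochastic gradient steps on each agent and then averages the local models at a server. It is a toy under stated assumptions, not the setting analyzed in the paper: the objectives are scalar quadratics rather than general smooth nonconvex functions, the gradient noise is Gaussian with fixed variance, and the names `fedavg`, `targets`, `local_steps`, `lr`, and `noise_std` are hypothetical choices made for this example.

```python
import random


def fedavg(targets, rounds=200, local_steps=5, lr=0.05, noise_std=0.1, seed=0):
    """Toy federated averaging on f_i(x) = 0.5 * (x - targets[i])**2.

    Each agent i holds one scalar quadratic; the global objective is the
    average of the f_i, whose unique stationary point is mean(targets).
    """
    rng = random.Random(seed)
    x = 0.0  # global model held by the server
    for _ in range(rounds):
        local_models = []
        for b in targets:
            x_i = x  # each agent starts from the current global model
            for _ in range(local_steps):
                # Stochastic gradient: exact gradient (x_i - b) plus zero-mean noise.
                g = (x_i - b) + rng.gauss(0.0, noise_std)
                x_i -= lr * g
            local_models.append(x_i)
        # Server step: average the local models.
        x = sum(local_models) / len(local_models)
    return x


# Heterogeneous local minimizers (made-up data for illustration).
targets = [-1.0, 0.0, 4.0]
x_final = fedavg(targets)

# Gradient magnitude of the averaged objective at the final iterate.
grad_norm = abs(x_final - sum(targets) / len(targets))
```

In this toy every agent has the same curvature, so the averaged iterate settles near the mean of the local minimizers even though those minimizers differ. With heterogeneous curvature or nonconvex losses, local steps can bias the result, which is the kind of data-heterogeneity effect the abstract says is hard to characterize.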