Asynchronous Saddle Point Algorithm for Stochastic Optimization in Heterogeneous Networks

被引：26

作者：

Bedi, Amrit Singh ^{[1
]}

Koppel, Alec ^{[1
]}

Rajawat, Ketan ^{[2
]}

机构：

[1] US Army, Res Lab, Adelphi, MD 20783 USA

[2] Indian Inst Technol Kanpur, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India

来源：

IEEE TRANSACTIONS ON SIGNAL PROCESSING | 2019年 / 67卷 / 07期

关键词：

Stochastic optimizatiokn; gradient descent; multi-agent systems; interference management; ALTERNATING DIRECTION METHOD; APPROXIMATION;

D O I：

10.1109/TSP.2019.2894803

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We consider expected risk minimization in multi-agent systems comprised of distinct subsets of agents operating without a common time scale. Each individual in the network is charged with minimizing the global objective function, which is an average of sum of the statistical average loss function of each agent in the network. Since agents are not assumed to observe data from identical distributions, the hypothesis that all agents seek a common action is violated, and thus the hypothesis upon that consensus constraints are formulated is violated. Thus, we consider nonlinear network proximity constraints, which incentivize nearby nodes to make decisions that are close to one another but do not necessarily coincide. Moreover, agents are not assumed to receive their sequentially arriving observations on a common time index, and thus seek to learn in an asynchronous manner. An asynchronous stochastic variant of the Arrow-Hurwicz saddle point method is proposed to solve this problem that operates by alternating primal stochastic descent steps and Lagrange multiplier updates that penalize the discrepancies between agents. This tool leads to an implementation that allows for each agent to operate asynchronously with local information only and message passing with neighbors. Our main result establishes that the proposed method yields convergence in expectation both in terms of the primal sub-optimality and constraint violation to radii of sizes O(root T) and O(T-3/4), respectively. Empirical evaluation on an asynchronously operating wireless network that manages user channel interference through an adaptive communications pricing mechanism demonstrates that our theoretical results translates well to practice.

引用

页码：1742 / 1757

页数：16

共 52 条

[1]

Agarwal A., 2011, Advances in NeurIPS, V24

[2]

[Anonymous], 1989, PARALLEL DISTRIBUTED

[3]

[Anonymous], FOUND TRENDS MACH LE

[4]

[Anonymous], 2014, P INT C MACH LEARN

[5]

[Anonymous], 1998, Online Algorithms and Stochastic Approximations

[6]

Arrow K., 1958, Stanford Math. Stud. Social Sci, VII

[7]

Bach F, 2014, J MACH LEARN RES, V15, P595

[8]

Bedi AS, 2018, IEEE DECIS CONTR P, P3229, DOI 10.1109/CDC.2018.8619362

[9] Asynchronous Incremental Stochastic Dual Descent Algorithm for Network Resource Allocation [J].

Bedi, Amrit Singh ;

Rajawat, Ketan .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (09) :2229-2244

[10]

Bertsekas D., 2003, Convex Analysis and Optimization, V1

← 1 2 3 4 5 6 →