Online Bandit Convex Optimization Over A Network

被引:0
作者
Yuan, Deming [1 ]
Ho, Daniel W. C. [2 ]
Hong, Yiguang [3 ]
Jiang, Guoping [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Peoples R China
[2] City Univ Hong Kong, Dept Math, Hong Kong, Hong Kong, Peoples R China
[3] Chinese Acad Sci, Acad Math & Syst Sci, Key Lab Syst & Control, Beijing 100190, Peoples R China
来源
PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016 | 2016年
基金
中国国家自然科学基金;
关键词
Online Convex Optimization; Distributed Optimization; Bandit; Regret; MULTIAGENT OPTIMIZATION; CONSENSUS; ALGORITHM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we study the online bandit optimization over an undirected and connected network. In this setting, a network consists of multiple interacting agents plays a repeated game of T rounds with the dynamic environment. The objective function is a sum of all the agents' local cost functions that change over time, due to the uncertainties in the environment; moreover, each agent can only observe the value of each cost function at a single point at each round. We propose an efficient algorithm for distributed online bandit optimization with strongly convex cost functions and obtain a bound of (O) over tilde (T-2/3) on the expected regret.
引用
收藏
页码:8090 / 8095
页数:6
相关论文
共 16 条
[1]  
[Anonymous], 2014, Advances in Neural Information Processing Systems
[2]   Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling [J].
Duchi, John C. ;
Agarwal, Alekh ;
Wainwright, Martin J. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (03) :592-606
[3]  
Flaxman AD, 2005, PROCEEDINGS OF THE SIXTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P385
[4]  
HAZAN E., INTRO ONLINE CONVEX
[5]  
Hosseini S, 2013, IEEE DECIS CONTR P, P1484, DOI 10.1109/CDC.2013.6760092
[6]   Distributed Random Projection Algorithm for Convex Optimization [J].
Lee, Soomin ;
Nedic, Angelia .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2013, 7 (02) :221-229
[7]   Approximate Projected Consensus for Convex Intersection Computation: Convergence Analysis and Critical Error Angle [J].
Lou, Youcheng ;
Shi, Guodong ;
Johansson, Karl Henrik ;
Hong, Yiguang .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (07) :1722-1736
[8]   Gossip Algorithms for Convex Consensus Optimization Over Networks [J].
Lu, Jie ;
Tang, Choon Yik ;
Regier, Paul R. ;
Bow, Travis D. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (12) :2911-2918
[9]   Constrained Consensus and Optimization in Multi-Agent Networks [J].
Nedic, Angelia ;
Ozdaglar, Asuman ;
Parrilo, Pablo A. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2010, 55 (04) :922-938
[10]   Distributed Subgradient Methods for Multi-Agent Optimization [J].
Nedic, Angelia ;
Ozdaglar, Asurrian .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2009, 54 (01) :48-61