Federated Linear Contextual Bandits with Heterogeneous Clients

被引:0
作者
Blaser, Ethan [1 ]
Li, Chuanhao [1 ]
Wang, Hongning [1 ]
机构
[1] Univ Virginia, Charlottesville, VA 22903 USA
来源
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238 | 2024年 / 238卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The demand for collaborative and private bandit learning across multiple agents is surging due to the growing quantity of data generated from distributed systems. Federated bandit learning has emerged as a promising framework for private, efficient, and decentralized online learning. However, almost all previous works rely on strong assumptions of client homogeneity, i.e., all participating clients shall share the same bandit model; otherwise, they all would suffer linear regret. This greatly restricts the application of federated bandit learning in practice. In this work, we introduce a new approach for federated bandits for heterogeneous clients, which clusters clients for collaborative bandit learning under the federated learning setting. Our proposed algorithm achieves non-trivial sub-linear regret and communication cost for all clients, subject to the communication protocol under federated learning that at anytime only one model can be shared by the server.
引用
收藏
页数:24
相关论文
共 37 条
[1]  
Abbasi-Yadkori Y., 2011, Advances in neural information processing systems, P2312, DOI DOI 10.5555/2986459.2986717
[2]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[3]  
Auer P, 1995, AN S FDN CO, P322, DOI 10.1109/SFCS.1995.492488
[4]  
Besson L., 2018, What doubling tricks can and can't do for multi-armed bandits.
[5]  
Bonawitz K., 2019, ARXIV190201046, P374
[6]  
Buccapatnam S, 2013, IEEE DECIS CONTR P, P7309, DOI 10.1109/CDC.2013.6761049
[7]  
Cantador I, 2011, P 5 ACM C REC SYST R
[8]   INTERPRETATION AND USE OF GENERALIZED CHOW TESTS [J].
CANTRELL, RS ;
BURROWS, PM ;
VUONG, QH .
INTERNATIONAL ECONOMIC REVIEW, 1991, 32 (03) :725-741
[9]  
Caron Stephane, 2012, ARXIV
[10]  
Cesa-Bianchi N., 2013, ADV NEURAL INFORM PR, P737