ONLINE LEARNING WITH PROBABILISTIC FEEDBACK

被引：6

作者：

Ghari, Pouya M. ^{[1
]}

Shen, Yanning ^{[1
]}

机构：

[1] Univ Calif Irvine, Dept Elect Engn & Comp Sci, Irvine, CA 92717 USA

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年

关键词：

Online Learning; Graphs; Expert Advice; BANDITS;

D O I：

10.1109/ICASSP43922.2022.9746797

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Online learning with expert advice is widely used in various machine learning tasks. It considers the problem where a learner chooses one from a set of experts to take advice and make a decision. In many learning problems, experts may be related, henceforth the learner can observe the losses associated with a subset of experts that are related to the chosen one. In this context, the relationship among experts can be captured by a feedback graph, which can be used to assist the learner's decision-making. However, in practice, the nominal feedback graph often entails uncertainties, which renders it impossible to reveal the actual relationship among experts. To cope with this challenge, the present work develops a novel online learning algorithm to deal with uncertainties while making use of the uncertain feedback graph. The proposed algorithm is proved to enjoy sublinear regret under mild conditions. Experiments on real datasets are presented to demonstrate the effectiveness of the novel algorithm.

引用

页码：4183 / 4187

页数：5

共 29 条

[1] NONSTOCHASTIC MULTI-ARMED BANDITS WITH GRAPH-STRUCTURED FEEDBACK
Alon, Noga
Cesa-Bianchi, Nicolo
Gentile, Claudio
Mannor, Shie
Mansour, Yishay
Shamir, Ohad
[J]. SIAM JOURNAL ON COMPUTING, 2017, 46 (06) : 1785 - 1826
[2] Alon Noga, 2015, C LEARNING THEORY, P23
[3] Amin Kareem, 2015, AAAI C ART INT
[4] [Anonymous], 2019, P INT C MACH LEARN J
[5] Auer P, 2003, SIAM J COMPUT, V32, P48, DOI 10.1137/S0097539701398375
[6] Cesa-Bianchi Nicolo, 2006, Prediction, Learning, and Games
[7] How to use expert advice
CesaBianchi, N
Freund, Y
Haussler, D
Helmbold, DP
Schapire, RE
Warmuth, MK
[J]. JOURNAL OF THE ACM, 1997, 44 (03) : 427 - 485
[8] Chvatal V., 1979, Mathematics of Operations Research, V4, P233, DOI 10.1287/moor.4.3.233
[9] Cohen A, 2016, PR MACH LEARN RES, V48
[10] Cortes Corinna, 2020, PMLR, P2154

← 1 2 3 →