ONLINE LEARNING WITH PROBABILISTIC FEEDBACK

Cited by: 6
Authors
Ghari, Pouya M. [1]
Shen, Yanning [1]
Affiliations
[1] Univ Calif Irvine, Dept Elect Engn & Comp Sci, Irvine, CA 92717 USA
Source
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022
Keywords
Online Learning; Graphs; Expert Advice; Bandits
DOI
10.1109/ICASSP43922.2022.9746797
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Online learning with expert advice is widely used in machine learning tasks. In this setting, a learner chooses one expert from a set, takes its advice, and makes a decision. In many learning problems the experts are related, so the learner can also observe the losses of a subset of experts related to the chosen one. This relationship among experts can be captured by a feedback graph, which can be used to assist the learner's decision-making. In practice, however, the nominal feedback graph often entails uncertainties, which makes it impossible to reveal the actual relationship among experts. To cope with this challenge, the present work develops a novel online learning algorithm that deals with these uncertainties while still making use of the uncertain feedback graph. The proposed algorithm is proved to enjoy sublinear regret under mild conditions. Experiments on real datasets are presented to demonstrate its effectiveness.
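The setting described in the abstract can be illustrated with a minimal sketch. This is not the algorithm proposed in the paper, whose details are not given in this record. It is a generic exponential-weights learner with importance-weighted loss estimates, under the assumption that the probability of each feedback-graph edge is known to the learner. The names `run_exp_weights`, `losses`, `edge_prob`, and `eta` are hypothetical and chosen only for illustration.

```python
import math
import random


def run_exp_weights(losses, edge_prob, eta=0.1, seed=0):
    """Exponential weights with importance-weighted graph feedback (sketch).

    losses[t][j]    : loss in [0, 1] of expert j at round t (hypothetical input)
    edge_prob[i][j] : ASSUMED known probability that choosing expert i
                      reveals the loss of expert j (edge_prob[i][i] == 1)
    Returns (list of chosen experts, final probability vector).
    """
    rng = random.Random(seed)
    k = len(edge_prob)
    log_w = [0.0] * k  # log-weights, kept in log space for stability

    def distribution():
        # Normalize the log-weights into a probability vector.
        m = max(log_w)
        w = [math.exp(v - m) for v in log_w]
        total = sum(w)
        return [v / total for v in w]

    chosen = []
    for round_losses in losses:
        p = distribution()
        # Sample the expert whose advice is followed this round.
        i = rng.choices(range(k), weights=p)[0]
        chosen.append(i)
        for j in range(k):
            # The loss of expert j is revealed with probability edge_prob[i][j].
            if rng.random() < edge_prob[i][j]:
                # Marginal probability of observing expert j under p.
                q = sum(p[a] * edge_prob[a][j] for a in range(k))
                # Dividing by q makes the loss estimate unbiased.
                log_w[j] -= eta * round_losses[j] / q
    return chosen, distribution()
```

With three experts where the first always incurs a small loss, the learner's final distribution should concentrate on that expert, for example by calling `run_exp_weights([[0.1, 0.9, 0.9]] * 500, [[1, .5, .5], [.5, 1, .5], [.5, .5, 1]])`.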
Pages: 4183-4187
Number of pages: 5