Learning Multiclass Classifier Under Noisy Bandit Feedback

被引:3
|
作者
Agarwal, Mudit [1 ]
Manwani, Naresh [1 ]
机构
[1] Int Inst Informat Technol Hyderabad, Machine Learning Lab, KCIS, Hyderabad, India
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II | 2021年 / 12713卷
关键词
Online learning; Recommender system; Classification;
D O I
10.1007/978-3-030-75765-6_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner may not receive true feedback. Instead, it receives feedback that has been flipped with some non-zero probability. We propose a novel approach to deal with noisy bandit feedback based on the unbiased estimator technique. We further offer a method that can efficiently estimate the noise rates, thus providing an end-to-end framework. The proposed algorithm enjoys a mistake bound of the order of O(root T) in the high noise case and of the order of O(T-2/3) in the worst case. We show our approach's effectiveness using extensive experiments on several benchmark datasets.
引用
收藏
页码:448 / 460
页数:13
相关论文
共 50 条
  • [1] Multiclass Online Learnability under Bandit Feedback
    Raman, Ananth
    Raman, Vinod
    Subedi, Unique
    Mehalel, Idan
    Tewari, Ambuj
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
  • [2] Online Multiclass Learning with "Bandit" Feedback under a Confidence-Weighted Approach
    Shi, Chaoran
    Wang, Xiong
    Tian, Xiaohua
    Gan, Xiaoying
    Wang, Xinbing
    2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
  • [3] Online Multiclass Boosting with Bandit Feedback
    Zhang, Daniel T.
    Jung, Young Hun
    Tewari, Ambuj
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [4] Beyond Bandit Feedback in Online Multiclass Classification
    van der Hoeven, Dirk
    Fusco, Federico
    Cesa-Bianchi, Nicole
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Multiclass classification with bandit feedback using adaptive regularization
    Crammer, Koby
    Gentile, Claudio
    MACHINE LEARNING, 2013, 90 (03) : 347 - 383
  • [6] Mixtron: Bandit Online Multiclass Prediction with Implicit Feedback
    Feng, Wanjin
    Shi, Hailong
    Zhao, Peilin
    Gao, Xingyu
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1004 - 1012
  • [7] New bounds on the price of bandit feedback for mistake-bounded online multiclass learning
    Long, Philip M.
    THEORETICAL COMPUTER SCIENCE, 2020, 808 : 159 - 163
  • [8] Multiclass classification with bandit feedback using adaptive regularization
    Koby Crammer
    Claudio Gentile
    Machine Learning, 2013, 90 : 347 - 383
  • [9] Bandit Learning with Implicit Feedback
    Qi, Yi
    Wu, Qingyun
    Wang, Hongning
    Tang, Jie
    Sun, Maosong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [10] AMOS - LEARNING MULTICLASS PATTERN CLASSIFIER
    POSPISIL, A
    PATTERN RECOGNITION, 1971, 3 (03) : 253 - &