Learning Multiclass Classifier Under Noisy Bandit Feedback

被引：3

作者：

Agarwal, Mudit ^{[1
]}

Manwani, Naresh ^{[1
]}

机构：

[1] Int Inst Informat Technol Hyderabad, Machine Learning Lab, KCIS, Hyderabad, India

来源：

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II | 2021年 / 12713卷

关键词：

Online learning; Recommender system; Classification;

D O I：

10.1007/978-3-030-75765-6_36

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner may not receive true feedback. Instead, it receives feedback that has been flipped with some non-zero probability. We propose a novel approach to deal with noisy bandit feedback based on the unbiased estimator technique. We further offer a method that can efficiently estimate the noise rates, thus providing an end-to-end framework. The proposed algorithm enjoys a mistake bound of the order of O(root T) in the high noise case and of the order of O(T-2/3) in the worst case. We show our approach's effectiveness using extensive experiments on several benchmark datasets.

引用

页码：448 / 460

页数：13

共 50 条

[1] Multiclass Online Learnability under Bandit Feedback
Raman, Ananth
Raman, Vinod
Subedi, Unique
Mehalel, Idan
Tewari, Ambuj
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
[2] Online Multiclass Learning with "Bandit" Feedback under a Confidence-Weighted Approach
Shi, Chaoran
Wang, Xiong
Tian, Xiaohua
Gan, Xiaoying
Wang, Xinbing
2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
[3] Online Multiclass Boosting with Bandit Feedback
Zhang, Daniel T.
Jung, Young Hun
Tewari, Ambuj
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
[4] Beyond Bandit Feedback in Online Multiclass Classification
van der Hoeven, Dirk
Fusco, Federico
Cesa-Bianchi, Nicole
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[5] Multiclass classification with bandit feedback using adaptive regularization
Crammer, Koby
Gentile, Claudio
MACHINE LEARNING, 2013, 90 (03) : 347 - 383
[6] Mixtron: Bandit Online Multiclass Prediction with Implicit Feedback
Feng, Wanjin
Shi, Hailong
Zhao, Peilin
Gao, Xingyu
23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1004 - 1012
[7] New bounds on the price of bandit feedback for mistake-bounded online multiclass learning
Long, Philip M.
THEORETICAL COMPUTER SCIENCE, 2020, 808 : 159 - 163
[8] Multiclass classification with bandit feedback using adaptive regularization
Koby Crammer
Claudio Gentile
Machine Learning, 2013, 90 : 347 - 383
[9] Bandit Learning with Implicit Feedback
Qi, Yi
Wu, Qingyun
Wang, Hongning
Tang, Jie
Sun, Maosong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[10] AMOS - LEARNING MULTICLASS PATTERN CLASSIFIER
POSPISIL, A
PATTERN RECOGNITION, 1971, 3 (03) : 253 - &

← 1 2 3 4 5 →