A Differentially Private Approach for Budgeted Combinatorial Multi-Armed Bandits

被引：0

作者：

Wang, Hengzhi ^{[1
]}

Cui, Laizhong ^{[1
]}

Wang, En ^{[2
,3
]}

Liu, Jiangchuan ^{[4
]}

机构：

[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

[2] Jilin Univ, Dept Comp Sci & Technol, Changchun 130012, Peoples R China

[3] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China

[4] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada

来源：

IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING | 2025年 / 22卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Combinatorial multi-armed bandits; differential privacy; online learning; truthfulness;

D O I：

10.1109/TDSC.2024.3401836

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

As a fundamental tool for sequential decision-making, the Combinatorial Multi-Armed Bandits model (CMAB) has been extensively analyzed and applied in various online applications. However, the privacy concerns in budgeted CMAB are rarely investigated thus far. Few bandit algorithms have adequately addressed the privacy-preserving budgeted CMAB setting. Motivated by this, we study this setting using differential privacy as the formal measure of privacy. In this setting, playing an arm yields both a random reward and a random cost, and these values are kept private. In addition, multiple arms can be played in each round. The objective of the decision-maker is to minimize regret while subject to a budget constraint on the cumulative cost of all played arms. We demonstrate an exploration-exploitation-balanced bandit policy, which preserves the privacy of both rewards and costs under budgeted CMAB settings. This policy is proven differentially private and achieves an upper bound on regret. Furthermore, to provide incentives for the differentially private bandit policy so as to ensure that the reported costs are truthful, we introduce the concept of truthfulness and incorporate a payment mechanism that has been proven to be $\sigma$sigma-truthful. Numerical simulations based on multiple real-world datasets validate the theoretical findings and demonstrate the effectiveness of our policy compared to state-of-the-art policies.

引用

页码：424 / 439

页数：16

共 50 条

[21] Combinatorial Multi-armed Bandits for Real-Time Strategy Games
Ontanon, Santiago
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2017, 58 : 665 - 702
[22] Combinatorial Multi-Armed Bandits in Cognitive Radio Networks: A Brief Overview
Kang, Sunjung
Joo, Changhee
2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 1086 - 1088
[23] Diversity-Based Recruitment in Crowdsensing by Combinatorial Multi-Armed Bandits
Sawwan, Abdalaziz
Wu, Jie
TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (02): : 732 - 747
[24] On a new approach to the analysis of complex multi-armed bandits
R. Garbe
K. D. Glazebrook
Mathematical Methods of Operations Research, 1998, 48 : 419 - 442
[25] Combinatorial multi-armed bandits for real-time strategy games
Ontañón, Santiago (santi@cs.drexel.edu), 1600, AI Access Foundation (58):
[26] Transfer Learning in Multi-Armed Bandits: A Causal Approach
Zhang, Junzhe
Bareinboim, Elias
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1340 - 1346
[27] On a new approach to the analysis of complex multi-armed bandits
Garbe, R
Glazebrook, KD
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1998, 48 (03) : 419 - 442
[28] KL-UCB-Based Policy for Budgeted Multi-Armed Bandits with Stochastic Action Costs
Watanabe, Ryo
Komiyama, Junpei
Nakamura, Atsuyoshi
Kudo, Mineichi
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (11): : 2470 - 2486
[29] Multi-armed bandits for performance marketing
Gigli, Marco
Stella, Fabio
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
[30] Lenient Regret for Multi-Armed Bandits
Merlis, Nadav
Mannor, Shie
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8950 - 8957

← 1 2 3 4 5 →