A Differentially Private Approach for Budgeted Combinatorial Multi-Armed Bandits

被引:0
|
作者
Wang, Hengzhi [1 ]
Cui, Laizhong [1 ]
Wang, En [2 ,3 ]
Liu, Jiangchuan [4 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Jilin Univ, Dept Comp Sci & Technol, Changchun 130012, Peoples R China
[3] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
[4] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
基金
中国国家自然科学基金;
关键词
Combinatorial multi-armed bandits; differential privacy; online learning; truthfulness;
D O I
10.1109/TDSC.2024.3401836
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As a fundamental tool for sequential decision-making, the Combinatorial Multi-Armed Bandits model (CMAB) has been extensively analyzed and applied in various online applications. However, the privacy concerns in budgeted CMAB are rarely investigated thus far. Few bandit algorithms have adequately addressed the privacy-preserving budgeted CMAB setting. Motivated by this, we study this setting using differential privacy as the formal measure of privacy. In this setting, playing an arm yields both a random reward and a random cost, and these values are kept private. In addition, multiple arms can be played in each round. The objective of the decision-maker is to minimize regret while subject to a budget constraint on the cumulative cost of all played arms. We demonstrate an exploration-exploitation-balanced bandit policy, which preserves the privacy of both rewards and costs under budgeted CMAB settings. This policy is proven differentially private and achieves an upper bound on regret. Furthermore, to provide incentives for the differentially private bandit policy so as to ensure that the reported costs are truthful, we introduce the concept of truthfulness and incorporate a payment mechanism that has been proven to be $\sigma$sigma-truthful. Numerical simulations based on multiple real-world datasets validate the theoretical findings and demonstrate the effectiveness of our policy compared to state-of-the-art policies.
引用
收藏
页码:424 / 439
页数:16
相关论文
共 50 条
  • [21] Combinatorial Multi-armed Bandits for Real-Time Strategy Games
    Ontanon, Santiago
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2017, 58 : 665 - 702
  • [22] Combinatorial Multi-Armed Bandits in Cognitive Radio Networks: A Brief Overview
    Kang, Sunjung
    Joo, Changhee
    2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 1086 - 1088
  • [23] Diversity-Based Recruitment in Crowdsensing by Combinatorial Multi-Armed Bandits
    Sawwan, Abdalaziz
    Wu, Jie
    TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (02): : 732 - 747
  • [24] On a new approach to the analysis of complex multi-armed bandits
    R. Garbe
    K. D. Glazebrook
    Mathematical Methods of Operations Research, 1998, 48 : 419 - 442
  • [25] Combinatorial multi-armed bandits for real-time strategy games
    Ontañón, Santiago (santi@cs.drexel.edu), 1600, AI Access Foundation (58):
  • [26] Transfer Learning in Multi-Armed Bandits: A Causal Approach
    Zhang, Junzhe
    Bareinboim, Elias
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1340 - 1346
  • [27] On a new approach to the analysis of complex multi-armed bandits
    Garbe, R
    Glazebrook, KD
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1998, 48 (03) : 419 - 442
  • [28] KL-UCB-Based Policy for Budgeted Multi-Armed Bandits with Stochastic Action Costs
    Watanabe, Ryo
    Komiyama, Junpei
    Nakamura, Atsuyoshi
    Kudo, Mineichi
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (11): : 2470 - 2486
  • [29] Multi-armed bandits for performance marketing
    Gigli, Marco
    Stella, Fabio
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [30] Lenient Regret for Multi-Armed Bandits
    Merlis, Nadav
    Mannor, Shie
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8950 - 8957