Shuffle Differential Private Data Aggregation for Random Population

被引:7
|
作者
Wang, Shaowei [1 ]
Luo, Xuandi [1 ]
Qian, Yuqiu [2 ]
Zhu, Youwen [4 ]
Chen, Kongyang [1 ,3 ]
Chen, Qi [1 ]
Xin, Bangzhou [5 ]
Yang, Wei [5 ]
机构
[1] Guangzhou Univ, Inst Artificial Intelligence & Blockchain, Guangzhou 511442, Peoples R China
[2] Tencent Inc, Interact Entertainment Grp, Shenzhen, Guangdong, Peoples R China
[3] Pazhou Lab, Guangzhou 510330, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Sch Comp Sci & Technol, Nanjing 210016, Jiangsu, Peoples R China
[5] Univ Sci & Technol China, Dept Comp Sci & Technol, Hefei 230052, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Sociology; Privacy; Differential privacy; Data models; Servers; Protocols; Data aggregation; data privacy; differential privacy; shuffle privacy; statistical estimation; UTILITY;
D O I
10.1109/TPDS.2023.3247541
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Bridging the advantages of differential privacy in both centralized model (i.e., high accuracy) and local model (i.e., minimum trust), the shuffle privacy model has potential applications in many privacy-sensitive scenarios, such as mobile user data aggregation and federated learning. Since messages from users are anonymized by semi-trusted shufflers (e.g., anonymous channels, edge servers), every user could hide message among other users' messages and inject only part of noises (a.k.a. privacy amplification). However, existing works assume that the participating user population is known in advance, which is unrealistic for dynamic environments (e.g., mobile computing, vehicular networks). In this work, we study the shuffle privacy model with a random participating population, and give privacy amplification bounds for population size with commonly encountered binomial, Poisson, sub-Gaussian distribution and etc. For further improving accuracy, we formulate and derive optimal dummy sizes for both non-adaptive and adaptive dummies. Finally, to break the error barrier due to the constraint of sending one single message per user, we design a multi-message shuffle private protocol supporting random population. Experiment results show that our approaches reduce more than 60% error when compared to the local model and naive approaches. We hope this work provides tailored solutions of shuffle privacy for dynamic mobile/distributed computing.
引用
收藏
页码:1667 / 1681
页数:15
相关论文
共 50 条
  • [21] Synthesizing Realistic Trajectory Data With Differential Privacy
    Sun, Xinyue
    Ye, Qingqing
    Hu, Haibo
    Wang, Yuandong
    Huang, Kai
    Wo, Tianyu
    Xu, Jie
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 5502 - 5515
  • [22] Benchmarking Evaluation Protocols for Classifiers Trained on Differentially Private Synthetic Data
    Movahedi, Parisa
    Nieminen, Valtteri
    Perez, Ileana Montoya
    Daafane, Hiba
    Sukhwal, Dishant
    Pahikkala, Tapio
    Airola, Antti
    IEEE ACCESS, 2024, 12 : 118637 - 118648
  • [23] An Assessment of the Application of Private Aggregation of Ensemble Models to Sensible Data
    Yovine, Sergio
    Mayr, Franz
    Sosa, Sebastian
    Visca, Ramiro
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2021, 3 (04): : 788 - 801
  • [24] A Differentially Private Data Aggregation Method Based on Worker Partition and Location Obfuscation for Mobile Crowdsensing
    Li, Shuyu
    Zhang, Guozheng
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (01): : 223 - 241
  • [25] A Survey on Privacy Enhanced Role Based Data Aggregation via Differential Privacy
    Shaikh, Azharuddin
    Patil, Shruti
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMMUNICATION AND COMPUTING TECHNOLOGY (ICACCT), 2018, : 285 - 290
  • [26] A lightweight data aggregation scheme achieving privacy preservation and data integrity with differential privacy and fault tolerance
    Bao, Haiyong
    Lu, Rongxing
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2017, 10 (01) : 106 - 121
  • [27] A lightweight data aggregation scheme achieving privacy preservation and data integrity with differential privacy and fault tolerance
    Haiyong Bao
    Rongxing Lu
    Peer-to-Peer Networking and Applications, 2017, 10 : 106 - 121
  • [28] Verifiable Federated Learning With Privacy-Preserving Data Aggregation for Consumer Electronics
    Xie, Haoran
    Wang, Yujue
    Ding, Yong
    Yang, Changsong
    Zheng, Haibin
    Qin, Bo
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 2696 - 2707
  • [29] Differential Privacy for Protecting Private Patterns in Data Streams
    Gu, He
    Plagemann, Thomas
    Benndorf, Maik
    Goebel, Vera
    Koldehofe, Boris
    2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 118 - 124
  • [30] EPPDA: An Efficient Privacy-Preserving Data Aggregation Federated Learning Scheme
    Song, Jingcheng
    Wang, Weizheng
    Gadekallu, Thippa Reddy
    Cao, Jianyu
    Liu, Yining
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 3047 - 3057