Locally Differentially Private Heavy Hitter Identification

Cited by: 53
Authors
Wang, Tianhao [1]
Li, Ninghui [1]
Jha, Somesh [2]
Affiliations
[1] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[2] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
Keywords
Protocols; Frequency estimation; Differential privacy; Frequency-domain analysis; Estimation; Privacy; Sociology; Local differential privacy; heavy hitter
DOI
10.1109/TDSC.2019.2927695
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The notion of Local Differential Privacy (LDP) enables users to answer sensitive questions while preserving their privacy. The basic LDP frequency oracle protocol enables the aggregator to estimate the frequency of any value. However, when the domain of input values is large, finding the most frequent values (the heavy hitters) by estimating the frequency of every possible value is computationally infeasible. In this paper, we propose an LDP protocol for identifying heavy hitters. In our proposed protocol, which we call Prefix Extending Method (PEM), users are divided into groups, and each user in a group reports a prefix of her value. We analyze how to choose optimal parameters for the protocol and identify two principles for designing LDP protocols with high utility. Experiments show that under the same privacy guarantee and computational cost, PEM has better utility on both synthetic and real-world datasets than existing solutions.
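The prefix-extending idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes generalized randomized response (GRR) as a stand-in frequency oracle, prefix lengths that grow by a fixed step from one group to the next, and a fixed number k of candidate prefixes kept after each group. All function names, parameters, and the toy data are hypothetical.

```python
import math
import random
from collections import Counter


def grr_perturb(value, domain_size, epsilon, rng):
    """Generalized randomized response over {0, ..., domain_size - 1}."""
    e = math.exp(epsilon)
    p = e / (e + domain_size - 1)
    if rng.random() < p:
        return value
    # Otherwise report a uniformly chosen value different from the true one.
    other = rng.randrange(domain_size - 1)
    return other if other < value else other + 1


def grr_estimate(count, n, domain_size, epsilon):
    """Unbiased frequency estimate from the raw count of one reported value."""
    e = math.exp(epsilon)
    p = e / (e + domain_size - 1)
    q = 1.0 / (e + domain_size - 1)
    return (count / n - q) / (p - q)


def pem_heavy_hitters(values, bits, k, epsilon, start=2, step=2, seed=0):
    """Sketch of prefix extension: each group reports a longer prefix.

    Assumption: every user belongs to exactly one group and reports once.
    """
    rng = random.Random(seed)
    users = list(values)
    rng.shuffle(users)

    # Prefix lengths handled by successive groups, ending at the full length.
    lengths = list(range(start, bits + 1, step))
    if lengths[-1] != bits:
        lengths.append(bits)
    groups = [users[i::len(lengths)] for i in range(len(lengths))]

    candidates = list(range(2 ** lengths[0]))
    prev_len = lengths[0]
    for length, group in zip(lengths, groups):
        if length != lengths[0]:
            # Extend every surviving prefix by all possible suffix bits.
            ext = length - prev_len
            candidates = [(c << ext) | s
                          for c in candidates for s in range(2 ** ext)]
        domain = 2 ** length
        # Each user perturbs the leading `length` bits of her value.
        reports = Counter(
            grr_perturb(v >> (bits - length), domain, epsilon, rng)
            for v in group
        )
        # The aggregator estimates only the candidate prefixes.
        est = {c: grr_estimate(reports[c], len(group), domain, epsilon)
               for c in candidates}
        candidates = sorted(candidates, key=est.get, reverse=True)[:k]
        prev_len = length
    return candidates


# Toy data (hypothetical): two planted heavy hitters plus uniform noise.
demo_rng = random.Random(1)
values = ([178] * 2400 + [71] * 1800
          + [demo_rng.randrange(256) for _ in range(1800)])
found = pem_heavy_hitters(values, bits=8, k=4, epsilon=4.0)
print(found)
```

In this sketch the aggregator never enumerates all full-length values when estimating: after the first group it scores only the extensions of the k surviving prefixes, which is the computational saving the abstract refers to. GRR is used here only because it is short to write; its accuracy degrades as the prefix domain grows, so a real deployment would likely substitute a frequency oracle suited to large domains.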
Pages: 982-993
Page count: 12
Related papers
50 in total
  • [1] Differentially Private Heavy Hitter Detection using Federated Analytics
    Chadha, Karan
    Chen, Junye
    Duchi, John
    Feldman, Vitaly
    Hashemi, Hanieh
    Javidbakht, Omid
    McMillan, Audra
    Talwar, Kunal
    IEEE CONFERENCE ON SAFE AND TRUSTWORTHY MACHINE LEARNING, SATML 2024, 2024, : 512 - 533
  • [2] Locally Private Set-Valued Data Analyses: Distribution and Heavy Hitters Estimation
    Wang, Shaowei
    Li, Yuntong
    Zhong, Yusen
    Chen, Kongyang
    Wang, Xianmin
    Zhou, Zhili
    Peng, Fei
    Qian, Yuqiu
    Du, Jiachun
    Yang, Wei
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (08) : 8050 - 8065
  • [3] Locally Differentially Private Minimum Finding
    Fukuchi, Kazuto
    Yu, Chia-Mu
    Sakuma, Jun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (08) : 1418 - 1430
  • [4] Efficient protocols for heavy hitter identification with local differential privacy
    Dan Zhao
    Suyun Zhao
    Hong Chen
    Ruixuan Liu
    Cuiping Li
    Wenjuan Liang
    Frontiers of Computer Science, 2022, 16
  • [5] Efficient protocols for heavy hitter identification with local differential privacy
    Dan ZHAO
    Suyun ZHAO
    Hong CHEN
    Ruixuan LIU
    Cuiping LI
    Wenjuan LIANG
    Frontiers of Computer Science, 2022, 16 (05) : 188 - 198
  • [6] Efficient protocols for heavy hitter identification with local differential privacy
    Zhao, Dan
    Zhao, Suyun
    Chen, Hong
    Liu, Ruixuan
    Li, Cuiping
    Liang, Wenjuan
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (05)
  • [7] Practical Locally Private Heavy Hitters
    Bassily, Raef
    Nissim, Kobbi
    Stemmer, Uri
    Thakurta, Abhradeep
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [8] Re-Identification in Differentially Private Incomplete Datasets
    Sei, Yuichi
    Okumura, Hiroshi
    Ohsuga, Akihiko
    IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2022, 3 : 62 - 72
  • [9] Heavy Hitter Detection and Identification in Software Defined Networking
    Yang, Liang
    Ng, Bryan
    Seah, Winston K. G.
    2016 25TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN), 2016,
  • [10] Heavy hitter identification based on adaptive sampling with mapreduce
    Zhou, Aiping
    Cheng, Guang
    Guo, Xiaojun
    Dinh Tu Truong
    Zhu, Chengang
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2015, 30 (06) : 451 - 459