Collecting and Analyzing Multidimensional Data with Local Differential Privacy

被引:232
|
作者
Wang, Ning [1 ]
Xiao, Xiaokui [2 ]
Yang, Yin [3 ]
Zhao, Jun [4 ]
Hui, Siu Cheung [4 ]
Shin, Hyejin [5 ]
Shin, Junbum [5 ]
Yu, Ge [6 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao, Shandong, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Hamad Bin Khalifa Univ, Div Informat & Comp Techol, Coll Sci & Engn, Doha, Qatar
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[5] Samsung Elect, Samsung Res, Seoul, South Korea
[6] Northeastern Univ, Sch Comp Sci & Engn, Shenyang, Liaoning, Peoples R China
来源
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019) | 2019年
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Local differential privacy; multidimensional data; stochastic gradient descent; NOISE;
D O I
10.1109/ICDE.2019.00063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Local differential privacy (LDP) is a recently proposed privacy standard for collecting and analyzing data, which has been used, e.g., in the Chrome browser, iOS and macOS. In LDP, each user perturbs her information locally, and only sends the randomized version to an aggregator who performs analyses, which protects both the users and the aggregator against private information leaks. Although LDP has attracted much research attention in recent years, the majority of existing work focuses on applying LDP to complex data and/or analysis tasks. In this paper, we point out that the fundamental problem of collecting multidimensional data under LDP has not been addressed sufficiently, and there remains much room for improvement even for basic tasks such as computing the mean value over a single numeric attribute under LDP. Motivated by this, we first propose novel LDP mechanisms for collecting a numeric attribute, whose accuracy is at least no worse (and usually better) than existing solutions in terms of worst-case noise variance. Then, we extend these mechanisms to multidimensional data that can contain both numeric and categorical attributes, where our mechanisms always outperform existing solutions regarding worst-case noise variance. As a case study, we apply our solutions to build an LDP-compliant stochastic gradient descent algorithm (SGD), which powers many important machine learning tasks. Experiments using real datasets confirm the effectiveness of our methods, and their advantages over existing solutions.
引用
收藏
页码:638 / 649
页数:12
相关论文
共 50 条
  • [41] A Lightweight Matrix Factorization for Recommendation With Local Differential Privacy in Big Data
    Zhou, Hao
    Yang, Geng
    Xiang, Yang
    Bai, Yunlu
    Wang, Weiya
    IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (01) : 160 - 173
  • [42] LDP-IDS: Local Differential Privacy for Infinite Data Streams
    Ren, Xuebin
    Shi, Liang
    Yu, Weiren
    Yang, Shusen
    Zhao, Cong
    Xu, Zongben
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 1064 - 1077
  • [43] A method of decreasing connectability of derived data, using local differential privacy
    Oguri, Hidenobu
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI-2019), 2019,
  • [44] Longitudinal attacks against iterative data collection with local differential privacy
    Gursoy, Mehmet Emre
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2024, 32 (01) : 198 - 218
  • [45] Research on Governmental Data Sharing Based on Local Differential Privacy Approach
    Liu, Liping
    Piao, Chunhui
    Jiang, Xuehong
    Zheng, Lijuan
    2018 IEEE 15TH INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE 2018), 2018, : 39 - 45
  • [46] Transaction Data Collection for Itemset Mining Under Local Differential Privacy
    Ouyang J.
    Yin J.
    Xiao Z.-H.
    Zhao H.-M.
    Liu S.-P.
    Liang P.
    Xiao Y.-Y.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (11): : 3541 - 3562
  • [47] Computing Aggregates Over Numeric Data with Personalized Local Differential Privacy
    Akter, Mousumi
    Hashem, Tanzima
    INFORMATION SECURITY AND PRIVACY, ACISP 2017, PT II, 2017, 10343 : 249 - 260
  • [48] BiSample: Bidirectional Sampling for Handling Missing Data with Local Differential Privacy
    Sun, Lin
    Ye, Xiaojun
    Zhao, Jun
    Lu, Chenhui
    Yang, Mengmeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 88 - 104
  • [49] PrivGMM: Probability Density Estimation with Local Differential Privacy
    Diao, Xinrong
    Yang, Wei
    Wang, Shaowei
    Huang, Liusheng
    Xu, Yang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 105 - 121
  • [50] Local Differential Privacy Is Not Enough: A Sample Reconstruction Attack Against Federated Learning With Local Differential Privacy
    You, Zhichao
    Dong, Xuewen
    Li, Shujun
    Liu, Ximeng
    Ma, Siqi
    Shen, Yulong
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 1519 - 1534