Statistical Privacy and Consent in Data Aggregation

被引:0
|
作者
Scope, Nick [1 ]
Rasin, Alexander [1 ]
Ben Lenard [2 ]
Wagner, James [3 ]
机构
[1] DePaul Univ, Chicago, IL 60604 USA
[2] DePaul Univ, Argonne Natl Lab, Chicago, IL USA
[3] Univ New Orleans, New Orleans, LA 70148 USA
基金
美国国家科学基金会;
关键词
GDPR; Compliance; Processing consent; Privacy;
D O I
10.1145/3676288.3676298
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As new laws governing management of personal data are introduced, e.g., the European Union's General Data Protection Regulation of 2016 and the California Consumer Privacy Act of 2018, compliance with data governance legislation is becoming an increasingly important aspect of data management. An important component of many data privacy laws is that they require companies to only use an individual's data for a purpose the individual has explicitly consented to. Prior methods for enforcing consent for aggregate queries either use access control to eliminate data without consent from query evaluation or apply differential privacy algorithms to inject synthetic noise into the outcomes of queries (or input data) to ensure that the anonymity of non-consenting individuals is preserved with high probability. Both approaches return query results that differ from the ground truth results corresponding to the full input containing data from both consenting and non-consenting individuals. We present an alternative framework for group-by aggregate queries, tailored for applications, e.g., medicine, where even a small deviation from the correct answer to a query cannot be tolerated. Our approach uses provenance to determine, for each output tuple of a group-by aggregate query, which individual's data was used to derive the result for this group. We then use statistical tests to determine how likely it is that the presence of data for a non-consenting individual will be revealed by such an output tuple. We filter out tuples for which this test fails, i.e., which are deemed likely to reveal non-consenting data. Thus, our approach always returns a subset of the ground truth query answers. Our experiments successfully return only 100% accurate results in instances where access control or differential privacy would have either returned less total or less accurate results.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Blockchain for Student Data Privacy and Consent
    Gilda, Shlok
    Mehrotra, Maanav
    2018 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2018,
  • [2] On the privacy of concealed data aggregation
    Chan, Aldar C. -F.
    Castelluccia, Claude
    COMPUTER SECURITY - ESORICS 2007, PROCEEDINGS, 2007, 4734 : 390 - +
  • [3] Statistical Data Privacy: A Song of Privacy and Utility
    Slavkovic, Aleksandra
    Seeman, Jeremy
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2023, 10 : 189 - 218
  • [4] Consent to data collection: privacy policies and data collection notices
    Piccolo, Daiane Marcela
    Affonso, Elaine Parra
    Sant'Ana, Ricardo Cesar Goncalves
    BIBLIOS-REVISTA DE BIBLIOTECOLOGIA Y CIENCIAS DE LA INFORMACION, 2023, (86): : 220 - 236
  • [5] PUDA - Privacy and Unforgeability for Data Aggregation
    Leontiadis, Iraklis
    Elkhiyaoui, Kaoutar
    Onen, Melek
    Molva, Refik
    CRYPTOLOGY AND NETWORK SECURITY, CANS 2015, 2015, 9476 : 3 - 18
  • [6] On the Applications of Aggregation Operators in Data Privacy
    Torra, Vicenc
    Navarro-Arribas, Guillermo
    Abril, Daniel
    INTEGRATED UNCERTAINTY MANAGEMENT AND APPLICATIONS, 2010, 68 : 479 - 488
  • [7] An efficient and privacy-preserving data aggregation scheme supporting arbitrary statistical functions in IoT
    Liu, Haihui
    Chen, Jianwei
    Lin, Liwei
    Ye, Ayong
    Huang, Chuan
    CHINA COMMUNICATIONS, 2022, 19 (06) : 91 - 104
  • [8] An Efficient and Privacy-Preserving Data Aggregation Scheme Supporting Arbitrary Statistical Functions in IoT
    Haihui Liu
    Jianwei Chen
    Liwei Lin
    Ayong Ye
    Chuan Huang
    ChinaCommunications, 2022, 19 (06) : 91 - 104
  • [9] Preserving data privacy in outsourcing data aggregation services
    Xiong, Li
    Chitti, Subramanyam
    Liu, Ling
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2007, 7 (03)
  • [10] Privacy of Synthetic Data: A Statistical Framework
    Boedihardjo, March
    Strohmer, Thomas
    Vershynin, Roman
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (01) : 520 - 527