Arboretum: A Planner for Large-Scale Federated Analytics with Differential Privacy

被引:0
作者
Margolin, Elizabeth [1 ]
Newatia, Karan [1 ]
Luo, Tao [1 ]
Roth, Edo [1 ]
Haeberlen, Andreas [1 ,2 ]
机构
[1] Univ Penn, Philadelphia, PA 19104 USA
[2] Roblox, San Mateo, CA USA
来源
PROCEEDINGS OF THE TWENTY-NINTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, SOSP 2023 | 2023年
关键词
federated analytics; differential privacy; query planner;
D O I
10.1145/3600006.3624566
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Federated analytics is a way to answer queries over sensitive data that is spread across multiple parties, without sharing the data or collecting it in a single place. Prior work has developed solutions that can scale to large deployments with millions of devices but, due to the distributed nature of federated analytics, these solutions can support only a limited class of queries - typically various forms of numerical queries, which can be answered with lightweight cryptographic primitives. Supporting richer queries, such as categorical queries, requires heavier cryptography, whose cost can quickly exceed even the resources of a powerful data center. In this paper, we present Arboretum, a new federated analytics system that can efficiently answer a broader range of queries, including categorical queries, in deployments with millions or even billions of participants. Arboretum achieves this by 1) automatically optimizing query plans to find highly efficient ways to answer each query, and by 2) including the participant devices in the computation. Our evaluation shows that Arboretum can match the cost of earlier systems that have been hand-optimized for particular kinds of queries, and that it can additionally support a range of new queries for which no efficient solution exists today.
引用
收藏
页码:451 / 465
页数:15
相关论文
共 61 条
[1]   Deep Learning with Differential Privacy [J].
Abadi, Martin ;
Chu, Andy ;
Goodfellow, Ian ;
McMahan, H. Brendan ;
Mironov, Ilya ;
Talwar, Kunal ;
Zhang, Li .
CCS'16: PROCEEDINGS OF THE 2016 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, :308-318
[2]  
Acs G., 2011, Information Hiding
[3]  
Agarwal N, 2018, ADV NEUR IN, V31
[4]  
Albrecht Martin., 2018, Homomorphic encryption security standard
[5]  
[Anonymous], 2011, P NETWORK DISTRIBUTE
[6]  
Apple, Differential privacy technical overview
[7]  
Bagdasaryan E, 2022, Arxiv, DOI arXiv:2111.02356
[8]  
Balle B, 2018, ADV NEUR IN, V31
[9]   Probabilistic Relational Reasoning for Differential Privacy [J].
Barthe, Gilles ;
Koepf, Boris ;
Olmedo, Federico ;
Zanella-Beguelin, Santiago .
ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2013, 35 (03)
[10]   SMCQL: Secure Querying for Federated Databases [J].
Bater, Johes ;
Elliott, Gregory ;
Eggen, Craig ;
Goel, Satyender ;
Kho, Abel ;
Rogers, Jennie .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (06) :673-684