BOLT: Privacy-Preserving, Accurate and Efficient Inference for Transformers

被引:9
作者
Pang, Qi [1 ]
Zhu, Jinhao [2 ]
Moellering, Helen M. [3 ]
Zheng, Wenting [1 ]
Schneider, Thomas [3 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
[3] Tech Univ Darmstadt, Darmstadt, Germany
来源
45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024 | 2024年
基金
欧盟地平线“2020”;
关键词
secure multi-party computation; homomorphic encryption; secure machine learning inference; transformer;
D O I
10.1109/SP54263.2024.00130
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The advent of transformers has brought about significant advancements in traditional machine learning tasks. However, their pervasive deployment has raised concerns about the potential leakage of sensitive information during inference. Existing approaches using secure multiparty computation (MPC) face limitations when applied to transformers due to the extensive model size and resource-intensive matrix-matrix multiplications. In this paper, we present BOLT, a privacy-preserving inference framework for transformer models that supports efficient matrix multiplications and nonlinear computations. Combined with our novel machine learning optimizations, BOLT reduces the communication cost by 10.91x. Our evaluation on diverse datasets demonstrates that BOLT maintains comparable accuracy to floating-point models and achieves 4.8-9.5x faster inference across various network settings compared to the state-of-the-art system.
引用
收藏
页码:4753 / 4771
页数:19
相关论文
共 50 条
[41]   Efficient Privacy-Preserving Federated Learning With Improved Compressed Sensing [J].
Zhang, Yifan ;
Miao, Yinbin ;
Li, Xinghua ;
Wei, Linfeng ;
Liu, Zhiquan ;
Choo, Kim-Kwang Raymond ;
Deng, Robert H. .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (03) :3316-3326
[42]   An Efficient Framework for Privacy-Preserving Computations on Encrypted IoT Data [J].
Ramesh, Shruthi ;
Govindarasu, Manimaran .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (09) :8700-8708
[43]   An efficient privacy-preserving aggregation and billing protocol for smart grid [J].
Wang, Xiao-Fen ;
Mu, Yi ;
Chen, Rong-Mao .
SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (17) :4536-4547
[44]   An efficient privacy-preserving comparison protocol in smart metering systems [J].
Nateghizad M. ;
Erkin Z. ;
Lagendijk R.L. .
Eurasip Journal on Information Security, 2016, 2016 (01)
[45]   An efficient privacy-preserving friendship-based recommendation system [J].
Ou, Bingpeng ;
Guo, Jingjing ;
Tao, Xiaoling .
INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2019, 11 (04) :516-525
[46]   Privacy-Preserving Neural Network Inference Framework via Homomorphic Encryption and SGX [J].
Xiao, Huizi ;
Zhang, Qingyang ;
Pei, Qingqi ;
Shi, Weisong .
2021 IEEE 41ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2021), 2021, :751-761
[47]   SecureGPT: A Framework for Multi-Party Privacy-Preserving Transformer Inference in GPT [J].
Zeng, Chenkai ;
He, Debiao ;
Feng, Qi ;
Yang, Xiaolin ;
Luo, Qingcai .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 :9480-9493
[48]   Optimizations of Privacy-Preserving DNN for Low-Latency Inference on Encrypted Data [J].
Lee, Hyunhoon ;
Lee, Youngjoo .
IEEE ACCESS, 2023, 11 :104775-104788
[49]   PhD Forum: Efficient Privacy-Preserving Processing via Memory-Centric Computing [J].
Mwaisela, Mpoki .
2024 43RD INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, SRDS 2024, 2024, :322-325
[50]   PRIVACY-PRESERVING STATISTICAL ANALYSIS ON HEALTH DATA [J].
Samet, Saeed .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCES ON E-HEALTH 2015 E-COMMERCE AND DIGITAL MARKETING 2015 AND INFORMATION SYSTEMS POST-IMPLEMENTATION AND CHANGE MANAGEMENT 2015, 2015, :3-9