BOLT: Privacy-Preserving, Accurate and Efficient Inference for Transformers

被引:9
作者
Pang, Qi [1 ]
Zhu, Jinhao [2 ]
Moellering, Helen M. [3 ]
Zheng, Wenting [1 ]
Schneider, Thomas [3 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
[3] Tech Univ Darmstadt, Darmstadt, Germany
来源
45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024 | 2024年
基金
欧盟地平线“2020”;
关键词
secure multi-party computation; homomorphic encryption; secure machine learning inference; transformer;
D O I
10.1109/SP54263.2024.00130
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The advent of transformers has brought about significant advancements in traditional machine learning tasks. However, their pervasive deployment has raised concerns about the potential leakage of sensitive information during inference. Existing approaches using secure multiparty computation (MPC) face limitations when applied to transformers due to the extensive model size and resource-intensive matrix-matrix multiplications. In this paper, we present BOLT, a privacy-preserving inference framework for transformer models that supports efficient matrix multiplications and nonlinear computations. Combined with our novel machine learning optimizations, BOLT reduces the communication cost by 10.91x. Our evaluation on diverse datasets demonstrates that BOLT maintains comparable accuracy to floating-point models and achieves 4.8-9.5x faster inference across various network settings compared to the state-of-the-art system.
引用
收藏
页码:4753 / 4771
页数:19
相关论文
共 50 条
[31]   Privacy-Preserving Machine Learning (PPML) Inference for Clinically Actionable Models [J].
Balaban, Baris ;
Magara, Seyma Selcan ;
Yilgor, Caglar ;
Yucekul, Altug ;
Obeid, Ibrahim ;
Pizones, Javier ;
Kleinstueck, Frank ;
Perez-Grueso, Francisco Javier Sanchez ;
Pellise, Ferran ;
Alanay, Ahmet ;
Savas, Erkay ;
Bagci, Cetin ;
Sezerman, Osman Ugur ;
European Spine Study Group, European Spine Study .
IEEE ACCESS, 2025, 13 :37431-37456
[32]   Towards Practical Privacy-Preserving Solution for Outsourced Neural Network Inference [J].
Liu, Pinglan ;
Zhang, Wensheng .
2022 IEEE 15TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2022), 2022, :357-362
[33]   Privacy-preserving boosting [J].
Sébastien Gambs ;
Balázs Kégl ;
Esma Aïmeur .
Data Mining and Knowledge Discovery, 2007, 14 :131-170
[34]   Privacy-preserving boosting [J].
Gambs, Sebastien ;
Kegl, Balazs ;
Aimeur, Esma .
DATA MINING AND KNOWLEDGE DISCOVERY, 2007, 14 (01) :131-170
[35]   Privacy-Preserving Statistics [J].
Vaidya, Jaideep .
COMPUTER, 2018, 51 (09) :8-9
[36]   An Efficient Privacy-Preserving Outsourced Calculation Toolkit With Multiple Keys [J].
Liu, Ximeng ;
Deng, Robert H. ;
Choo, Kim-Kwang Raymond ;
Weng, Jian .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (11) :2401-2414
[37]   Efficient and Privacy-Preserving Feature Selection Based on Multiparty Computation [J].
Wang, Luyao ;
Guo, Hao ;
Wu, Weibin ;
Zhou, Lu .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 :3505-3518
[38]   Efficient Integration of Exchange Chains in Privacy-Preserving Kidney Exchange [J].
Breuer, Malte ;
Meyer, Ulrike ;
Wetzel, Susanne .
2024 21ST ANNUAL INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY AND TRUST, PST 2024, 2024, :26-35
[39]   A Strong Privacy-Preserving and Efficient Fingerprint Authentication via Clustering [J].
Liu, Jingwei ;
Zhou, Zihan ;
Sun, Rong ;
Du, Xiaojiang ;
Guizani, Mohsen .
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, :5889-5894
[40]   ELXGB: An Efficient and Privacy-Preserving XGBoost for Vertical Federated Learning [J].
Xu, Wei ;
Zhu, Hui ;
Zheng, Yandong ;
Wang, Fengwei ;
Zhao, Jiaqi ;
Liu, Zhe ;
Li, Hui .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (03) :878-892