BOLT: Privacy-Preserving, Accurate and Efficient Inference for Transformers

被引：2

作者：

Pang, Qi ^{[1
]}

Zhu, Jinhao ^{[2
]}

Moellering, Helen M. ^{[3
]}

Zheng, Wenting ^{[1
]}

Schneider, Thomas ^{[3
]}

机构：

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[2] Univ Calif Berkeley, Berkeley, CA USA

[3] Tech Univ Darmstadt, Darmstadt, Germany

来源：

45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024 | 2024年

基金：

欧盟地平线“2020”;

关键词：

secure multi-party computation; homomorphic encryption; secure machine learning inference; transformer;

D O I：

10.1109/SP54263.2024.00130

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The advent of transformers has brought about significant advancements in traditional machine learning tasks. However, their pervasive deployment has raised concerns about the potential leakage of sensitive information during inference. Existing approaches using secure multiparty computation (MPC) face limitations when applied to transformers due to the extensive model size and resource-intensive matrix-matrix multiplications. In this paper, we present BOLT, a privacy-preserving inference framework for transformer models that supports efficient matrix multiplications and nonlinear computations. Combined with our novel machine learning optimizations, BOLT reduces the communication cost by 10.91x. Our evaluation on diverse datasets demonstrates that BOLT maintains comparable accuracy to floating-point models and achieves 4.8-9.5x faster inference across various network settings compared to the state-of-the-art system.

引用

页码：4753 / 4771

页数：19

共 50 条

[1] EPIDL: Towards efficient and privacy-preserving inference in deep learning
Nie, Chenfei
Zhou, Zhipeng
Dong, Mianxiong
Ota, Kaoru
Li, Qiang
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (14):
[2] PPCNN: An efficient privacy-preserving CNN training and inference framework
Zhao, Fan
Li, Zhi
Wang, Hao
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 10988 - 11018
[3] PPTIF: Privacy-Preserving Transformer Inference Framework for Language Translation
Liu, Yanxin
Su, Qianqian
IEEE ACCESS, 2024, 12 : 48881 - 48897
[4] Privacy-Preserving Deep Learning and Inference
Riazi, M. Sadegh
Koushanfar, Farinaz
2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,
[5] Efficient and Privacy-Preserving Outsourcing of Gradient Boosting Decision Tree Inference
Yuan, Shuai
Li, Hongwei
Qian, Xinyuan
Hao, Meng
Zhai, Yixiao
Xu, Guowen
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 2334 - 2348
[6] PRIVACY-PRESERVING OLAP FOR ACCURATE ANSWER
Zhu, Youwen
Huang, Liusheng
Takagi, Tsuyoshi
Zhang, Mingwu
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2012, 21 (01)
[7] Privacy-preserving inference resistant to model extraction attacks
Byun, Junyoung
Choi, Yujin
Lee, Jaewook
Park, Saerom
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
[8] Novel and Efficient Privacy-Preserving Continuous Authentication
Baig, Ahmed Fraz
Eskeland, Sigurd
Yang, Bian
CRYPTOGRAPHY, 2024, 8 (01)
[9] Efficient privacy-preserving face verification scheme
Huang, Hai
Wang, Luyao
JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2021, 63
[10] An efficient and practical approach for privacy-preserving Naive Bayes classification
Vu, Duy-Hien
Vu, Trong-Sinh
Luong, The-Dung
JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 68

← 1 2 3 4 5 →