BOLT: Privacy-Preserving, Accurate and Efficient Inference for Transformers

被引:2
|
作者
Pang, Qi [1 ]
Zhu, Jinhao [2 ]
Moellering, Helen M. [3 ]
Zheng, Wenting [1 ]
Schneider, Thomas [3 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
[3] Tech Univ Darmstadt, Darmstadt, Germany
来源
45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024 | 2024年
基金
欧盟地平线“2020”;
关键词
secure multi-party computation; homomorphic encryption; secure machine learning inference; transformer;
D O I
10.1109/SP54263.2024.00130
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The advent of transformers has brought about significant advancements in traditional machine learning tasks. However, their pervasive deployment has raised concerns about the potential leakage of sensitive information during inference. Existing approaches using secure multiparty computation (MPC) face limitations when applied to transformers due to the extensive model size and resource-intensive matrix-matrix multiplications. In this paper, we present BOLT, a privacy-preserving inference framework for transformer models that supports efficient matrix multiplications and nonlinear computations. Combined with our novel machine learning optimizations, BOLT reduces the communication cost by 10.91x. Our evaluation on diverse datasets demonstrates that BOLT maintains comparable accuracy to floating-point models and achieves 4.8-9.5x faster inference across various network settings compared to the state-of-the-art system.
引用
收藏
页码:4753 / 4771
页数:19
相关论文
共 50 条
  • [1] EPIDL: Towards efficient and privacy-preserving inference in deep learning
    Nie, Chenfei
    Zhou, Zhipeng
    Dong, Mianxiong
    Ota, Kaoru
    Li, Qiang
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (14):
  • [2] PPCNN: An efficient privacy-preserving CNN training and inference framework
    Zhao, Fan
    Li, Zhi
    Wang, Hao
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 10988 - 11018
  • [3] PPTIF: Privacy-Preserving Transformer Inference Framework for Language Translation
    Liu, Yanxin
    Su, Qianqian
    IEEE ACCESS, 2024, 12 : 48881 - 48897
  • [4] Privacy-Preserving Deep Learning and Inference
    Riazi, M. Sadegh
    Koushanfar, Farinaz
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,
  • [5] Efficient and Privacy-Preserving Outsourcing of Gradient Boosting Decision Tree Inference
    Yuan, Shuai
    Li, Hongwei
    Qian, Xinyuan
    Hao, Meng
    Zhai, Yixiao
    Xu, Guowen
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 2334 - 2348
  • [6] PRIVACY-PRESERVING OLAP FOR ACCURATE ANSWER
    Zhu, Youwen
    Huang, Liusheng
    Takagi, Tsuyoshi
    Zhang, Mingwu
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2012, 21 (01)
  • [7] Privacy-preserving inference resistant to model extraction attacks
    Byun, Junyoung
    Choi, Yujin
    Lee, Jaewook
    Park, Saerom
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
  • [8] Novel and Efficient Privacy-Preserving Continuous Authentication
    Baig, Ahmed Fraz
    Eskeland, Sigurd
    Yang, Bian
    CRYPTOGRAPHY, 2024, 8 (01)
  • [9] Efficient privacy-preserving face verification scheme
    Huang, Hai
    Wang, Luyao
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2021, 63
  • [10] An efficient and practical approach for privacy-preserving Naive Bayes classification
    Vu, Duy-Hien
    Vu, Trong-Sinh
    Luong, The-Dung
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 68