F1: A Fast and Programmable Accelerator for Fully Homomorphic Encryption

被引:116
作者
Feldmann, Axel [1 ]
Samardzic, Nikola [1 ]
Krastev, Aleksandar [1 ]
Devadas, Srini [1 ]
Dreslinski, Ron [2 ]
Peikert, Christopher [2 ]
Sanchez, Daniel [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
来源
PROCEEDINGS OF 54TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2021 | 2021年
关键词
fully homomorphic encryption; hardware acceleration;
D O I
10.1145/3466752.3480070
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Fully Homomorphic Encryption (FHE) allows computing on encrypted data, enabling secure offloading of computation to untrusted servers. Though it provides ideal security, FHE is expensive when executed in software, 4 to 5 orders of magnitude slower than computing on unencrypted data. These overheads are a major barrier to FHE's widespread adoption. We present F1, the first FHE accelerator that is programmable, i.e., capable of executing full FHE programs. F1 builds on an indepth architectural analysis of the characteristics of FHE computations that reveals acceleration opportunities. F1 is a wide-vector processor with novel functional units deeply specialized to FHE primitives, such as modular arithmetic, number-theoretic transforms, and structured permutations. This organization provides so much compute throughput that data movement becomes the key bottleneck. Thus, F1 is primarily designed to minimize data movement. Hardware provides an explicitly managed memory hierarchy and mechanisms to decouple data movement from execution. A novel compiler leverages these mechanisms to maximize reuse and schedule off-chip and on-chip data movement. We evaluate F1 using cycle-accurate simulation and RTL synthesis. F1 is the first system to accelerate complete FHE programs, and outperforms state-of-the-art software implementations by gmean 5,400x and by up to 17,000x. These speedups counter most of FHE's overheads and enable new applications, like real-time private deep learning in the cloud.
引用
收藏
页码:238 / 252
页数:15
相关论文
共 78 条
[1]   Implementation and Performance Evaluation of RNS Variants of the BFV Homomorphic Encryption Scheme [J].
Al Badawi, Ahmad ;
Polyakov, Yuriy ;
Aung, Khin Mi Mi ;
Veeravalli, Bharadwaj ;
Rohloff, Kurt .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (02) :941-956
[2]  
Albrecht M., 2018, Technical report
[3]  
Alperin-Sheriff J, 2013, LECT NOTES COMPUT SC, V8042, P1, DOI 10.1007/978-3-642-40041-4_1
[4]  
Altavilla Dave, 2021, Intel and Microsoft Collaborate on DARPA Program that Pioneers A New Frontier Of Ultra-Secure Computing
[5]  
[Anonymous], 2020, FINANCIAL IMPACT INT
[6]  
Asanovic K, 1998, Ph. D. Dissertation
[7]   FFTS IN EXTERNAL OR HIERARCHICAL MEMORY [J].
BAILEY, DH .
PROCEEDINGS : SUPERCOMPUTING 89, 1989, :234-242
[8]  
Barany Gergo, 2011, 9 WORKSHOP OPTIMIZAT
[9]   A STUDY OF REPLACEMENT ALGORITHMS FOR A VIRTUAL-STORAGE COMPUTER [J].
BELADY, LA .
IBM SYSTEMS JOURNAL, 1966, 5 (02) :78-&
[10]  
Bergamaschi Flavio, 2020, IBM Releases Fully Homomorphic Encryption Toolkit for MacOS and iOS