PSSM: Achieving Secure Memory for GPUs with Partitioned and Sectored Security Metadata

被引:10
作者
Yuan, Shougang [1 ]
Yan Solihin [2 ]
Zhou, Huiyang [1 ]
机构
[1] North Carolina State Univ, Raleigh, NC 27695 USA
[2] Univ Cent Florida, Orlando, FL 32816 USA
来源
PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2021 | 2021年
关键词
GPUs; secure memory; memory encryption; memory integrity; metadata cache; PERFORMANCE; ENCRYPTION;
D O I
10.1145/3447818.3460374
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we investigate the secure memory architecture for GPUs and point out that conventional CPU secure memory architecture can not be directly adopted to the GPUs. The key reasons include: (1) accessing the security metadata, including encryption counters, message authentication codes (MACs) and integrity trees, requires significant memory bandwidth, which may lead to severe bandwidth competition with normal data accesses and degrade the GPU performance; (2) contemporary GPUs use partitioned memory organization, which results in storage and coherence problems for encryption counters and integrity trees since different partitions may need to update the same counter/integrity tree blocks; and (3) the existing split-counter block organization is not friendly to sectored caches, which are commonly used in GPU for bandwidth savings. Based on these observations, we propose partitioned and sectored security metadata (PSSM), which has two components: (a) using the offset addresses (referred to as local addresses) within each partition, instead of the virtual or physical addresses, to generate the metadata so as to solve the counter or integrity tree storage and coherence problems and (b) reorganizing the security metadata to make them friendly to the sectored cache structure so as to reduce the memory bandwidth consumption of metadata accesses. With these proposed schemes, the performance overhead of secure GPU memory is reduced from 59.22% to 16.84% on average. If only memory encryption is required, the performance overhead is reduced from 29.53% to 5.18%.
引用
收藏
页码:139 / 151
页数:13
相关论文
共 39 条
[1]   Bounding the Effect of Partition Camping in GPU Kernels [J].
Aji, Ashwin M. ;
Daga, Mayank ;
Feng, Wu-chun .
PROCEEDINGS OF THE 2011 8TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF 2011), 2011,
[2]  
[Anonymous], 2003, ACM INT C SUPERCOMPU
[3]   Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations [J].
Anzt, Hartwig ;
Tsai, Yuhsiang M. ;
Abdelfattah, Ahmad ;
Cojean, Terry ;
Dongarra, Jack .
PROCEEDINGS OF 2020 IEEE/ACM PERFORMANCE MODELING, BENCHMARKING AND SIMULATION OF HIGH PERFORMANCE COMPUTER SYSTEMS (PMBS 2020), 2020, :26-38
[4]  
Che SA, 2009, I S WORKL CHAR PROC, P44, DOI 10.1109/IISWC.2009.5306797
[5]   Making Secure Processors OS- and Performance-Friendly [J].
Chhabra, Siddhartha ;
Rogers, Brian ;
Solihin, Yan ;
Prvulovic, Milos .
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2008, 5 (04)
[6]  
Costan V., 2016, IACR Cryptology ePrint Archive
[7]  
DAN GOODIN, 2019, RAMBleed sidechannel attack works even when DRAM is protected by error-correcting code
[8]   Electro-Magnetic Analysis of GPU-based AES Implementation [J].
Gao, Yiwen ;
Zhang, Hailong ;
Cheng, Wei ;
Zhou, Yongbin ;
Cao, Yuchen .
2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
[9]   Optimizing and Auto-tuning Belief Propagation on the GPU [J].
Grauer-Gray, Scott ;
Cavazos, John .
LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2011, 6548 :121-135
[10]   Memory Encryption for General-Purpose Processors [J].
Gueron, Shay .
IEEE SECURITY & PRIVACY, 2016, 14 (06) :54-62