HePREM: A Predictable Execution Model for GPU-based Heterogeneous SoCs

被引:9
作者
Forsberg, Bjorn [1 ]
Benini, Luca [1 ,2 ]
Marongiu, Andrea [3 ]
机构
[1] Swiss Fed Inst Technol, Integrated Syst Lab, CH-8092 Zurich, Switzerland
[2] Univ Bologna, Elect Elect & Informat Engn Dept, I-40126 Bologna, Italy
[3] Univ Modena & Reggio Emilia, Dept Phys Informat & Math, I-41121 Modena, MO, Italy
关键词
Graphics processing units; Central Processing Unit; Interference; Random access memory; Timing; Robustness; Memory management; Real-time and embedded systems; languages and compilers; graphics processors; memory management; reliability; runtime environments; parallel systems; FRAMEWORK; SYSTEM;
D O I
10.1109/TC.2020.2980520
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The ever-increasing need for computational power in embedded devices has led to the adoption heterogeneous SoCs combining a general purpose CPU with a data parallel accelerator. These systems rely on a shared main memory (DRAM), which makes them highly susceptible to memory interference. A promising software technique to counter such effects is the Predictable Execution Model (PREM). PREM ensures robustness to interference by separating programs into a sequence of memory and compute phases, and by enforcing a platform-level schedule where only a single processing subsystem is permitted to execute a memory phase at a time. This article demonstrates for the first time how PREM can be applied to heterogeneous SoCs, based on a synchronization technique for memory isolation between CPU and GPU plus a compiler to transform GPU kernels into PREM-compliant codes. For compute bound GPU workloads sharing the DRAM bandwidth 50/50 with the CPU we guarantee near-zero timing varibility at a performance loss of just 59 percent, which is one to two orders of magnitude smaller than the worst case we see for unmodified programs under memory interference.
引用
收藏
页码:17 / 29
页数:13
相关论文
共 44 条
[1]  
Agirre I., 2017, P 12 IEEE INT S EMB, P1
[2]   Schedulability Analysis of Global Memory-predictable Scheduling [J].
Alhammad, Ahmed ;
Pellizzoni, Rodolfo .
2014 INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE (EMSOFT), 2014,
[3]  
Ali W., 2017, ARXIV171208738CSPF
[4]  
Allen Randy, 2001, OPTIMIZING COMPILERS
[5]  
[Anonymous], 2013, MISRA C 2012 GUIDELI, VMotor Industry Software Reliability Association and Motor Industry Software Reliability Association Staff
[6]  
[Anonymous], 2018, AMD EMBEDDED G SERIE
[7]  
[Anonymous], 2015, OPENMP APPL PROGR IN
[8]  
[Anonymous], 2019, FRESHPORTS SYSUTILS
[9]  
Antao Samuel F., 2016, 2016 Third Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC). Proceedings, P1, DOI 10.1109/LLVM-HPC.2016.006
[10]  
Bauer M., 2011, P 2011 INT C HIGH PE, P1, DOI DOI 10.1145/2063384.2063400