Observing the Invisible: Live Cache Inspection for High-Performance Embedded Systems

Cited by: 3

Authors:

Tarapore, Dharmesh [1]

Roozkhosh, Shahin [1]

Brzozowski, Steven [1]

Mancuso, Renato [1]

Affiliations:

[1] Boston Univ, Boston, MA 02215 USA

Source:

IEEE TRANSACTIONS ON COMPUTERS | 2022, Vol. 71, No. 03

Funding:

U.S. National Science Foundation;

Keywords:

Hardware; Software; Program processors; Central Processing Unit; Random access memory; Phasor measurement units; Analytical models; Cache; cache snapshotting; ramindex; cacheflow; cache debugging; SIMULATION;

DOI:

10.1109/TC.2021.3060650

Chinese Library Classification (CLC):

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

The vast majority of high-performance embedded systems implement multi-level CPU cache hierarchies. But the exact behavior of these CPU caches has historically been opaque to system designers. Absent expensive hardware debuggers, an understanding of cache makeup remains tenuous at best. This enduring opacity further obscures the complex interplay among applications and OS-level components, particularly as they compete for the allocation of cache resources. Notwithstanding the relegation of cache comprehension to proxies such as static cache analysis, performance counter-based profiling, and cache hierarchy simulations, the underpinnings of cache structure and evolution continue to elude software-centric solutions. In this article, we explore a novel method of studying cache contents and their evolution via snapshotting. Our method complements extant approaches for cache profiling to better formulate, validate, and refine hypotheses on the behavior of modern caches. We leverage cache introspection interfaces provided by vendors to perform live cache inspections without the need for external hardware. We present CacheFlow, a proof-of-concept Linux kernel module which snapshots cache contents on an NVIDIA Tegra TX1 system on chip and a Hardkernel Odroid XU4.

Citation

Pages: 559-572

Page count: 14
