Efficient Characterization of Hidden Processor Memory Hierarchies

被引:1
|
作者
Cooper, Keith [1 ]
Xu, Xiaoran [1 ]
机构
[1] Rice Univ, Houston, TX 77005 USA
来源
COMPUTATIONAL SCIENCE - ICCS 2018, PT III | 2018年 / 10862卷
关键词
Efficient characterization; Hidden memory hierarchies; Code performance; Portable tool; CACHE;
D O I
10.1007/978-3-319-93713-7_27
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A processor's memory hierarchy has a major impact on the performance of running code. However, computing platforms, where the actual hardware characteristics are hidden from both the end user and the tools that mediate execution, such as a compiler, a JIT and a runtime system, are used more and more, for example, performing large scale computation in cloud and cluster. Even worse, in such environments, a single computation may use a collection of processors with dissimilar characteristics. Ignorance of the performance-critical parameters of the underlying system makes it difficult to improve performance by optimizing the code or adjusting runtime-system behaviors; it also makes application performance harder to understand. To address this problem, we have developed a suite of portable tools that can efficiently derive many of the parameters of processor memory hierarchies, such as levels, effective capacity and latency of caches and TLBs, in a matter of seconds. The tools use a series of carefully considered experiments to produce and analyze cache response curves automatically. The tools are inexpensive enough to be used in a variety of contexts that may include install time, compile time or runtime adaption, or performance understanding tools.
引用
收藏
页码:335 / 349
页数:15
相关论文
共 47 条
  • [1] An extended analysis of memory hierarchies for efficient implementations of image processing applications
    Christian Hartmann
    Dietmar Fey
    Journal of Real-Time Image Processing, 2018, 14 : 713 - 728
  • [2] An extended analysis of memory hierarchies for efficient implementations of image processing applications
    Hartmann, Christian
    Fey, Dietmar
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2018, 14 (03) : 713 - 728
  • [3] An analytical model for designing memory hierarchies
    Jacob, BL
    Chen, PM
    Silverman, SR
    Mudge, TN
    IEEE TRANSACTIONS ON COMPUTERS, 1996, 45 (10) : 1180 - 1194
  • [4] Energy Efficient 3D Hybrid Processor-Memory Architecture for the Dark Silicon Age
    Niknam, Sobhan
    Asad, Arghavan
    Fathy, Mahmood
    Rahmani, Amir-Mohammad
    2015 10TH INTERNATIONAL SYMPOSIUM ON RECONFIGURABLE COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC), 2015,
  • [5] Impact of Processor Cache Memory on Storage Performance
    Kim, Young-kuen
    Song, Yong Ho
    PROCEEDINGS INTERNATIONAL SOC DESIGN CONFERENCE 2017 (ISOCC 2017), 2017, : 304 - 305
  • [6] THE EFFECTS OF PROCESSOR ARCHITECTURE ON INSTRUCTION MEMORY TRAFFIC
    MITCHELL, CL
    FLYNN, MJ
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1990, 8 (03): : 230 - 250
  • [7] A Performance Degradation Tolerable Cache Design by Exploiting Memory Hierarchies
    Hsieh, Tong-Yu
    Wang, Chih-Hao
    Chih, Tsung-Liang
    Chi, Ya-Hsiu
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2016, 24 (02) : 784 - 788
  • [8] CACHE HIERARCHY AND MEMORY SUBSYSTEM OF THE AMD OPTERON PROCESSOR
    Conway, Pat
    Kalyanasundharam, Nathan
    Donley, Gregg
    Lepak, Kevin
    Hughes, Bill
    IEEE MICRO, 2010, 30 (02) : 16 - 29
  • [9] Revolver: Processor Architecture for Power Efficient Loop Execution
    Hayenga, Mitchell
    Naresh, Vignyan Reddy Kothinti
    Lipasti, Mikko H.
    2014 20TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA-20), 2014, : 591 - 602
  • [10] A Lightweight Memory Encryption Cache Design and Implementation for Embedded Processor
    Liu, Zhenglin
    Huo, Wenjie
    Zou, Xuecheng
    Lin, Yingyan
    PROCEEDINGS OF THE 2009 12TH INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS (ISIC 2009), 2009, : 388 - 391