Design and Implementation of Software-Managed Caches for Multicores with Local Memory

被引:0
|
作者
Seo, Sangmin [1 ]
Lee, Jaejin [1 ]
Sura, Zehra [2 ]
机构
[1] Seoul Natl Univ, Sch Engn & Comp Sci, Seoul, South Korea
[2] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY USA
来源
HPCA-15 2009: FIFTEENTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS | 2009年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous multicores, such as Cell BE processors and GPGPUs, typically do not have caches for their accelerator cores because coherence traffic, cache misses, and latencies from different types of memory accesses add overhead and adversely affect instruction scheduling. Instead, the accelerator cores have internal local memory to place their code and data. Programmers of such heterogeneous multicore architectures must explicitly manage data transfers between the local memory of a core and the globally shared main memory. This is a tedious and error-prone programming task. A software-managed cache (SMC), implemented in local memory, can be programmed to automatically handle data transfers at runtime, thus simplifying the task of the programmer In this paper we propose a new software-managed cache design, called extended set-index cache (ESC). It has the benefits of both set-associative and fully associative caches. Its tag search speed is comparable to the set-associative cache and its miss rate is comparable to the fully associative cache. We examine various line replacement policies for SMCs, and discuss their trade-offs. In addition, we propose adaptive execution strategies that select the optimal cache line size and replacement policy for each program region at runtime. To evaluate the effectiveness of our approach, we implement the ESC and other SMC designs on the Cell BE architecture, and measure their performance with 8 OpenMP applications. The evaluation results show that the ESC outperforms other SMC designs. The results also show that our adaptive execution strategies work well with the ESC. In fact, our approach is applicable to all cores with access to both local and global memory in a multicore architecture,
引用
收藏
页码:55 / +
页数:2
相关论文
共 50 条
  • [1] Reducing Code Management Overhead in Software-Managed Multicores
    Cai, Jian
    Kim, Yooseong
    Kim, Youngbin
    Shrivastava, Aviral
    Lee, Kyoungwoo
    PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 1241 - 1244
  • [2] Employing software-managed caches in OpenACC: Opportunities and benefits
    Lashgar A.
    Baniasadi A.
    ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 2016, 1 (01)
  • [3] THE PERFORMANCE OF SOFTWARE-MANAGED MULTIPROCESSOR CACHES ON PARALLEL NUMERICAL PROGRAMS
    CHEONG, H
    VEIDENBAUM, AV
    LECTURE NOTES IN COMPUTER SCIENCE, 1988, 297 : 316 - 337
  • [4] Efficient Code Assignment Techniques for Local Memory on Software Managed Multicores
    Lu, Jing
    Bai, Ke
    Shrivastava, Aviral
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2015, 14 (04)
  • [5] DESIGN TRADEOFFS FOR SOFTWARE-MANAGED TLBS
    UHLIG, R
    NAGLE, D
    STANLEY, T
    MUDGE, T
    SECHREST, S
    BROWN, R
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1994, 12 (03): : 175 - 205
  • [6] WCET-Aware Dynamic Code Management on Scratchpads for Software-Managed Multicores
    Kim, Yooseong
    Broman, David
    Cai, Jian
    Shrivastaval, Aviral
    2014 IEEE 20TH REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATIONS SYMPOSIUM (RTAS), 2014, : 179 - 188
  • [7] Understanding the Tradeoffs between Software-Managed vs. Hardware-Managed Caches in GPUs
    Li, Chao
    Yang, Yi
    Dai, Hongwen
    Yan, Shengen
    Mueller, Frank
    Zhou, Huiyang
    2014 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2014, : 231 - 241
  • [8] A Tuning Framework for Software-Managed Memory Hierarchies
    Ren, Manman
    Park, Ji Young
    Houston, Mike
    Aiken, Alex
    Daily, William J.
    PACT'08: PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2008, : 280 - 291
  • [9] A Memory Optimization Technique for Software-Managed Scratchpad Memory in GPUs
    Moazeni, Maryam
    Bui, Alex
    Sarrafzadeh, Majid
    2009 IEEE 7TH SYMPOSIUM ON APPLICATION SPECIFIC PROCESSORS (SASP 2009), 2009, : 43 - +
  • [10] A fully associative software-managed cache design
    Hallnor, EG
    Reinhardt, SK
    PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2000, : 107 - 116