Location-Aware Cache Management for Many-Core Processors with Deep Cache Hierarchy

被引:1
|
作者
Park, Jongsoo [1 ]
Yoo, Richard M. [1 ]
Khudia, Daya S. [2 ]
Hughes, Christopher J. [1 ]
Kim, Daehyun [1 ]
机构
[1] Intel Corp, Parallel Comp Lab, Santa Clara, CA 95051 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
Energy-Efficient Memory Hierarchy; Producer-Consumer Communication; Reuse Distance; Streaming Memory Accesses; MEMORY;
D O I
10.1145/2503210.2503224
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As cache hierarchies become deeper and the number of cores on a chip increases, managing caches becomes more important for performance and energy. However, current hardware cache management policies do not always adapt optimally to the applications behavior: e.g., caches may be polluted by data structures whose locality cannot be captured by the caches, and producer-consumer communication incurs multiple round trips of coherence messages per cache line transferred. We propose load and store instructions that carry hints regarding into which cache(s) the accessed data should be placed. Our instructions allow software to convey locality information to the hardware, while incurring minimal hardware cost and not affecting correctness. Our instructions provide a 1.07x speedup and a 1.24x energy efficiency boost, on average, according to simulations on a 64-core system with private L1 and L2 caches. With a large shared L3 cache added, the benefits increase, providing 1.33 x energy reduction on average.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Victim management in a cache hierarchy
    Franaszek, P. A.
    Lastras-Montano, L. A.
    Kunkel, S. R.
    Sawdey, A. C.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2006, 50 (4-5) : 507 - 523
  • [32] Power Multiplexing for Thermal Field Management in Many-Core Processors
    Cho, Minki
    Kersey, Chad
    Gupta, Man Prakash
    Sathe, Nikhil
    Kumar, Satish
    Yalamanchili, Sudhakar
    Mukhopadhyay, Saibal
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2013, 3 (01): : 94 - 104
  • [33] Economic models for many-core processors
    Kumar, Rakesh
    DR DOBBS JOURNAL, 2008, 33 (03): : 10 - 10
  • [34] Thermally Adaptive Cache Access Mechanisms for 3D Many-Core Architectures
    Xiao, He
    Yueh, Wen
    Mukhopadhyay, Saibal
    Yalamanchili, Sudhakar
    IEEE COMPUTER ARCHITECTURE LETTERS, 2016, 15 (02) : 129 - 132
  • [35] Cache-based high-level simulation of microthreaded many-core architectures
    Uddin, Irfan
    Poss, Raphael
    Jesshope, Chris
    JOURNAL OF SYSTEMS ARCHITECTURE, 2014, 60 (07) : 529 - 552
  • [36] Cache Locality-Centric Parallel String Matching on Many-Core Accelerator Chips
    Tran, Nhat-Phuong
    Lee, Myungho
    Choi, Dong Hoon
    SCIENTIFIC PROGRAMMING, 2015, 2015
  • [37] Cache-Oblivious MPI All-to-All Communications on Many-Core Architectures
    Li, Shigang
    Zhang, Yunquan
    Hoefler, Torsten
    ACM SIGPLAN NOTICES, 2017, 52 (08) : 445 - 446
  • [38] Sharing-aware Efficient Private Caching in Many-core Server Processors
    Shukla, Sudhanshu
    Chaudhuri, Mainak
    2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2017, : 485 - 492
  • [39] Performance-aware cache management for energy-harvesting nonvolatile processors
    Yan Wang
    Kenli Li
    Xia Deng
    Keqin Li
    The Journal of Supercomputing, 2022, 78 : 3425 - 3447
  • [40] Thread-aware Dynamic Shared Cache Compression in Multi-core Processors
    Xie, Yuejian
    Loh, Gabriel H.
    2011 IEEE 29TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2011, : 135 - 141