Location-Aware Cache Management for Many-Core Processors with Deep Cache Hierarchy

被引：1

作者：

Park, Jongsoo ^{[1
]}

Yoo, Richard M. ^{[1
]}

Khudia, Daya S. ^{[2
]}

Hughes, Christopher J. ^{[1
]}

Kim, Daehyun ^{[1
]}

机构：

[1] Intel Corp, Parallel Comp Lab, Santa Clara, CA 95051 USA

[2] Univ Michigan, Ann Arbor, MI 48109 USA

来源：

2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC) | 2013年

关键词：

Energy-Efficient Memory Hierarchy; Producer-Consumer Communication; Reuse Distance; Streaming Memory Accesses; MEMORY;

D O I：

10.1145/2503210.2503224

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

As cache hierarchies become deeper and the number of cores on a chip increases, managing caches becomes more important for performance and energy. However, current hardware cache management policies do not always adapt optimally to the applications behavior: e.g., caches may be polluted by data structures whose locality cannot be captured by the caches, and producer-consumer communication incurs multiple round trips of coherence messages per cache line transferred. We propose load and store instructions that carry hints regarding into which cache(s) the accessed data should be placed. Our instructions allow software to convey locality information to the hardware, while incurring minimal hardware cost and not affecting correctness. Our instructions provide a 1.07x speedup and a 1.24x energy efficiency boost, on average, according to simulations on a 64-core system with private L1 and L2 caches. With a large shared L3 cache added, the benefits increase, providing 1.33 x energy reduction on average.

引用

页数：12

共 50 条

[31] Victim management in a cache hierarchy
Franaszek, P. A.
Lastras-Montano, L. A.
Kunkel, S. R.
Sawdey, A. C.
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2006, 50 (4-5) : 507 - 523
[32] Power Multiplexing for Thermal Field Management in Many-Core Processors
Cho, Minki
Kersey, Chad
Gupta, Man Prakash
Sathe, Nikhil
Kumar, Satish
Yalamanchili, Sudhakar
Mukhopadhyay, Saibal
IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2013, 3 (01): : 94 - 104
[33] Economic models for many-core processors
Kumar, Rakesh
DR DOBBS JOURNAL, 2008, 33 (03): : 10 - 10
[34] Thermally Adaptive Cache Access Mechanisms for 3D Many-Core Architectures
Xiao, He
Yueh, Wen
Mukhopadhyay, Saibal
Yalamanchili, Sudhakar
IEEE COMPUTER ARCHITECTURE LETTERS, 2016, 15 (02) : 129 - 132
[35] Cache-based high-level simulation of microthreaded many-core architectures
Uddin, Irfan
Poss, Raphael
Jesshope, Chris
JOURNAL OF SYSTEMS ARCHITECTURE, 2014, 60 (07) : 529 - 552
[36] Cache Locality-Centric Parallel String Matching on Many-Core Accelerator Chips
Tran, Nhat-Phuong
Lee, Myungho
Choi, Dong Hoon
SCIENTIFIC PROGRAMMING, 2015, 2015
[37] Cache-Oblivious MPI All-to-All Communications on Many-Core Architectures
Li, Shigang
Zhang, Yunquan
Hoefler, Torsten
ACM SIGPLAN NOTICES, 2017, 52 (08) : 445 - 446
[38] Sharing-aware Efficient Private Caching in Many-core Server Processors
Shukla, Sudhanshu
Chaudhuri, Mainak
2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2017, : 485 - 492
[39] Performance-aware cache management for energy-harvesting nonvolatile processors
Yan Wang
Kenli Li
Xia Deng
Keqin Li
The Journal of Supercomputing, 2022, 78 : 3425 - 3447
[40] Thread-aware Dynamic Shared Cache Compression in Multi-core Processors
Xie, Yuejian
Loh, Gabriel H.
2011 IEEE 29TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2011, : 135 - 141

← 1 2 3 4 5 →