Location-Aware Cache Management for Many-Core Processors with Deep Cache Hierarchy

被引:1
|
作者
Park, Jongsoo [1 ]
Yoo, Richard M. [1 ]
Khudia, Daya S. [2 ]
Hughes, Christopher J. [1 ]
Kim, Daehyun [1 ]
机构
[1] Intel Corp, Parallel Comp Lab, Santa Clara, CA 95051 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
Energy-Efficient Memory Hierarchy; Producer-Consumer Communication; Reuse Distance; Streaming Memory Accesses; MEMORY;
D O I
10.1145/2503210.2503224
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As cache hierarchies become deeper and the number of cores on a chip increases, managing caches becomes more important for performance and energy. However, current hardware cache management policies do not always adapt optimally to the applications behavior: e.g., caches may be polluted by data structures whose locality cannot be captured by the caches, and producer-consumer communication incurs multiple round trips of coherence messages per cache line transferred. We propose load and store instructions that carry hints regarding into which cache(s) the accessed data should be placed. Our instructions allow software to convey locality information to the hardware, while incurring minimal hardware cost and not affecting correctness. Our instructions provide a 1.07x speedup and a 1.24x energy efficiency boost, on average, according to simulations on a 64-core system with private L1 and L2 caches. With a large shared L3 cache added, the benefits increase, providing 1.33 x energy reduction on average.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Regional cache organization for NoC based many-core processors
    Ye, John M.
    Cao, Man
    Qu, Zening
    Chen, Tianzhou
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2013, 79 (02) : 175 - 186
  • [2] Location-aware cache replacement for mobile environments
    Lai, KY
    Tari, Z
    Bertok, P
    GLOBECOM '04: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-6, 2004, : 3441 - 3447
  • [3] A future location-aware replacement policy for the cache management at the mobile terminal
    Drakatos, Stylianos
    Pissinou, Niki
    Makki, Kia
    Douligeris, Christos
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2009, 9 (05): : 607 - 629
  • [4] Reconfigurable Many-Core Processor with Cache Coherence
    Han, Xing
    Jiang, Jiang
    Fu, Yuzhuo
    Wang, Chang
    COMPUTER ENGINEERING AND TECHNOLOGY, NCCET 2013, 2013, 396 : 198 - 207
  • [5] Architecture supported synchronization-based cache coherence protocol for many-core processors
    Huang, He
    Liu, Lei
    Song, Feng-Long
    Ma, Xiao-Yu
    Jisuanji Xuebao/Chinese Journal of Computers, 2009, 32 (08): : 1618 - 1630
  • [6] Broadcasted Location-Aware Data Cache for Vehicular Application
    Sato, Kenya
    Koita, Takahiro
    Fukuda, Akira
    EURASIP JOURNAL ON EMBEDDED SYSTEMS, 2007, (01)
  • [7] Cache Capacity Aware Thread Scheduling for Irregular Memory Access on Many-Core GPGPUs
    Kuo, Hsien-Kai
    Yen, Ta-Kan
    Lai, Bo-Cheng Charles
    Jou, Jing-Yang
    2013 18TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2013, : 338 - 343
  • [8] Impact of Partitioning Cache Schemes on the Cache Hierarchy of SMT processors
    Kenyon, Samantha
    Lopez, Sonia
    Sahuquillo, Julio
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 706 - 711
  • [9] SelectDirectory: A Selective Directory for Cache Coherence in Many-Core Architectures
    Yao, Yuan
    Wang, Guanhua
    Ge, Zhiguo
    Mitra, Tulika
    Chen, Wenzhi
    Zhang, Naxin
    2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015, : 175 - 180
  • [10] Reconfigurable MPB Combined with Cache Coherence Protocol in Many-core
    Han, Xing
    Fu, Yuzhuo
    Jiang, Jiang
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 385 - 388