Reducing Cache Coherence Traffic with Hierarchical Directory Cache and NUMA-Aware Runtime Scheduling

被引:7
|
作者
Caheny, Paul [1 ,3 ]
Casas, Marc [1 ,3 ]
Moreto, Miguel [1 ,3 ]
Gloaguen, Herve [2 ]
Saintes, Maxime [2 ]
Ayguade, Eduard [1 ,3 ]
Labarta, Jesus [1 ,3 ]
Valero, Mateo [1 ,3 ]
机构
[1] Barcelona Supercomp Ctr, Barcelona, Spain
[2] Univ Politecn Cataluna, Dept Arquitectura Comp, Barcelona, Spain
[3] Bull Atos Technol, Les Clayes Sous Bois, France
基金
欧盟地平线“2020”;
关键词
Cache Coherence; NUMA; Task-based programming models; ARCHITECTURE;
D O I
10.1145/2967938.2967962
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provide for scaling core count and memory capacity. Also, the flat memory address space they offer considerably improves programmability. However, ccNUMA architectures require sophisticated and expensive cache coherence protocols to enforce correctness during parallel executions, which trigger a significant amount of on-and off-chip traffic in the system. This paper analyses how coherence traffic may be best constrained in a large, real ccNUMA platform through the use of a joint hardware/software approach. For several benchmarks, we study coherence traffic in detail under the influence of an added hierarchical cache layer in the directory protocol combined with runtime managed NUMA-aware scheduling and data allocation techniques to make most efficient use of the added hardware. The effectiveness of this joint approach is demonstrated by speedups of 1.23x to 2.54x and coherence traffic reductions between 44% and 77% in comparison to NUMA-oblivious scheduling and data allocation. Furthermore, we show that the NUMA-aware techniques we employ at the runtime level are crucial to ensure the added hierarchical layer in the directory coherence protocol does not introduce significant coherence traffic to the system.
引用
收藏
页码:275 / 286
页数:12
相关论文
共 50 条
  • [1] Reducing Cache Coherence Traffic with a NUMA-Aware Runtime Approach
    Caheny, Paul
    Alvarez, Lluc
    Derradji, Said
    Valero, Mateo
    Moreto, Miquel
    Casas, Marc
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (05) : 1174 - 1187
  • [2] Design, Implementation, and Evaluation of a NUMA-Aware Cache for iSCSI Storage Servers
    Ren, Yufei
    Li, Tan
    Yu, Dantong
    Jin, Shudong
    Robertazzi, Thomas
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (02) : 413 - 422
  • [3] Cache Aware Pre-Runtime Scheduling
    Daniel Kästner
    Stephan Thesing
    Real-Time Systems, 1999, 17 : 235 - 256
  • [4] Cache aware pre-runtime scheduling
    Kästner, D
    Thesing, S
    REAL-TIME SYSTEMS, 1999, 17 (2-3) : 235 - 256
  • [5] A NUMA-Aware Runtime Environment for the Actor Model
    Francesquini, Emilio
    Goldman, Alfredo
    Mehaut, Jean-Francois
    2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2013, : 250 - 259
  • [6] Hierarchical Cache Directory for CMP
    Song-Liu Guo
    Hai-Xia Wang
    Yi-Bo Xue
    Chong-Min Li
    Dong-Sheng Wang
    Journal of Computer Science and Technology, 2010, 25 : 246 - 256
  • [7] Hierarchical Cache Directory for CMP
    郭松柳
    王海霞
    薛一波
    李崇民
    汪东升
    Journal of Computer Science & Technology, 2010, 25 (02) : 246 - 256
  • [8] Hierarchical Cache Directory for CMP
    Guo, Song-Liu
    Wang, Hai-Xia
    Xue, Yi-Bo
    Li, Chong-Min
    Wang, Dong-Sheng
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2010, 25 (02) : 246 - 256
  • [9] Reducing Last Level Cache Pollution in NUMA Multicore Systems for Improving Cache Performance
    An, Deukhyeon
    Kim, Jeehong
    Han, JungHyun
    Eom, Young Ik
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT III, 2012, 7335 : 272 - 282
  • [10] Segment directory: An improvement to the pointer in directory cache coherence schemes
    Department of Electrical Engineering, Korea Adv. Inst. Sci. and Technol., 373-1 Kusong-Dong Yusong-Gu, Taejon, 305-701, Korea, Republic of
    Parallel Processing Letters, 1998, 8 (04): : 577 - 588