Direct coherence: Bringing together performance and scalability in shared-memory multiprocessors

被引:0
作者
Ros, Alberto [1 ]
Acacio, Manuel E. [1 ]
Garcia, Jose M. [1 ]
机构
[1] Univ Murcia, Dept Ingn & Tecnol Computadores, E-30100 Murcia, Spain
来源
HIGH PERFORMANCE COMPUTING - HIPC 2007, PROCEEDINGS | 2007年 / 4873卷
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional directory-based cache coherence protocols suffer from long-latency cache misses as a consequence of the indirection introduced by the home node, which must be accessed on every cache miss before any coherence action can be performed. In this work we present a new protocol that moves the role of storing up-to-date coherence information (and thus ensuring totally ordered accesses) from the home node to one of the sharing caches. Our protocol allows most cache misses to be directly solved from the corresponding remote caches, without requiring the intervention of the home node. In this way, cache miss latencies are reduced. Detailed simulations show that this protocol leads to improvements in total execution time of 8% on average over a highly optimized MOESI directory-based protocol.
引用
收藏
页码:147 / 160
页数:14
相关论文
共 16 条
  • [1] The use of prediction for accelerating upgrade misses in cc-NUMA multiprocessors
    Acacio, ME
    González, J
    García, JM
    Duato, J
    [J]. 2002 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2002, : 155 - 164
  • [2] ACACIO ME, 2002, SC2002 HIGH PERFORMA
  • [3] Chang JC, 2006, CONF PROC INT SYMP C, P264, DOI 10.1145/1150019.1136509
  • [4] Cheng LQ, 2007, INT S HIGH PERF COMP, P328
  • [5] Culler DavidE., 1999, PARALLEL COMPUTER AR
  • [6] Gupta A., 1990, INT C PAR PROC, P312
  • [7] Rsim: Simulating shared-memory multiprocessors with ILP processors
    Hughes, CJ
    Pai, VS
    Ranganathan, P
    Adve, SV
    [J]. COMPUTER, 2002, 35 (02) : 40 - +
  • [8] MARTIN MM, 2003, THESIS U WISCONSIN M
  • [9] Martin MMK, 2002, EIGHTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, P251
  • [10] Token coherence: Decoupling performance and correctness
    Martin, MMK
    Hill, MD
    Wood, DA
    [J]. 30TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2003, : 182 - 193