Design and Implementation of the Blue Gene/P Snoop Filter

被引:0
|
作者
Salapura, Valentina [1 ]
Blumrich, Matthias [1 ]
Gara, Alan [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
来源
2008 IEEE 14TH INTERNATIONAL SYMPOSIUM ON HIGH PEFORMANCE COMPUTER ARCHITECTURE | 2008年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As multi-core processors evolve, coherence traffic between cores is becoming problematic, both in terms of performance and power The negative effects of coherence (snoop) traffic can be significantly mitigated through snoop filtering. Shielding each cache with a device that can squash snoop requests for addresses known not to be in cache improves performance significantly for caches that cannot perform normal load and snoop lookups simultaneously. In addition, reducing snoop lookups yields power savings. This paper describes the design of the Blue Gene/P snoop filters, and presents hardware measurements to demonstrate their effectiveness. The Blue Gene/P snoop filters combine stream registers and snoop caches to capture both the locality of snoop addresses and their streaming behavior. Simulations of SPLASH-2 benchmarks illustrate tradeoffs and strengths of these two techniques. Their combination is shown to be most effective, eliminating 94-99% of all snoop requests using very few stream registers and snoop cache lines. This translates into an average performance improvement of almost 20% for the NAS benchmarks running on an actual Blue Gene/P system.
引用
收藏
页码:2 / 11
页数:10
相关论文
共 50 条
  • [21] Design and implementation of inverse legendre microstrip filter
    Kant, Rajni
    Gupta, Nisha
    MICROWAVE AND OPTICAL TECHNOLOGY LETTERS, 2017, 59 (01) : 69 - 73
  • [22] Design and Implementation of Seismic Data Digital Filter
    Li, Yong
    MECHATRONICS AND APPLIED MECHANICS II, PTS 1 AND 2, 2013, 300-301 : 1669 - 1672
  • [23] Design and Implementation of Filter for Simulating Hearing Loss
    Zhong, Bo
    Zhang, Li
    Bai, Ying
    Chen, Xiaomei
    Niu, Feng
    Feng, Xiujuan
    2015 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, 2015, : 1708 - 1713
  • [24] Design and implementation of a programmable stack filter with FPGAs
    Hu, M
    Vainio, O
    Gevorkian, D
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1371 - 1374
  • [25] Modified Comb Decimation Filter: Design and Implementation
    Garcia Baez, Ricardo
    Jovanovic Dolecek, Gordana
    2016 IEEE 59TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2016, : 153 - 156
  • [26] Design and Implementation of LTCC Low Pass Filter
    Zhao, Yu-Bo
    Fan, Li-Si
    Mi, Guang-Pu
    Chang, Jin
    Chen, Guo-Wei
    2018 11TH UK-EUROPE-CHINA WORKSHOP ON MILLIMETER WAVES AND TERAHERTZ TECHNOLOGIES (UCMMT2018), VOL 1, 2018,
  • [27] Design and real implementation of adaptive notch filter
    Rector, T
    Doraiswami, R
    2000 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS 1 AND 2: NAVIGATING TO A NEW ERA, 2000, : 625 - 629
  • [28] Design and FPGA implementation of multiplierless comb filter
    Barsainya, Richa
    Agarwal, Meenakshi
    Rawat, Tarun Kumar
    INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2017, 45 (11) : 1497 - 1513
  • [29] Design of Prototype filter of DFT filter banks with low implementation complexity
    Feng, Zhi Guo
    Yiu, Ka Fai Cedric
    ISCE: 2009 IEEE 13TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, VOLS 1 AND 2, 2009, : 877 - +
  • [30] An FPGA implementation of a snoop cache with synchronization for a multiprocessor System-On-Chip
    Yamawaki, Akira
    Iwane, Masahiko
    2007 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, VOLS 1 AND 2, 2007, : 17 - 24