DP&TB: a coherence filtering protocol for many-core chip multiprocessors

被引:5
|
作者
Yuan, Fengkai [1 ]
Ji, Zhenzhou [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
来源
JOURNAL OF SUPERCOMPUTING | 2013年 / 66卷 / 01期
关键词
Chip multiprocessors; Cache coherence protocol; Coherence filtering; Page granularity; On-chip network traffic; Indirection problem; REPLICATION;
D O I
10.1007/s11227-013-0900-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Future many-core chip multiprocessors (CMPs) will integrate hundreds of processor cores on chip. Two cache coherence protocols are the mainstream applied to current CMPs. The token-based protocol (Token) provides high performance, but it generates a prohibitive amount of network traffic, which translates into excessive power consumption. The directory-based protocol (Directory) reduces network traffic, yet trades off with the storage overhead of the directory as well as entails comparatively low performance caused by indirection limiting its applicability for many-core CMPs. In this work, we present DP&TB, a novel cache coherence protocol particularly suited to future many-core CMPs. In DP&TB, cache coherence is maintained at the granularity of a page, facilitating to filter out either unnecessary coherence inspections for blocks inside private pages or network traffic for blocks inside shared pages. We employ Directory to detect private and shared pages and Token to maintain the coherence of the blocks inside shared pages. DP&TB inherits the merit of Directory and Token and overcome their problems. Experimental results show that DP&TB comprehensively beyond Directory and Token with improvement by 9.1 % in performance over Token and by 13.8 % in network traffic over Directory. In addition, the storage overhead of DP&TB is less than half of that of Directory. Our proposal can fulfill the requirement of many-core CMPs to achieve high performance, power and area efficiency.
引用
收藏
页码:249 / 261
页数:13
相关论文
共 50 条
  • [1] DP&TB: a coherence filtering protocol for many-core chip multiprocessors
    Fengkai Yuan
    Zhenzhou Ji
    The Journal of Supercomputing, 2013, 66 : 249 - 261
  • [2] A Direct Coherence Protocol for Many-Core Chip Multiprocessors
    Ros, Alberto
    Acacio, Manuel E.
    Garcia, Jose M.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2010, 21 (12) : 1779 - 1792
  • [3] Automated Modeling and Emulation of Interconnect Designs for Many-Core Chip Multiprocessors
    Ihrig, Colin J.
    Melhem, Rami
    Jones, Alex K.
    PROCEEDINGS OF THE 47TH DESIGN AUTOMATION CONFERENCE, 2010, : 431 - 436
  • [4] Providing Balanced Mapping for Multiple Applications in Many-Core Chip Multiprocessors
    Zhu, Di
    Chen, Lizhong
    Yue, Siyu
    Pinkston, Timothy M.
    Pedram, Massoud
    IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (10) : 3122 - 3135
  • [5] The Case for a Scalable Coherence Protocol for Complex On-Chip Cache Hierarchies in Many-Core Systems
    Menezo, Lucia G.
    Puente, Valentin
    Angel Gregorio, Jose
    2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 279 - 288
  • [6] Reconfigurable MPB Combined with Cache Coherence Protocol in Many-core
    Han, Xing
    Fu, Yuzhuo
    Jiang, Jiang
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 385 - 388
  • [7] On-Chip Traffic Regulation to Reduce Coherence Protocol Cost on a Microthreaded Many-Core Architecture with Distributed Caches
    Yang, Qiang
    Fu, Jian
    Poss, Raphael
    Jesshope, Chris
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13
  • [8] Resiliency for Many-core System on a Chip
    Karnik, Tanay
    Tschanz, James
    Borkar, Nitin
    Howard, Jason
    Vangal, Sriram
    De, Vivek
    Borkar, Shekhar
    2014 19TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2014, : 388 - 389
  • [9] Reconfigurable Many-Core Processor with Cache Coherence
    Han, Xing
    Jiang, Jiang
    Fu, Yuzhuo
    Wang, Chang
    COMPUTER ENGINEERING AND TECHNOLOGY, NCCET 2013, 2013, 396 : 198 - 207
  • [10] Architecture supported synchronization-based cache coherence protocol for many-core processors
    Huang, He
    Liu, Lei
    Song, Feng-Long
    Ma, Xiao-Yu
    Jisuanji Xuebao/Chinese Journal of Computers, 2009, 32 (08): : 1618 - 1630