A Quantitative Study of the On-Chip Network and Memory Hierarchy Design for Many-Core Processor

被引:8
作者
Wang, Xu [1 ]
Gan, Ge [2 ]
Manzano, Joseph [2 ]
Fan, Dongrui [1 ]
Guo, Shuxu [3 ]
机构
[1] Chinese Acad Sci, ICT, Key Lab Comp Syst & Arch, Beijing 100190, Peoples R China
[2] Univ Delaware, Dept ECE, Newark, DE 19716 USA
[3] Jilin Univ, Dept EE, Jilin 130000, Peoples R China
来源
PROCEEDINGS OF THE 2008 14TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS | 2008年
基金
中国国家自然科学基金;
关键词
many-core processor; on-chip network; communication pattern; memory hierarchy; cache;
D O I
10.1109/ICPADS.2008.18
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we will study the on-chip network and memory hierarchy design of the Godson-T - a homogeneous many-core processor. Godson-T has 64 cores (with private L1 cache), and 16 global L2 cache banks. All these on-chip units are connected by a 2D 8 x 8 mesh network. Our study reveals that: (a) Global on-chip L2 cache can effectively alleviate the memory pressure caused by the data-thirsty on-chip computing engines. However, its potential is still limited by both the off-chip and the in-chip bandwidth, especially when increasing the number of active threads. (b) On-chip traffic congestion is largely caused by the intensive memory access requests issued from the on-chip cores. Therefore, the design of the on-chip network must consider the available performance of the datapath that connects the processor to the main memory. (c) In theory, different applications have different communication patterns (Berkeley's view [1]). However, the application's runtime communication pattern is only determined by the design of the underlying memory hierarchy and on-chip interconnection. These conclusions are generally applicable to a wide variety of many-core processors with similar design.
引用
收藏
页码:689 / +
页数:2
相关论文
共 18 条
[1]  
Asanovc K., 2006, UCBEECS2006183
[2]   Scaling to the end of silicon with EDGE architectures [J].
Burger, D ;
Keckler, SW ;
McKinley, KS ;
Dahlin, M ;
John, LK ;
Lin, C ;
Moore, CR ;
Burrill, J ;
McDonald, RG ;
Yoder, R .
COMPUTER, 2004, 37 (07) :44-+
[3]  
CHEN JSC, 1991, COMM 1991 ICC 91 IEE, V1, P413
[4]  
CYPER RJ, 1978, COMMUNICATIONS ARCHI
[5]  
Dally W., 2003, PRINCIPLES PRACTICES
[6]  
DELCUVILLO J, 2005, HPCS 2006
[7]   Location consistency - A new memory model and cache consistency protocol [J].
Gao, GR ;
Sarkar, V .
IEEE TRANSACTIONS ON COMPUTERS, 2000, 49 (08) :798-813
[8]  
Gratz P, 2007, PR IEEE COMP DESIGN, P477
[9]  
HELD J, 2006, INTEL WHITE PAPER
[10]  
Hennessy J. L, 2012, COMPUTER ARCHITECTUR