A mesh-of-trees interconnection network for single-chip parallel processing

被引:25
|
作者
Balkan, Aydin O. [1 ]
Qu, Gang [2 ]
Vishkin, Uzi [2 ]
机构
[1] Univ Maryland, Inst Adv Comp Studies, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ASAP.2006.6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
There is a recent surge of interest in single-chip parallel processors. In such machines, it is crucial to implement a high-throughput low-latency interconnection network to connect the on-chip components, especially the processing units and the memory units. In this paper, we propose a new mesh of trees (MoT) implementation of the interconnection network and evaluate it relative to metrics such as wire area, total switch delay and maximum throughput taking into account latency-throughput trade-offs. We show that on-chip interconnection networks can provide higher bandwidth between processors and shared first-level cache than previously considered possible, facilitating greater scalability of memory architectures that require that. MoT is also compared, both analytically and experimentally, to some other traditional network topologies, such as hypercube, butterfly.. fat trees and butterfly fat trees. When we evaluate a 64-terminal MoT network at 65nm technology, concrete results show that MoT provides higher throughput and lower latency especially when the input traffic (or the on-chip parallelism) is high, at the cost of larger area. A recurring problem in networking and communication is that of achieving good sustained throughput in contrast to just high theoretical peak performance that does not materialize for typical work loads. Our quantitative results demonstrate a clear advantage of the proposed MoT network in the context of single-chip parallel processing.
引用
收藏
页码:73 / +
页数:2
相关论文
共 50 条
  • [1] Mesh-of-Trees and Alternative Interconnection Networks for Single-Chip Parallelism
    Balkan, Aydin O.
    Qu, Gang
    Vishkin, Uzi
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2009, 17 (10) : 1419 - 1432
  • [2] An area-efficient high-throughput hybrid interconnection network for single-chip parallel processing
    Balkan, Aydin O.
    Qu, Gang
    Vishkin, Uzi
    2008 45TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2008, : 435 - 440
  • [3] Layout-accurate design and implementation of a high-throughput interconnection network for single-chip parallel processing
    Balkan, Aydin O.
    Horak, Michael N.
    Qu, Gang
    Vishkin, Uzi
    15TH ANNUAL IEEE SYMPOSIUM ON HIGH-PERFORMANCE INTERCONNECTS, PROCEEDINGS, 2007, : 21 - +
  • [4] Single-chip multiprocessors: The rebirth of parallel processing
    Sohi, G
    12TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2003, : 2 - 2
  • [5] Parallel BP Neural Network on Single-chip Cloud Computer
    Li, Boyang
    Liu, Chen
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1871 - 1875
  • [6] Real-time Interconnection Network for Single-Chip Many-Core Computers
    Richter, Harald
    PROCEEDINGS OF IEEE EAST-WEST DESIGN & TEST SYMPOSIUM (EWDTS 2013), 2013,
  • [7] SINGLE-CHIP VIDEO PROCESSING SYSTEM
    DESOR, HJ
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1991, 37 (03) : 182 - 189
  • [8] A wireless interconnection network for parallel processing
    Marczynski, J
    Tabak, D
    EUROMICRO SYMPOSIUM ON DIGITAL SYSTEMS DESIGN, PROCEEDINGS, 2001, : 386 - 389
  • [9] Video processing for single-chip DVB decoder
    Brett, M
    Gerstenberg, B
    Herberg, C
    Shavit, G
    Liondas, H
    ICCE: 2001 INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, DIGEST OF TECHNICAL PAPERS, 2001, : 82 - 83
  • [10] Evaluating On-Chip Interconnection Architectures for Parallel Processing
    Freitas, Henrique C.
    Navaux, Philippe O. A.
    CSE 2008: PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING, 2008, : 188 - 193