A distributed control path architecture for VLIW processors

被引:0
作者
Zhong, HT [1 ]
Fan, K [1 ]
Mahlke, S [1 ]
Schlansker, M [1 ]
机构
[1] Univ Michigan, Adv Comp Architecture Lab, Ann Arbor, MI 48109 USA
来源
PACT 2005: 14TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES | 2005年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
VLIW architectures are popular in embedded systems because they offer high-performance processing at low cost and energy. The major problem with traditional VLIW designs is that they do not scale efficiently due to bottlenecks that result from centralized resources and global communication. Multicluster designs have been proposed to solve the scaling problem of VLIW datapaths, while much less work has been done on the control path. In this paper, we propose a distributed control path architecture for VLIW processors (DVLIW) to overcome the scalability problem of VLIW control paths. The architecture simplifies the dispersal of complex VLIW instructions and supports efficient distribution of instructions through a limited bandwidth interconnect, while supporting compressed instruction encodings. DVLIW employs a multicluster design where each cluster contains a local instruction memory that provides all intra-cluster control. All clusters have their own program counter and instruction sequencing capabilities, thus instruction execution is completely decentralized. The architecture executes multiple instruction streams at the same time, but these streams collectively function as a single logical instruction stream. Simulation results show that DVLIW processors reduce the number of cross-chip control signals by approximately two orders of magnitude while incurring a small performance overhead to explicitly manage the instruction streams.
引用
收藏
页码:197 / 206
页数:10
相关论文
共 32 条
  • [1] Code size minimization and retargetable assembly for custom EPIC and VLIW instruction formats
    Aditya, S
    Mahlke, SA
    Rau, BR
    [J]. ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2000, 5 (04) : 752 - 773
  • [2] Agarwal V, 2000, PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, P248, DOI [10.1145/342001.339691, 10.1109/ISCA.2000.854395]
  • [3] ALETA A, 2003, MICRO 36 P 36 ANN IE, P326
  • [4] Chu M, 2003, P SIGPLAN 03 C PROGR, P300
  • [5] Colwell R. P., 1990, Proceedings of Supercomputing '90 (Cat. No.90CH2916-5), P910, DOI 10.1109/SUPERC.1990.130118
  • [6] Instruction fetch mechanisms for VLIW architectures with compressed encodings
    Conte, TM
    Banerjia, S
    Larin, SY
    Menezes, KN
    Sathaye, SW
    [J]. PROCEEDINGS OF THE 29TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE - MICRO-29, 1996, : 201 - 211
  • [7] DOBBERPUHL D, 1996, P INT S LOW POW EL D, P11
  • [8] ELDER J, DINRO 4 TRACE DRIVEN
  • [9] Ellis J.R., 1985, BULLDOG COMPILER VLI
  • [10] Fisher J. A., 1983, P 10 ANN INT S COMP, P140, DOI DOI 10.1145/800046.801649