Modulo scheduling for a fully-distributed clustered VLIW architecture

被引:0
作者
Sánchez, J [1 ]
González, A [1 ]
机构
[1] Univ Politecn Cataluna, Dept Comp Architecture, Barcelona, Spain
来源
33RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE: MICRO-33 2000, PROCEEDINGS | 2000年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is an approach that many microprocessors are adopting in recent times in order to mitigate the increasing penalties of wire delays. In this work we propose a novel clustered VLIW architecture which has all its resources partitioned among clusters, including the cache memory. A modulo scheduling scheme for this architecture is also proposed. This algorithm takes into account both register and memory inter-cluster communications so that the final schedule results in a cluster assignment that favors cluster locality in cache references and register accesses. It has been evaluated for both 2- and 4-cluster configurations and for differing number and latencies of inter-cluster buses. The proposed algorithm produces schedules with very low communication requirements and outperforms previous cluster-oriented schedulers.
引用
收藏
页码:124 / 133
页数:10
相关论文
共 26 条
  • [1] Agarwal V, 2000, PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, P248, DOI [10.1145/342001.339691, 10.1109/ISCA.2000.854395]
  • [2] [Anonymous], P 8 ANN S COMP ARCH
  • [3] AYGUADE E, 1996, SUP 96 SC96 RES EXH
  • [4] BERMUDO N, 2000, P INT S PERF AN SYST
  • [5] CAPITANIO A, 1992, P 25 INT S MICR, P192
  • [6] Culler DavidE., 1999, PARALLEL COMPUTER AR
  • [7] ELLIS JR, 1986, BULLDOG COMPILER VLI, P180
  • [8] Distributed modulo scheduling
    Fernandes, MM
    Llosa, J
    Topham, N
    [J]. FIFTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 1999, : 130 - 134
  • [9] The TigerSHARC DSP architecture
    Fridman, J
    Greenfield, Z
    [J]. IEEE MICRO, 2000, 20 (01) : 66 - 76
  • [10] GHOSH S, 1997, P 11 INT C SUP ICS 9, P317