Graph-partitioning based instruction scheduling for clustered processors

Cited by: 0

Authors:

Aletà, A [1]

Codina, JM [1]

Sánchez, J [1]

González, A [1]

Affiliations:

[1] Univ Politecn Cataluna, Dept Comp Architecture, Barcelona, Spain

Source:

34TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO-34, PROCEEDINGS | 2001

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

This work presents a novel scheme to schedule loops for clustered microarchitectures. The scheme is based on a preliminary cluster assignment phase, implemented through graph partitioning techniques, followed by a scheduling phase that integrates register allocation and spill code generation. The graph partitioning scheme is shown to be very effective because it has a global view of the whole code while the partition is generated. Results show a significant speedup when compared with previously proposed techniques. For some processor configurations, the average speedup for SPECfp95 is 23% with respect to the best-performing published scheme. In addition, the proposed scheme is much faster (between 2 and 7 times, depending on the configuration).
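As a rough illustration of what a graph-partitioning cluster assignment phase can look like, the sketch below splits a small data dependence graph between two clusters while keeping the clusters balanced and minimizing the weight of cut edges (each cut edge standing for an inter-cluster communication). It is not the partitioner used in the paper, whose details this record does not give. The greedy pairwise-swap refinement, the function names `cut_weight` and `assign_clusters`, the unit edge weights, and the six-node example graph are all assumptions made for illustration.

```python
from itertools import combinations


def cut_weight(edges, cluster_of):
    """Sum the weights of dependence edges whose endpoints lie in different clusters."""
    return sum(w for u, v, w in edges if cluster_of[u] != cluster_of[v])


def assign_clusters(nodes, edges, n_clusters=2):
    """Balanced cluster assignment refined by greedy pairwise swaps (illustrative only)."""
    # Assumed starting point: round-robin assignment, which is balanced by construction.
    cluster_of = {n: i % n_clusters for i, n in enumerate(nodes)}
    best = cut_weight(edges, cluster_of)
    while True:
        best_swap = None
        for u, v in combinations(nodes, 2):
            if cluster_of[u] == cluster_of[v]:
                continue
            # Tentatively swap the two nodes; a swap never changes cluster sizes.
            cluster_of[u], cluster_of[v] = cluster_of[v], cluster_of[u]
            cost = cut_weight(edges, cluster_of)
            cluster_of[u], cluster_of[v] = cluster_of[v], cluster_of[u]
            if cost < best:
                best, best_swap = cost, (u, v)
        if best_swap is None:
            # No swap lowers the cut any further: a local optimum.
            return cluster_of
        u, v = best_swap
        cluster_of[u], cluster_of[v] = cluster_of[v], cluster_of[u]


# Hypothetical loop body: two dependence chains joined by a single edge.
nodes = ["a0", "a1", "a2", "b0", "b1", "b2"]
edges = [("a0", "a1", 1), ("a1", "a2", 1), ("a2", "b0", 1),
         ("b0", "b1", 1), ("b1", "b2", 1)]
result = assign_clusters(nodes, edges)
print(result, cut_weight(edges, result))
```

Because every candidate swap is scored against the whole edge list, each decision reflects the entire graph rather than one instruction at a time, which is the kind of global view the abstract credits for the scheme's effectiveness. A production partitioner would typically use a multilevel method and richer edge weights.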

Citation

Pages: 150-159

Page count: 10

References: 40 in total

