Improving the parallelism and concurrency in decoupled architectures

Cited by: 0
Authors
John, L
Radhakrishnan, R
Source
EIGHTH IEEE SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS | 1996
Keywords
access-execute decoupling; compiler optimization; load hoisting; code motion; memory access
DOI
10.1109/SPDP.1996.570325
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Decoupled access-execute architectures use queues to exploit concurrency between memory access and execution, but data-dependent control dependencies often prevent data from being prefetched into the queues. This paper investigates a technique that enables anticipatory loading into the queues even in the presence of data-dependent control dependencies. The proposed method fetches along one or both paths of a data-dependent control dependency and inserts consume instructions in the appropriate paths to discard the unnecessarily fetched data. The compiler hoists load instructions above control dependencies, as in conventional load hoisting techniques. The technique is shown to be very effective in programs with data-dependent if-then-else constructs. We also present an architecture with multiple access units, the mLSU architecture, which parallelizes the access process. Simulation experiments show that multiple access units improve performance when access-processor instruction issue is a bottleneck.
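As a rough illustration of the idea described in the abstract, the Python sketch below models a loop of the form "if a[i] > 0 then use b[i] else use c[i]". It is not the paper's compiler or simulator. The names access_unit and execute_unit, the single FIFO load queue, and the sample data are all assumptions made for this example. The access side fetches along both paths without knowing the branch outcome, and the execute side discards the unneeded value with a "consume" step so the queue stays in sync.

```python
from collections import deque

def access_unit(a, b, c):
    """Hypothetical access side: runs ahead and pushes operands into a load queue."""
    queue = deque()
    for i in range(len(a)):
        queue.append(a[i])  # value the branch condition depends on
        queue.append(b[i])  # load hoisted above the branch (then-path operand)
        queue.append(c[i])  # load hoisted above the branch (else-path operand)
    return queue

def execute_unit(queue, n):
    """Hypothetical execute side: pops operands in FIFO order.

    A 'consume' is modeled as a pop whose result is thrown away.
    """
    total = 0
    consumed = 0
    for _ in range(n):
        cond = queue.popleft()
        if cond > 0:
            total += queue.popleft()  # use b[i]
            queue.popleft()           # consume: discard the unneeded c[i]
        else:
            queue.popleft()           # consume: discard the unneeded b[i]
            total += queue.popleft()  # use c[i]
        consumed += 1                 # exactly one consume per iteration
    return total, consumed

# Sample data (assumed for illustration only).
a = [1, -2, 3, 0]
b = [10, 20, 30, 40]
c = [100, 200, 300, 400]

q = access_unit(a, b, c)
total, consumed = execute_unit(q, len(a))

# Reference result computed without any queue, for comparison.
reference = sum(b[i] if a[i] > 0 else c[i] for i in range(len(a)))
```

In this toy model every iteration fetches one value that is never used, which is the cost paid for letting the access side run ahead of the branch. Whether that trade pays off on real hardware depends on factors the sketch does not model, such as memory latency and queue capacity.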
Pages: 130-137
Page count: 8