Architecture Optimization of Application-Specific Implicit Instructions

被引:0
作者
Di Biagio, Andrea [1 ]
Agosta, Giovanni [1 ]
Sykora, Martino [1 ]
Silvano, Cristina [1 ]
机构
[1] Politecn Milan, DEI, I-20133 Milan, Italy
关键词
Design; Performances; Pipeline architecture; implicit instruction issue;
D O I
10.1145/2331147.2331154
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic configuration of application-specific implicit instructions has been proposed to better exploit the available parallelism at the instruction level in pipelined processors. The support of such implicit instruction issue-requires the pipeline to be extended with a trigger table that describes the instruction implicitly issued as a response to a value written into a triggering register by a triggering instruction (which may be an add or sub instruction). In this article, we explore the design optimization of the trigger table to maximize the number of instructions that can be implicitly issued while keeping the limited size of the trigger table. The concept of implicitly issued instruction has been formally defined by considering the inter-basic block analysis of control and data dependencies. A compilation tool chain has been developed to automatically identify the optimization opportunities, taking into account the constraints imposed by control and data dependencies as well as by architectural limitations. The proposed solutions have been applied to the case of a baseline scalar MIPS processor where, for the selected set of benchmarks (DSPStone and Mibench/automotive), we obtained an average speedup of 17%.
引用
收藏
页数:23
相关论文
共 19 条
[1]  
[Anonymous], LANGUAGE DRIVEN EXPL
[2]   SimpleScalar: An infrastructure for computer system modeling [J].
Austin, T ;
Larson, E ;
Ernst, D .
COMPUTER, 2002, 35 (02) :59-+
[3]   A framework for modeling and estimating the energy dissipation of VLIW-based embedded systems [J].
Benini, L ;
Bruni, D ;
Chinosi, M ;
Silvano, C ;
Zaccaria, V ;
Zafalon, R .
DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2002, 7 (03) :183-203
[4]  
Bracy A, 2004, INT SYMP MICROARCH, P18
[5]  
Corporaal H., 1997, Microprocessor Architectures: from VLIW to TTA
[6]  
GATHAUS MR, 2001, P IEEE 4 ANN WORKSH
[7]  
Gochman S., 2003, INTEL TECHNOLOGY J, V07, P21
[8]   Frequent loop detection using efficient nonintrusive on-chip hardware [J].
Gordon-Ross, A ;
Vahid, F .
IEEE TRANSACTIONS ON COMPUTERS, 2005, 54 (10) :1203-1215
[9]  
HEINRICH J, 1993, MIPS R4000 MICROPROC
[10]   The optimal logic depth per pipeline stage is 6 to 8 FO4 inverter delays [J].
Hrishikesh, MS ;
Jouppi, NP ;
Farkas, KI ;
Burger, D ;
Keckler, SW ;
Shivakumar, P .
29TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2002, :14-24