High Performance Instruction Scheduling Circuits for Out-of-Order Soft Processors

被引:6
作者
Wong, Henry [1 ]
Betz, Vaughn [1 ]
Rose, Jonathan [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 1A1, Canada
来源
2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM) | 2016年
关键词
D O I
10.1109/FCCM.2016.11
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Soft processors have a role to play in easing the difficulty of designing applications into FPGAs for two reasons: first, they can be deployed only when needed, unlike permanent on-die hard processors. Second, for the portions of an application that can function sufficiently fast on a soft processor, it is far easier to write and debug single-threaded software code than to create hardware. The breadth of this second role increases when the performance of the soft processor increases, yet there has been little progress in the performance of soft processors since their commercial inception - in particular, the sophisticated out-of-order superscalar approaches that arrived in the mid 1990s are not employed, despite the fact that their area cost is now easily tolerable. In this paper we take an important step towards out-of-order execution in soft processors by exploring instruction scheduling in an FPGA substrate. This differs from the hard-processor design problem because the logic substrate is restricted to LUTs, whereas hard processor scheduling circuits employ CAM and wired-OR structures to great benefit. We discuss both circuit and microarchitectural trade-offs, and compare three circuit structures for the scheduler, including a new structure called a fused-logic matrix scheduler. With this circuit, large schedulers up to 40 entries can be built with the same cycle time as the commercial Nios II/f soft processor (240 MHz). This careful design has the potential to significantly increase both the IPC and raw compute performance of a soft processor, compared to current commercial soft processors.
引用
收藏
页码:9 / 16
页数:8
相关论文
共 26 条
  • [1] Aasaraai K., 2010, P FPT DEC
  • [2] Altera, 2015, DSN28162004 ALT
  • [3] Brown M. D., 2001, P MICRO
  • [4] Canal R., 2000, P SUP
  • [5] Chen CH, 2007, IEEE T COMPUT, V56, P1534, DOI [10.1109/TC.2007.70743, 10.1109/TC070743]
  • [6] Ernst D., 2002, P ISCA
  • [7] Issue logic for a 600-MHz out-of-order execution microprocessor
    Farrell, JA
    Fischer, TC
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1998, 33 (05) : 707 - 712
  • [8] Golden M., 2011, P ISSCC FEB
  • [9] Goshima M., 2001, P MICRO
  • [10] Gwennap L., 1997, MICROPROCESSOR REPOR, V11