Tango: A hardware-based data prefetching technique for superscalar processors

Cited by: 7
Authors
Pinter, SS
Yoaz, A
Source
PROCEEDINGS OF THE 29TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE - MICRO-29 | 1996
DOI
10.1109/MICRO.1996.566463
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
We present a new hardware-based data prefetching mechanism for enhancing instruction-level parallelism and improving the performance of superscalar processors. The emphasis in our scheme is on the effective utilization of slack time and hardware resources not used for the main computation. The scheme suggests a new hardware construct, the Program Progress Graph (PPG), as a simple extension to the Branch Target Buffer (BTB). We use the PPG for implementing a fast pre-program counter (pre-PC) that travels only through memory reference instructions (rather than scanning all the instructions sequentially). In a single clock cycle the pre-PC extracts all the predicted memory references in some future block of instructions, to obtain early data prefetching. In addition, the PPG can be used for implementing a pre-processor and for instruction prefetching. The prefetch requests are scheduled to "tango" with the core requests from the data cache, by using only free time slots on the existing data cache tag ports. Employing special methods for removing prefetch requests that are already in the cache (without utilizing the cache-tag port bandwidth) and a simple optimization on the cache LRU mechanism reduce the number of prefetch requests sent to the core-cache bus and to the memory (second level) bus. Simulation results on the SPEC92 benchmark for the baseline architecture (32K-byte data cache and 12-cycle fetch latency) show an average speedup of 1.36 (CPI ratio).
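The scheduling idea in the abstract, issuing prefetches only in time slots that core requests leave free on the data cache tag ports, can be illustrated with a small software model. This is a minimal sketch under stated assumptions, not the authors' hardware design: the function name `schedule_prefetches`, the default of two tag ports, and the per-cycle usage trace are all hypothetical, and the abstract does not give the actual port count or queueing policy.

```python
from collections import deque


def schedule_prefetches(core_port_use, prefetch_queue, num_tag_ports=2):
    """Issue queued prefetches only on tag ports the core leaves idle.

    core_port_use  -- tag ports used by core requests in each cycle
                      (hypothetical trace, one integer per cycle)
    prefetch_queue -- prefetch addresses, oldest first
    num_tag_ports  -- assumed number of data cache tag ports

    Returns a list of (cycle, address) pairs for the prefetches issued.
    """
    pending = deque(prefetch_queue)
    issued = []
    for cycle, used in enumerate(core_port_use):
        # Core requests always win; prefetches see only the leftover ports.
        free_ports = max(num_tag_ports - used, 0)
        for _ in range(free_ports):
            if not pending:
                break
            issued.append((cycle, pending.popleft()))
    return issued


# Cycle 0: both ports busy; cycle 1: one free; cycle 2: both free; cycle 3: busy.
trace = [2, 1, 0, 2]
requests = [0x100, 0x140, 0x180, 0x1C0]
print(schedule_prefetches(trace, requests))
```

In this model the core never waits on a prefetch: a prefetch that finds no free port simply stays queued, which is why the last request in the example is still pending when the trace ends.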
Pages: 214 - 225
Page count: 12
Related papers
50 in total
  • [1] EFFECTIVE HARDWARE-BASED DATA PREFETCHING FOR HIGH-PERFORMANCE PROCESSORS
    CHEN, TF
    BAER, JL
    IEEE TRANSACTIONS ON COMPUTERS, 1995, 44 (05) : 609 - 623
  • [2] Energy characterization of hardware-based data prefetching
    Guo, Y
    Chheda, S
    Koren, I
    Krishna, CM
    Moritz, CA
    IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS & PROCESSORS, PROCEEDINGS, 2004, : 518 - 523
  • [3] Tolerating medium latencies on data caches with hardware-based prefetching
    Moreno, ED
    Kofuji, ST
    Martins, CAPS
    1997 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS I AND II: ENGINEERING INNOVATION: VOYAGE OF DISCOVERY, 1997, : 720 - 723
  • [4] HARDWARE-BASED DATA COMPRESSION TECHNIQUE.
    Flores, A.V.
    1600, (27):
  • [5] ISP: Using Idle SMs in Hardware-based Prefetching
    Falahati, Hajar
    Abdi, Mania
    Baniasadi, Amirali
    Hessabi, Shaahin
    2013 17TH CSI INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND DIGITAL SYSTEMS (CADS 2013), 2013, : 3 - +
  • [6] A Knapsack Methodology for Hardware-based DMR Protection against Soft Errors in Superscalar Out-of-Order Processors
    Tonetto, Rafael Billig
    Cardoso, Douglas Maciel
    Brandalero, Marcelo
    Agostini, Luciano
    Nazar, Gabriel L.
    Azambuja, Jose Rodrigo
    Schneider Beck, Antonio Carlos
    2019 IFIP/IEEE 27TH INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2019, : 287 - 292
  • [7] HARDWARE-BASED VIRTUALIZATION EASES DESIGN WITH MULTICORE PROCESSORS
    Sathe, Satish
    EDN, 2011, 56 (02) : 28 - +
  • [8] CFCET: A hardware-based control flow checking technique in COTS processors using execution tracing
    Rajabzadeh, A
    Miremadi, SG
    MICROELECTRONICS RELIABILITY, 2006, 46 (5-6) : 959 - 972
  • [9] On the mitigation of Hardware Trojan attacks in embedded processors by exploiting a Hardware-based obfuscator
    Marcelli, Andrea
    Sanchez, Ernesto
    Sasselli, Luca
    Squillero, Giovanni
    2018 IEEE 3RD INTERNATIONAL VERIFICATION AND SECURITY WORKSHOP (IVSW), 2018, : 31 - 37
  • [10] Hardware-Based Generation of Independent Subtraces of Instructions in Clustered Processors
    Ubal, Rafael
    Sahuquillo, Julio
    Petit, Salvador
    Lopez, Pedro
    Duato, Jose
    IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (05) : 944 - 955