Tango: A hardware-based data prefetching technique for superscalar processors

被引:7
作者
Pinter, SS
Yoaz, A
机构
来源
PROCEEDINGS OF THE 29TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE - MICRO-29 | 1996年
关键词
D O I
10.1109/MICRO.1996.566463
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present a new hardware-based data prefetching mechanism for enhancing instruction level parallelism and improving the performance of superscalar processors. The emphasis in our scheme is on the effective utilization of slack time and hardware resources not used for the main computation. The scheme suggests a new hardware construct, the Program Progress Graph (PPG), as a simple extension to the Branch Target Buffer (BTB). We use the PPG for implementing a fast pre-program counter pre-PC, that travels only through memory reference instructions (rather than scanning all the instructions sequentially). In a single clock cycle the pre-PC extracts all the predicted memory references in some future block of instructions, to obtain early data prefetching. In addition, the PPG can be used for implementing a pre-processor and for instruction prefetching. The prefetch requests are scheduled to ''tango'' with the core requests from the data cache, by using only free time slots on the existing data cache tag ports. Employing special methods for removing prefetch requests that are already in the cache (without utilizing the cache-tag ports bandwidth) and a simple optimization on the cache LRU mechanism reduce the number of prefetch requests sent to the core-cache bus and to the memory (second level) bus. Simulation results an the SPEC92 benchmark for the base line architecture (32K-byte data cache and 12 cycles fetch latency) show an average speedup of 1.36 (CPI ratio).
引用
收藏
页码:214 / 225
页数:12
相关论文
共 50 条
[41]   Hardware-based Cyber Threats [J].
Alves, Thiago ;
Morris, Thomas .
ICISSP: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2018, :259-266
[42]   Instruction Criticality Based Energy-Efficient Hardware Data Prefetching [J].
Kalani, Neelu Shivprakash ;
Panda, Biswabandan .
IEEE COMPUTER ARCHITECTURE LETTERS, 2021, 20 (02) :146-149
[43]   Secured Data Collection With Hardware-Based Ciphers for IoT-Based Healthcare [J].
Tao, Hai ;
Bhuiyan, Md Zakirul Alam ;
Abdalla, Ahmed N. ;
Hassan, Mohammad Mehedi ;
Zain, Jasni Mohamad ;
Hayajneh, Thaier .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (01) :410-420
[44]   Energy-Efficient Hardware Data Prefetching [J].
Guo, Yao ;
Narayanan, Pritish ;
Bennaser, Mahmoud Abdullah ;
Chheda, Saurabh ;
Moritz, Csaba Andras .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2011, 19 (02) :250-263
[45]   Performance characterization of a data mining application via hardware-based monitoring [J].
Thoennes, M ;
Weems, C .
COMMERCIAL APPLICATIONS FOR HIGH-PERFORMANCE COMPUTING, 2001, 4528 :109-117
[46]   Hardware-based Thermal-Aware Register-Renaming Technique in Microprocessors [J].
Kim, Jungwook ;
Jhang, Seong Tae .
INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2010, 13 (02) :389-407
[47]   Effective hardware-based two-way loop cache for high performance low power processors [J].
Anderson, T ;
Agarwala, S .
2000 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS & PROCESSORS, PROCEEDINGS, 2000, :403-407
[48]   Unveiling Hardware-based Data Prefetcher, a Hidden Source of Information Leakage [J].
Shin, Youngjoo ;
Kim, Hyung Chan ;
Kwon, Dokeun ;
Jeong, Ji Hoon ;
Hur, Junbeom .
PROCEEDINGS OF THE 2018 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (CCS'18), 2018, :131-145
[49]   Future execution: A hardware prefetching technique for chip multiprocessors [J].
Ganusov, I ;
Burtscher, M .
PACT 2005: 14TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2005, :350-360
[50]   Tolerating Memory Latency Using a Hardware-based Active-pushing Technique [J].
Shi, Liwen ;
Fan, Xiaoya ;
Chen, Jie ;
Huang, Xiaoping ;
Tian, Hangpei .
2009 INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, PROCEEDINGS, 2009, :407-411