Instruction Criticality Based Energy-Efficient Hardware Data Prefetching

被引:5
|
作者
Kalani, Neelu Shivprakash [1 ]
Panda, Biswabandan [2 ]
机构
[1] Ecole Polytech Fed Lausanne, CH-1015 Lausanne, Switzerland
[2] Indian Inst Technol, Mumbai 400076, Maharashtra, India
关键词
Prefetching; IP networks; Benchmark testing; Energy consumption; Memory management; Detectors; Measurement; Cache memory; microarchitecture; POWER;
D O I
10.1109/LCA.2021.3117005
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Hardware data prefetching is a latency hiding technique that mitigates the memory wall problem by fetching data blocks into caches before the processor demands them. For high performing state-of-the-art data prefetchers, this increases dynamic and static energy in memory hierarchy, due to increase in number of requests. A trivial way to improve energy-efficiency of hardware prefetchers is to prefetch instructions on the critical path of execution. As criticality-based data prefetching does not degrade performance significantly; this is an ideal approach to solve the energy-efficiency problem. We discuss limitations of existing critical instruction detection techniques and propose a new technique that uses re-order buffer occupancy as a metric to detect critical instructions and performs prefetcher-specific threshold tuning. With our detector, we achieve maximum memory hierarchy energy savings of 12.3% with 1.4% higher performance, for PPF, and average as follows: (i) SPEC CPU 2017 benchmarks: 2.04% lower energy, 0.3% lower performance, for IPCP at L1D, (ii) client/server benchmarks: 4.7% lower energy, 0.15% lower performance, for PPF, (iii) Cloudsuite benchmarks: 2.99% lower energy, 0.36% higher performance, for IPCP at L1D. IPCP and PPF are state-of-the-art data prefetchers.
引用
收藏
页码:146 / 149
页数:4
相关论文
共 50 条
  • [41] Energy-Efficient Hardware Acceleration through Computing in the Memory
    Paul, Somnath
    Karam, Robert
    Bhunia, Swarup
    Puri, Ruchir
    2014 DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION (DATE), 2014,
  • [42] Execution Time Prediction for Energy-Efficient Hardware Accelerators
    Chen, Tao
    Rucker, Alexander
    Suh, G. Edward
    PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48), 2015, : 457 - 469
  • [43] Energy-Efficient Design for Massive MIMO With Hardware Impairments
    Liu, Zhihui
    Lee, Chia-Han
    Xu, Wenjun
    Li, Shengyu
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (02) : 843 - 857
  • [44] Protean: Resource-efficient Instruction Prefetching
    Hassan, Muhammad
    Park, Chang Hyun
    Black-Schaffer, David
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, MEMSYS 2023, 2023,
  • [45] Eco-Storage: A Hybrid Storage System with Energy-Efficient Informed Prefetching
    Al Assaf, Maen M.
    Jiang, Xunfei
    Riduan Abid, Mohamed
    Qin, Xiao
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2013, 72 (03): : 165 - 180
  • [46] Eco-Storage: A Hybrid Storage System with Energy-Efficient Informed Prefetching
    Maen M. Al Assaf
    Xunfei Jiang
    Mohamed Riduan Abid
    Xiao Qin
    Journal of Signal Processing Systems, 2013, 72 : 165 - 180
  • [47] CLAP: Clustered Look-Ahead Prefetching for Energy-Efficient DRAM System
    Lee, Yebin
    Kim, Soontae
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2016, 24 (05) : 1770 - 1782
  • [48] Tolerating medium latencies on data caches with hardware-based prefetching
    Moreno, ED
    Kofuji, ST
    Martins, CAPS
    1997 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS I AND II: ENGINEERING INNOVATION: VOYAGE OF DISCOVERY, 1997, : 720 - 723
  • [49] Tango: A hardware-based data prefetching technique for superscalar processors
    Pinter, SS
    Yoaz, A
    PROCEEDINGS OF THE 29TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE - MICRO-29, 1996, : 214 - 225
  • [50] Energy-Efficient Data Gathering and Aggregation Scheme Based on Correlation
    Ye, Jingchuan
    2010 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS 1-3, 2010, : 1627 - 1630