Unrolling shape for out-of-order processors.

Authors
Sato, H [1]
Affiliation
[1] Univ Tokyo, Ctr Informat Technol, Tokyo, Japan
Source
INNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS | 2003
DOI
10.1109/IWIA.2003.1262786
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Loop unrolling is today one of the most effective optimizations for modern architectures. Unrolling shape was proposed to give an analytical model of loop unrolling performance. It was applied to in-order processors and was shown to give an accurate performance model for loop unrolling in terms of software pipelining and cache miss alleviation. In this paper, we apply unrolling shape to out-of-order processors. A scheme for calculating PL_OOO, the pipelining term of a loop unrolled by factor l, is presented as PL_OOO(l) = (N_ins(l)/F + N_Occpy(l)) / l, where N_ins(l) is the number of instructions in a loop unrolled by factor l, F is the fetch rate of the architecture, and N_Occpy(l) is the number of store instructions scheduled after cycle N_ins(l)/F. The pipelining term for in-order processors is essential for calculating N_Occpy(l); that is, the scheme for out-of-order processors relies on unrolling shape for in-order processors. Experiments show that our scheme precisely predicts the behaviour of loop unrolling on out-of-order processors. We show that our scheme quantitatively characterizes the effect of loop unrolling as that of an infinitely unrolled loop on in-order processors. Furthermore, we reveal that the old folklore that loop unrolling reduces loop overhead is revived on out-of-order processors as a performance improvement factor, expressed as d/dl PL_OOO(1).
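The pipelining-term formula quoted in the abstract can be evaluated directly. The following is a minimal Python sketch, not the paper's implementation: the function name `pl_ooo` and all numeric inputs are hypothetical, and N_Occpy(l) is taken as a given input here, whereas the paper derives it from the in-order pipelining term.

```python
def pl_ooo(n_ins: int, fetch_rate: float, n_occpy: int, l: int) -> float:
    """Evaluate PL_OOO(l) = (N_ins(l)/F + N_Occpy(l)) / l.

    n_ins      -- N_ins(l): instructions in the loop unrolled by factor l
    fetch_rate -- F: instructions fetched per cycle
    n_occpy    -- N_Occpy(l): stores scheduled after cycle N_ins(l)/F
                  (assumed known; the paper computes it from the
                  in-order unrolling shape)
    l          -- unrolling factor
    """
    if l < 1:
        raise ValueError("unrolling factor l must be at least 1")
    if fetch_rate <= 0:
        raise ValueError("fetch rate F must be positive")
    return (n_ins / fetch_rate + n_occpy) / l


# Hypothetical loop: 6 body instructions per iteration plus 2 overhead
# instructions, so N_ins(l) = 6*l + 2; F = 4; N_Occpy fixed at 1.
cost_l1 = pl_ooo(n_ins=8, fetch_rate=4, n_occpy=1, l=1)   # (8/4 + 1) / 1
cost_l4 = pl_ooo(n_ins=26, fetch_rate=4, n_occpy=1, l=4)  # (26/4 + 1) / 4
print(cost_l1, cost_l4)
```

With these made-up inputs the per-iteration term falls as l grows, because the fixed overhead instructions and the trailing stores are amortized over more iterations.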
Pages: 88-97
Page count: 10