Unrolling shape for out-of-order processors.

被引:0
|
作者
Sato, H [1 ]
机构
[1] Univ Tokyo, Ctr Informat Technol, Tokyo, Japan
来源
INNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS | 2003年
关键词
D O I
10.1109/IWIA.2003.1262786
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Loop unrolling is today one of the most effective optimizations for modern architectures. To give an analytical model for loop unrolling performance, unrolling shape was proposed. It was applied to in-order processors, and was proved to give an accurate performance model for loop unrolling in term of software pipelining and cache miss alleviation. In this paper, we apply unrolling shape to out-of-order processors. A scheme for calculating PLOOO, pipelining terms of an unrolled loop by factor l are presented as PLOOO(l) = {(Nins(l)/F + NOccpy(l))}/l, where Nins(l) is the number of instructions in an unrolled loop by factor l, F the fetch rate of the architecture, NOccpy(l) the number of store instructions scheduled after Nins(l)/F-th cycle. A pipelining term for in-order processors is essential for calculating NOccpy(l). It is to be noted that the scheme for out-of-order processors uses unrolling shape for in-order processors. Experiments show that our scheme is precise in calculating the behaviour of loop unroling on out-of-order processors. We show that our scheme quantitatively shows the effect of loop unrolling as the one of infinitely unrolled loops on in-order processors. Furthermore, we reveal that the old folklore that the loop unrolling reduces the loop overhead has revived on out-of-order processors as a performance improvement factor as d/dlPL(OOO)(1).
引用
收藏
页码:88 / 97
页数:10
相关论文
共 50 条
  • [1] Out-of-order commit processors
    Cristal, A
    Ortega, D
    Llosa, J
    Valero, M
    10TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2004, : 48 - 59
  • [2] Modeling out-of-order processors for WCET analysis
    Li, Xianfeng
    Roychoudhury, Abhik
    Mitra, Tulika
    REAL-TIME SYSTEMS, 2006, 34 (03) : 195 - 227
  • [3] Direct instruction wakeup for out-of-order processors
    Ramírez, MA
    Cristal, A
    Veidenbaum, AV
    Villa, L
    Valero, M
    INNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS, PROCEEDINGS, 2004, : 2 - 9
  • [4] INTERRUPT HANDLING FOR OUT-OF-ORDER EXECUTION PROCESSORS
    TORNG, HC
    DAY, M
    IEEE TRANSACTIONS ON COMPUTERS, 1993, 42 (01) : 122 - 127
  • [5] Modeling out-of-order processors for WCET analysis
    Xianfeng Li
    Abhik Roychoudhury
    Tulika Mitra
    Real-Time Systems, 2006, 34 : 195 - 227
  • [6] A Novel Register Renaming Technique for Out-of-Order Processors
    Tabani, Hamid
    Arnau, Jose-Maria
    Tubella, Jordi
    Gonzalez, Antonio
    2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2018, : 259 - 270
  • [7] Modeling out-of-order processors for software timing analysis
    Li, XF
    Roychoudhury, A
    Mitra, T
    25TH IEEE INTERNATIONAL REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2004, : 92 - 103
  • [8] A Mechanistic Performance Model for Superscalar Out-of-Order Processors
    Eyerman, Stijn
    Eeckhout, Lieven
    Karkhanis, Tejas
    Smith, James E.
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2009, 27 (02):
  • [9] Dynamic MIPS Rate Stabilization in Out-of-Order Processors
    Suh, Jinho
    Dubois, Michel
    ISCA 2009: 36TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2009, : 46 - 56
  • [10] Improving branch prediction and predicated execution in out-of-order processors
    Quinones, Eduardo
    Parcerisa, Joan-Manuel
    Gonzalez, Antonio
    THIRTEENTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2007, : 75 - +