Unrolling shape for out-of-order processors.

被引:0
|
作者
Sato, H [1 ]
机构
[1] Univ Tokyo, Ctr Informat Technol, Tokyo, Japan
来源
INNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS | 2003年
关键词
D O I
10.1109/IWIA.2003.1262786
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Loop unrolling is today one of the most effective optimizations for modern architectures. To give an analytical model for loop unrolling performance, unrolling shape was proposed. It was applied to in-order processors, and was proved to give an accurate performance model for loop unrolling in term of software pipelining and cache miss alleviation. In this paper, we apply unrolling shape to out-of-order processors. A scheme for calculating PLOOO, pipelining terms of an unrolled loop by factor l are presented as PLOOO(l) = {(Nins(l)/F + NOccpy(l))}/l, where Nins(l) is the number of instructions in an unrolled loop by factor l, F the fetch rate of the architecture, NOccpy(l) the number of store instructions scheduled after Nins(l)/F-th cycle. A pipelining term for in-order processors is essential for calculating NOccpy(l). It is to be noted that the scheme for out-of-order processors uses unrolling shape for in-order processors. Experiments show that our scheme is precise in calculating the behaviour of loop unroling on out-of-order processors. We show that our scheme quantitatively shows the effect of loop unrolling as the one of infinitely unrolled loops on in-order processors. Furthermore, we reveal that the old folklore that the loop unrolling reduces the loop overhead has revived on out-of-order processors as a performance improvement factor as d/dlPL(OOO)(1).
引用
收藏
页码:88 / 97
页数:10
相关论文
共 50 条
  • [41] Raft with Out-of-order Executions
    Gu X.-S.
    Wei H.-F.
    Qiao L.
    Huang Y.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (06): : 1748 - 1778
  • [42] Out-of-order vector architectures
    Espasa, R
    Valero, M
    Smith, JE
    THIRTIETH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS, 1997, : 160 - 170
  • [43] Efficient Methods for Out-of-Order Load/Store Execution for High-Performance Soft Processors
    Wong, Henry
    Betz, Vaughn
    Rose, Jonathan
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2013, : 442 - 445
  • [44] Debugging and Optimizing High Performance Superscalar Out-of-Order Processors Using Formal Verification Techniques
    Alizadeh, Bijan
    Fujita, Masahiro
    2011 12TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED), 2011, : 297 - 302
  • [45] Accurate Statistical Performance Modeling and Validation of Out-of-Order Processors Using Monte Carlo Methods
    Alkohlani, Waleed
    Cook, Jeanine
    Cook, Jonathan
    2014 IEEE INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2014,
  • [46] An artificial neural network model of LRU-cache misses on out-of-order embedded processors
    Ji, Kecheng
    Ling, Ming
    Zhang, Yang
    Shi, Longxing
    MICROPROCESSORS AND MICROSYSTEMS, 2017, 50 : 66 - 79
  • [47] Fast precise interrupt handling without associative searching in multiple out-of-order issue processors
    Nam, SJ
    Park, IC
    Kyung, CM
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1999, E82D (03) : 645 - 653
  • [48] An Analytical Cache Performance Evaluation Framework for Embedded Out-of-Order Processors Using Software Characteristics
    Ji, Kecheng
    Ling, Ming
    Shi, Longxing
    Pan, Jianping
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (04)
  • [49] LOW POWER TRI-STATE REGISTER FILES DESIGN FOR MODERN OUT-OF-ORDER PROCESSORS
    Gong, Na
    Tang, Geng
    Wang, Jinhui
    Sridhar, Ramalingam
    2011 IEEE INTERNATIONAL SOC CONFERENCE (SOCC), 2011, : 323 - 328
  • [50] Architectural-Space Exploration of Heterogeneous Reliability and Checkpointing Modes for Out-of-Order Superscalar Processors
    Prabakaran, Bharath Srinivas
    Dave, Mihika
    Kriebel, Florian
    Rehman, Semeen
    Shafique, Muhammad
    IEEE ACCESS, 2019, 7 : 145324 - 145339