Toward a Core Design to Distribute an Execution on a Manycore Processor

Cited by: 1
Authors
Goossens, Bernard [1 ,2 ]
Parello, David [1 ,2 ]
Porada, Katarzyna [1 ,2 ]
Rahmoune, Djallal [1 ,2 ]
Affiliations
[1] UPVD, DALI, F-66860 Perpignan 9, France
[2] CNRS, LIRMM, UMR 5506, UM2, F-34095 Montpellier 5, France
Source
PARALLEL COMPUTING TECHNOLOGIES (PACT 2015) | 2015 / Vol. 9251
Keywords
Microarchitecture; Parallelism; Manycore; Automatic parallelization
DOI
10.1007/978-3-319-21909-7_38
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a parallel execution model and a core design to run C programs in parallel. The model automatically builds parallel flows of machine instructions from the run trace. It parallelizes instruction fetch, renaming, execution and retirement. Predictor-based fetch is replaced by a fetch-decode-and-partly-execute stage able to compute most control instructions in order. Tomasulo's register renaming is extended to memory with a technique to match consumer/producer pairs. The Reorder Buffer is adapted to parallel retirement. A sum reduction code is used to illustrate the model and to give a short analytical evaluation of its performance potential.
Pages: 390 - 404
Number of pages: 15
Related papers
50 items in total
  • [21] The Design of a Hardware Thread Manager for a Polymorphic Multimedia Processor
    Qian, Bowen
    Li, Tao
    Yang, Ting
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 : 1792 - 1799
  • [22] A Third Generation Many-Core Processor for Secure Embedded Computing Systems
    Irza, John
    Doerr, Michael
    Solka, Michael
    2012 IEEE CONFERENCE ON HIGH PERFORMANCE EXTREME COMPUTING (HPEC), 2012,
  • [23] Analytical memory bandwidth model for many-core processor based systems
    Lee, Hyuk-Jun
    Cho, Woo-Cheol
    Chung, Eui-Young
    IEICE ELECTRONICS EXPRESS, 2012, 9 (18): : 1461 - 1466
  • [24] NoC-based Many-Core Processor Using CUSPARC Architecture
    Soliman, Muhammad R.
    Fahmy, Hossam A. H.
    Habib, S. E. -D.
    2014 26TH INTERNATIONAL CONFERENCE ON MICROELECTRONICS (ICM), 2014, : 84 - 87
  • [25] Experiences with the Sparse Matrix-Vector Multiplication on a Many-core Processor
    Pichel, Juan C.
    Rivera, Francisco F.
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 7 - 15
  • [26] Data Management: The Spirit to Pursuit Peak Performance on Many-Core Processor
    Zhou, Yongbin
    Zhang, Junchao
    Zhang, Shuai
    Yuan, Nan
    Fan, Dongrui
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, PROCEEDINGS, 2009, : 559 - 564
  • [27] A subnetting mechanism with low cost deadlock-free design for irregular topologies in NoC based manycore processors
    Han, Xing
    Fu, Yuzhuo
    Jiang, Jiang
    Wang, Chang
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 110 - 114
  • [28] An Efficient Task-Based Execution Model for Stochastic Linear Solver on Multi-Core and Many-Core Systems
    Ye, Fan
    Calvin, Christophe
    Petiton, Serge
    2015 IEEE 18TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2015, : 200 - 207
  • [29] Toward Efficient Execution of Mainstream Deep Learning Frameworks on Mobile Devices: Architectural Implications
    Dai, Yuting
    Zhang, Rui
    Xue, Rui
    Liu, Benyong
    Li, Tao
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2021, 40 (03) : 453 - 466
  • [30] Counters based performance analysis and optimization of an out-of-order superscalar processor core
    Sun C.
    Sui B.
    Wang L.
    Wang Y.
    Huang L.
    Li W.
    Wang J.
    1600, National University of Defense Technology (38): : 14 - 19