Toward a Core Design to Distribute an Execution on a Manycore Processor

被引:1
|
作者
Goossens, Bernard [1 ,2 ]
Parello, David [1 ,2 ]
Porada, Katarzyna [1 ,2 ]
Rahmoune, Djallal [1 ,2 ]
机构
[1] UPVD, DALI, F-66860 Perpignan 9, France
[2] CNRS, LIRMM, UMR 5506, UM2, F-34095 Montpellier 5, France
来源
PARALLEL COMPUTING TECHNOLOGIES (PACT 2015) | 2015年 / 9251卷
关键词
Microarchitecture; Parallelism; Manycore; Automatic parallelization;
D O I
10.1007/978-3-319-21909-7_38
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a parallel execution model and a core design to run C programs in parallel. The model automatically builds parallel flows of machine instructions from the run trace. It parallelizes instruction fetch, renaming, execution and retirement. Predictor based fetch is replaced by a fetch-decode-and-partly-execute stage able to compute in-order most of the control instructions. Tomasulo's register renaming is extended to memory with a technique to match consumer/producer pairs. The Reorder Buffer is adapted to parallel retirement. A sum reduction code is used to illustrate the model and to give a short analytical evaluation of its performance potential.
引用
收藏
页码:390 / 404
页数:15
相关论文
共 50 条
  • [1] Extended Cyclostatic Dataflow Program Compilation and Execution for an Integrated Manycore Processor
    Aubry, Pascal
    Beaucamps, Pierre-Edouard
    Blanc, Frederic
    Bodin, Bruno
    Carpov, Sergiu
    Cudennec, Loic
    David, Vincent
    Dore, Philippe
    Dubrulle, Paul
    Dupont de Dinechin, Benoit
    Galea, Francois
    Goubier, Thierry
    Harrand, Michel
    Jones, Samuel
    Lesage, Jean-Denis
    Louise, Stephane
    Morey Chaisemartin, Nicolas
    Thanh Hai Nguyen
    Raynaud, Xavier
    Sirdey, Renaud
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 1624 - 1633
  • [2] Power and Energy Characterization of an Open Source 25-core Manycore Processor
    McKeown, Michael
    Lavrov, Alexey
    Shahrad, Mohammad
    Jackson, Paul J.
    Fu, Yaosheng
    Balkind, Jonathan
    Nguyen, Tri M.
    Lim, Katie
    Zhou, Yanqi
    Wentzlaff, David
    2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2018, : 762 - 775
  • [3] Exploring the Thermal Impact on Manycore Processor Performance
    Huang, Wei
    Skadron, Kevin
    Gurumurthi, Sudhanva
    Ribando, Robert J.
    Stan, Mircea R.
    26TH ANNUAL IEEE SEMICONDUCTOR THERMAL MEASUREMENT AND MANAGEMENT SYMPOSIUM, PROCEEDINGS 2010, 2010, : 191 - 197
  • [4] Homogeneous Many-core Processor System Test Distribution and Execution Mechanism
    Kamran, Arezoo
    Navabi, Zainalabedin
    2014 19TH IEEE EUROPEAN TEST SYMPOSIUM (ETS 2014), 2014,
  • [5] Co-optimizing Core Allocation, Mapping and DVFS in Streaming Programs with Moldable Tasks for Energy Efficient Execution on Manycore Architectures
    Melot, Nicolas
    Kessler, Christoph
    Eitschberger, Patrick
    Keller, Joerg
    2019 19TH INTERNATIONAL CONFERENCE ON APPLICATION OF CONCURRENCY TO SYSTEM DESIGN (ACSD 2019), 2019, : 63 - 72
  • [6] A Manycore Processor Based Multilayer Perception Feedforward Acceleration Framework for Embedded System
    Gao, Fang
    Huang, Zhangqin
    Wang, Shulong
    Ji, Xinrong
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 49 - 53
  • [7] Disjoint Out-of-Order Execution Processor
    Sharafeddine, Mageda
    Jothi, Komal
    Akkary, Haitham
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2012, 9 (03)
  • [8] Energy Efficient Stencil Computations on the Low-Power Manycore MPPA-256 Processor
    Podesta Jr, Emmanuel
    do Nascimento, Bruno Marques
    Castro, Marcio
    EURO-PAR 2018: PARALLEL PROCESSING, 2018, 11014 : 642 - 655
  • [9] A Micro-Architecture Design for the 32-bit VLIW DSP Processor Core
    Khoi-Nguyen Le-Huu
    Anh-Vu Dinh-Due
    Nguyen, Tin T.
    PROCEEDINGS OF THE 20TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC2014), 2014, : 46 - 51
  • [10] Towards highly scalable Ab Initio Molecular Dynamics (AIMD) simulations on the Intel Knights Landing manycore processor
    Jacquelin, Mathias
    De Jong, Wibe
    Bylaska, Eric
    2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 234 - 243