Toward a Core Design to Distribute an Execution on a Manycore Processor

被引:1
作者
Goossens, Bernard [1 ,2 ]
Parello, David [1 ,2 ]
Porada, Katarzyna [1 ,2 ]
Rahmoune, Djallal [1 ,2 ]
机构
[1] UPVD, DALI, F-66860 Perpignan 9, France
[2] CNRS, LIRMM, UMR 5506, UM2, F-34095 Montpellier 5, France
来源
PARALLEL COMPUTING TECHNOLOGIES (PACT 2015) | 2015年 / 9251卷
关键词
Microarchitecture; Parallelism; Manycore; Automatic parallelization;
D O I
10.1007/978-3-319-21909-7_38
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a parallel execution model and a core design to run C programs in parallel. The model automatically builds parallel flows of machine instructions from the run trace. It parallelizes instruction fetch, renaming, execution and retirement. Predictor based fetch is replaced by a fetch-decode-and-partly-execute stage able to compute in-order most of the control instructions. Tomasulo's register renaming is extended to memory with a technique to match consumer/producer pairs. The Reorder Buffer is adapted to parallel retirement. A sum reduction code is used to illustrate the model and to give a short analytical evaluation of its performance potential.
引用
收藏
页码:390 / 404
页数:15
相关论文
共 50 条
  • [31] Snitch: A Tiny Pseudo Dual-Issue Processor for Area and Energy Efficient Execution of Floating-Point Intensive Workloads
    Zaruba, Florian
    Schuiki, Fabian
    Hoefler, Torsten
    Benini, Luca
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (11) : 1845 - 1860
  • [32] A Design of a GP-GPU based Stream Processor for an Image Processing
    Lee, Kwang Yeob
    Kyung, Gyutaek
    Park, Tae Ryong
    Kwak, Jae Chang
    Koo, Yong Seo
    2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2015, : 535 - 539
  • [33] hxploration of Fine-Grain Body Bias Control in Many-Core Processor Arrays
    Baas, Bevan
    Bohnenstiehl, Brent
    Cui, Jin
    2018 IEEE SOI-3D-SUBTHRESHOLD MICROELECTRONICS TECHNOLOGY UNIFIED CONFERENCE (S3S), 2018,
  • [34] GreedyTalents: An Energy-Aware Auto-Tuning Method for Many-Core Processor
    Platt, Timothy M.
    Yang, Zhiliu
    Liu, Chen
    2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 1076 - 1083
  • [35] New system software for parallel programming models on the Intel SCC many-core processor
    Clauss, Carsten
    Lankes, Stefan
    Reble, Pablo
    Bemmerl, Thomas
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (09) : 2235 - 2259
  • [36] Display Stream Compression Decoders for Fine-Grained Many-Core Processor Arrays
    Wu, Shifu
    Baas, Bevan M.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (05) : 1730 - 1734
  • [37] Design Exploration For Next Generation High-Performance Manycore On-chip Systems: Application To big.LITTLE Architectures
    Butko, Anastasiia
    Gamatie, Abdoulaye
    Sassatelli, Gilles
    Tones, Lionel
    Robert, Michel
    2015 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, 2015, : 551 - 556
  • [38] GODSON-T: AN EFFICIENT MANY-CORE PROCESSOR EXPLORING THREAD-LEVEL PARALLELISM
    Fan, Dongrui
    Zhang, Hao
    Wang, Da
    Ye, Xiaochun
    Song, Fenglong
    Li, Guojie
    Sun, Ninghui
    IEEE MICRO, 2012, 32 (02) : 38 - 47
  • [39] Parallelization of Automotive Engine Control Software On Embedded Multi-core Processor Using OSCAR Compiler
    Kanehagi, Yohei
    Umeda, Dan
    Hayashi, Akihiro
    Kimura, Keiji
    Kasahara, Hironori
    2013 IEEE COOL CHIPS XVI (COOL CHIPS), 2013,
  • [40] Design of write merging and read prefetching buffer in DRAM controller for embedded processor
    Zhao, Chen
    Mei, Kuizhi
    Zheng, Nanning
    MICROPROCESSORS AND MICROSYSTEMS, 2014, 38 (05) : 451 - 457