MANY-TASK COMPUTING ON MANY-CORE ARCHITECTURES

被引:0
作者
Valero-Lara, Pedro [1 ,2 ]
Nookala, Poornima [3 ]
Pelayo, Fernando L. [4 ]
Jansson, Johan [2 ,5 ]
Dimitropoulos, Serapheim [3 ]
Raicu, Ioan [3 ]
机构
[1] Univ Manchester, Manchester M13 9PL, Lancs, England
[2] BCAM, Bilbao, Spain
[3] IIT, Chicago, IL 60616 USA
[4] UCLM, Albacete, Spain
[5] KTH Royal Inst Technol, Stockholm, Sweden
来源
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE | 2016年 / 17卷 / 01期
关键词
Parallel Computing; Multi-Task Computing; Many-Core; GPU; Intel Xeon Phi; CUDA; OpenMP;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as cluster, grids, cloud and supercomputers, but it is not so popular in shared memory parallel processors. In this sense and given the spectacular growth in performance and in number of cores integrated in many-core architectures, the study of MTC on such architectures is becoming more and more relevant. In this paper, authors present what are those programming mechanisms to take advantages of such massively parallel features for the particular target of MTC. Also, the hardware features of the two dominant many-core platforms (NVIDIA's GPUs and Intel Xeon Phi) are also analyzed for our specific framework. Given the important differences in terms of hardware and software in our two many-core platforms, we have considered different strategies based on CUDA (for GPUs) and OpenMP (for Intel Xeon Phi). We carried out several test cases based on an appropriate and widely studied problem for benchmarking as matrix multiplication. Essentially, this study consisted of comparing the time consumed for computing in parallel several tasks one by one (the whole computational resources are used just to compute one task at a time) with the time consumed for computing in parallel the same set of tasks simultaneously (the whole computational resources are used for computing the set of tasks at very same time). Finally, we compared both software-hardware scenarios to identify the most relevant computer features in each of our many-core architectures.
引用
收藏
页码:33 / 46
页数:14
相关论文
共 50 条
  • [21] Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures
    Zhang, Peng
    Fang, Jianbin
    Yang, Canqun
    Huang, Chun
    Tang, Tao
    Wang, Zheng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (08) : 1878 - 1896
  • [22] Hybrid Coarrays: a PGAS Feature for Many-Core Architectures
    Cardellini, Valeria
    Fanfarillo, Alessandro
    Filippone, Salvatore
    Rouson, Damian
    PARALLEL COMPUTING: ON THE ROAD TO EXASCALE, 2016, 27 : 175 - 184
  • [23] Parallel Implementations of the Cooperative Particle Swarm Optimization on Many-core and Multi-core Architectures
    Nadia Nedjah
    Rogério de M. Calazan
    Luiza de Macedo Mourelle
    Chao Wang
    International Journal of Parallel Programming, 2016, 44 : 1173 - 1199
  • [24] Architectural Support for Cilk Computations on Many-core Architectures
    Long, Guoping
    Fan, Dongrui
    Zhang, Junchao
    ACM SIGPLAN NOTICES, 2009, 44 (04) : 285 - 286
  • [25] Revision of Relational Joins for Multi-Core and Many-Core Architectures
    Krulis, Martin
    Yaghob, Jakub
    DATESO 2011: DATABASES, TEXTS, SPECIFICATIONS, OBJECTS, 2011, 706 : 229 - 240
  • [26] Fault Tolerant Task Mapping on Many-Core Arrays
    Bonney, Colin
    Campos, Pedro
    Dahir, Nizar
    Tempesti, Gianluca
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [27] Designing Applications for Heterogeneous Many-Core Architectures with the FlexTiles Platform
    Janssen, Benedikt
    Schwiegelshohn, Fynn
    Koedam, Martijn
    Duhem, Francois
    Masing, Leonard
    Werner, Stephan
    Huriaux, Christophe
    Courtay, Antoine
    Wheatley, Emilie
    Goossens, Kees
    Lemonnier, Fabrice
    Millet, Philippe
    Becker, Juergen
    Sentieys, Olivier
    Huebner, Michael
    PROCEEDINGS INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS - ARCHITECTURES, MODELING AND SIMULATION (SAMOS XV), 2015, : 254 - 261
  • [28] A Simple Compressive Sensing Algorithm for Parallel Many-Core Architectures
    Borghi, Alexandre
    Darbon, Jerome
    Peyronnet, Sylvain
    Chan, Tony F.
    Osher, Stanley
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2013, 71 (01): : 1 - 20
  • [29] Optimization of Selected Remote Sensing Algorithms for Many-Core Architectures
    Riha, Lubomir
    Le Moigne, Jacqueline
    El-Ghazawi, Tarek
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (12) : 5576 - 5587
  • [30] Efficient DVFS to Prevent Hard Faults for Many-Core Architectures
    Lai, Zhiquan
    Zhao, Baokang
    Su, Jinshu
    INFORMATION AND COMMUNICATION TECHNOLOGY, 2014, 8407 : 674 - 679