MANY-TASK COMPUTING ON MANY-CORE ARCHITECTURES

被引:0
|
作者
Valero-Lara, Pedro [1 ,2 ]
Nookala, Poornima [3 ]
Pelayo, Fernando L. [4 ]
Jansson, Johan [2 ,5 ]
Dimitropoulos, Serapheim [3 ]
Raicu, Ioan [3 ]
机构
[1] Univ Manchester, Manchester M13 9PL, Lancs, England
[2] BCAM, Bilbao, Spain
[3] IIT, Chicago, IL 60616 USA
[4] UCLM, Albacete, Spain
[5] KTH Royal Inst Technol, Stockholm, Sweden
来源
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE | 2016年 / 17卷 / 01期
关键词
Parallel Computing; Multi-Task Computing; Many-Core; GPU; Intel Xeon Phi; CUDA; OpenMP;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as cluster, grids, cloud and supercomputers, but it is not so popular in shared memory parallel processors. In this sense and given the spectacular growth in performance and in number of cores integrated in many-core architectures, the study of MTC on such architectures is becoming more and more relevant. In this paper, authors present what are those programming mechanisms to take advantages of such massively parallel features for the particular target of MTC. Also, the hardware features of the two dominant many-core platforms (NVIDIA's GPUs and Intel Xeon Phi) are also analyzed for our specific framework. Given the important differences in terms of hardware and software in our two many-core platforms, we have considered different strategies based on CUDA (for GPUs) and OpenMP (for Intel Xeon Phi). We carried out several test cases based on an appropriate and widely studied problem for benchmarking as matrix multiplication. Essentially, this study consisted of comparing the time consumed for computing in parallel several tasks one by one (the whole computational resources are used just to compute one task at a time) with the time consumed for computing in parallel the same set of tasks simultaneously (the whole computational resources are used for computing the set of tasks at very same time). Finally, we compared both software-hardware scenarios to identify the most relevant computer features in each of our many-core architectures.
引用
收藏
页码:33 / 46
页数:14
相关论文
共 50 条
  • [1] Multi-core versus many-core computing for many-task Branch-and-Bound applied to big optimization problems
    Melab, N.
    Gmys, J.
    Mezmaz, M.
    Tuyttens, D.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 82 : 472 - 481
  • [2] The Course of "Parallel Computing" in the Many-core Era
    Wan Han
    Gao Xiaopeng
    Li Yi
    SOCIAL SCIENCE AND EDUCATION, 2013, 10 : 455 - +
  • [3] Performance Evaluation of OpenFOAM on Many-Core Architectures
    Brzobohaty, Tomas
    Riha, Lubomir
    Karasek, Tomas
    Kozubek, Tomas
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2014 (ICNAAM-2014), 2015, 1648
  • [4] A Semantic Model for Many-Core Parallel Computing
    Zhang, Nan
    Duan, Zhenhua
    COMBINATORIAL OPTIMIZATION AND APPLICATIONS, 2011, 6831 : 464 - 479
  • [5] Adapting Particle Filter Algorithms to Many-Core Architectures
    Chitchian, Mehdi
    van Amesfoort, Alexander S.
    Simonetto, Andrea
    Keviczky, Tamas
    Sips, Henk J.
    IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 427 - 438
  • [6] Cashmere: Heterogeneous Many-Core Computing
    Hijma, Pieter
    Jacobs, Ceriel J. H.
    van Nieuwpoort, Rob V.
    Bal, Henri E.
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 135 - 145
  • [7] Graph Reachability on Parallel Many-Core Architectures
    Quer, Stefano
    Calabrese, Andrea
    COMPUTATION, 2020, 8 (04) : 1 - 26
  • [8] A Power Modelling Approach for Many-core Architectures
    Lai, Zhiquan
    Lam, King Tin
    Wang, Cho-Li
    Su, Jinshu
    2014 10TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2014, : 128 - 132
  • [9] Fast Convolution Operations on Many-Core Architectures
    Li, Shigang
    Zhang, Yunquan
    Xiang, Chunyang
    Shi, Lei
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 316 - 323
  • [10] Strategies to parallelize a finite element mesh truncation technique on multi-core and many-core architectures
    Jose M. Badia
    Adrian Amor-Martin
    Jose A. Belloch
    Luis Emilio Garcia-Castillo
    The Journal of Supercomputing, 2023, 79 : 7648 - 7664