MANY-TASK COMPUTING ON MANY-CORE ARCHITECTURES

被引：0

作者：

Valero-Lara, Pedro ^{[1
,2
]}

Nookala, Poornima ^{[3
]}

Pelayo, Fernando L. ^{[4
]}

Jansson, Johan ^{[2
,5
]}

Dimitropoulos, Serapheim ^{[3
]}

Raicu, Ioan ^{[3
]}

机构：

[1] Univ Manchester, Manchester M13 9PL, Lancs, England

[2] BCAM, Bilbao, Spain

[3] IIT, Chicago, IL 60616 USA

[4] UCLM, Albacete, Spain

[5] KTH Royal Inst Technol, Stockholm, Sweden

来源：

SCALABLE COMPUTING-PRACTICE AND EXPERIENCE | 2016年 / 17卷 / 01期

关键词：

Parallel Computing; Multi-Task Computing; Many-Core; GPU; Intel Xeon Phi; CUDA; OpenMP;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as cluster, grids, cloud and supercomputers, but it is not so popular in shared memory parallel processors. In this sense and given the spectacular growth in performance and in number of cores integrated in many-core architectures, the study of MTC on such architectures is becoming more and more relevant. In this paper, authors present what are those programming mechanisms to take advantages of such massively parallel features for the particular target of MTC. Also, the hardware features of the two dominant many-core platforms (NVIDIA's GPUs and Intel Xeon Phi) are also analyzed for our specific framework. Given the important differences in terms of hardware and software in our two many-core platforms, we have considered different strategies based on CUDA (for GPUs) and OpenMP (for Intel Xeon Phi). We carried out several test cases based on an appropriate and widely studied problem for benchmarking as matrix multiplication. Essentially, this study consisted of comparing the time consumed for computing in parallel several tasks one by one (the whole computational resources are used just to compute one task at a time) with the time consumed for computing in parallel the same set of tasks simultaneously (the whole computational resources are used for computing the set of tasks at very same time). Finally, we compared both software-hardware scenarios to identify the most relevant computer features in each of our many-core architectures.

引用

页码：33 / 46

页数：14

共 50 条

[1] Multi-core versus many-core computing for many-task Branch-and-Bound applied to big optimization problems
Melab, N.
Gmys, J.
Mezmaz, M.
Tuyttens, D.
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 82 : 472 - 481
[2] The Course of "Parallel Computing" in the Many-core Era
Wan Han
Gao Xiaopeng
Li Yi
SOCIAL SCIENCE AND EDUCATION, 2013, 10 : 455 - +
[3] Performance Evaluation of OpenFOAM on Many-Core Architectures
Brzobohaty, Tomas
Riha, Lubomir
Karasek, Tomas
Kozubek, Tomas
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2014 (ICNAAM-2014), 2015, 1648
[4] A Semantic Model for Many-Core Parallel Computing
Zhang, Nan
Duan, Zhenhua
COMBINATORIAL OPTIMIZATION AND APPLICATIONS, 2011, 6831 : 464 - 479
[5] Adapting Particle Filter Algorithms to Many-Core Architectures
Chitchian, Mehdi
van Amesfoort, Alexander S.
Simonetto, Andrea
Keviczky, Tamas
Sips, Henk J.
IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 427 - 438
[6] Cashmere: Heterogeneous Many-Core Computing
Hijma, Pieter
Jacobs, Ceriel J. H.
van Nieuwpoort, Rob V.
Bal, Henri E.
2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 135 - 145
[7] Graph Reachability on Parallel Many-Core Architectures
Quer, Stefano
Calabrese, Andrea
COMPUTATION, 2020, 8 (04) : 1 - 26
[8] A Power Modelling Approach for Many-core Architectures
Lai, Zhiquan
Lam, King Tin
Wang, Cho-Li
Su, Jinshu
2014 10TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2014, : 128 - 132
[9] Fast Convolution Operations on Many-Core Architectures
Li, Shigang
Zhang, Yunquan
Xiang, Chunyang
Shi, Lei
2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 316 - 323
[10] Strategies to parallelize a finite element mesh truncation technique on multi-core and many-core architectures
Jose M. Badia
Adrian Amor-Martin
Jose A. Belloch
Luis Emilio Garcia-Castillo
The Journal of Supercomputing, 2023, 79 : 7648 - 7664

← 1 2 3 4 5 →