Parallelization of Selected Algorithms on Multi-core CPUs, a Cluster and in a Hybrid CPU plus Xeon Phi Environment

被引:2
作者
Krzywaniak, Adam [1 ]
Czarnul, Pawel [1 ]
机构
[1] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Gdansk, Poland
来源
INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I | 2018年 / 655卷
关键词
Parallel programming; Multi-core CPU; Cluster; Intel Xeon Phi; Parallelization;
D O I
10.1007/978-3-319-67220-5_27
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the paper we present parallel implementations as well as execution times and speed-ups of three different algorithms run in various environments such as on a workstation with multi-core CPUs and a cluster. The parallel codes, implementing the master-slave model in C+MPI, differ in computation to communication ratios. The considered problems include: a genetic algorithm with various ratios of master processing time to communication and fitness evaluation times, matrix multiplication and numerical integration. We present how the codes scale in the aforementioned systems. For the numerical integration code that scales very well we also show performance in a hybrid CPU+Xeon Phi environment.
引用
收藏
页码:292 / 301
页数:10
相关论文
共 13 条
[1]  
Barlas G., 2014, Multicore and GPU Programming: An Integrated Approach
[2]   Parallelization of a genetic algorithm for image restoration and its performance analysis [J].
Chen, YW ;
Nakao, ZS ;
Fang, X .
1996 IEEE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION (ICEC '96), PROCEEDINGS OF, 1996, :463-468
[3]  
Czarnul P, 2011, SCALABLE COMPUT-PRAC, V12, P227
[4]   MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems [J].
Czarnul, Pawel ;
Kuchta, Jaroslaw ;
Matuszek, Mariusz ;
Proficz, Jerzy ;
Rosciszewski, Pawel ;
Wojcik, Michal ;
Szymanski, Julian .
SIMULATION MODELLING PRACTICE AND THEORY, 2017, 77 :124-140
[5]   Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors [J].
Czarnul, Pawel .
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (05) :1091-1107
[6]   Modeling energy consumption of parallel applications [J].
Czarnul, Pawel ;
Kuchta, Jaroslaw ;
Rosciszewski, Pawel ;
Proficz, Jerzy .
PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2016, 8 :855-864
[7]   A Beowulf Cluster for Teaching and Learning [J].
Datti, Ahmad A. ;
Umar, Hadiza A. ;
Galadanci, Jamil .
PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS, 2015, 70 :62-68
[8]   Analysis of a master-slave architecture for distributed evolutionary computations [J].
Dubreuil, M ;
Gagné, C ;
Parizeau, M .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2006, 36 (01) :229-235
[9]   The Power-Performance Tradeoffs of the Intel Xeon Phi on HPC Applications [J].
Li, Bo ;
Chang, Hung-Ching ;
Song, Shuaiwen Leon ;
Su, Chun-Yi ;
Meyer, Timmy ;
Mooring, John ;
Cameron, Kirk .
PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, :1449-1457
[10]   A Hybrid Double-Layer Master-Slave Model For Multicore-Node Clusters [J].
Liu, Gang ;
Schmider, Hartmut ;
Edgecombe, Kenneth E. .
HIGH PERFORMANCE COMPUTING SYMPOSIUM 2012 (HPCS2012), 2012, 385