Evaluation of parallel particle swarm optimization algorithms within the CUDA™ architecture

Cited by: 103
Authors
Mussi, Luca [1]
Daolio, Fabio [2]
Cagnoni, Stefano [1]
Affiliations
[1] Univ Parma, Dept Informat Engn, I-43124 Parma, Italy
[2] Univ Lausanne, HEC Informat Syst Inst, CH-1015 Lausanne, Switzerland
Keywords
Particle swarm optimization; Parallel computing; GPUs; nVIDIA CUDA™
DOI
10.1016/j.ins.2010.08.045
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Particle swarm optimization (PSO), like other population-based meta-heuristics, is intrinsically parallel and can be effectively implemented on Graphics Processing Units (GPUs), which are, in fact, massively parallel processing architectures. In this paper we discuss possible approaches to parallelizing PSO on graphics hardware within the Compute Unified Device Architecture (CUDA™), a GPU programming environment by nVIDIA™ which supports the company's latest cards. In particular, two different ways of exploiting GPU parallelism are explored and evaluated. The execution speed of the two parallel algorithms is compared, on functions which are typically used as benchmarks for PSO, with a standard sequential implementation of PSO (SPSO), as well as with recently published results of other parallel implementations. An in-depth study of the computation efficiency of our parallel algorithms is carried out by assessing speed-up and scale-up with respect to SPSO. Also reported are some results about the optimization effectiveness of the parallel implementations with respect to SPSO, in cases when the parallel versions introduce some possibly significant difference with respect to the sequential version. © 2010 Elsevier Inc. All rights reserved.
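To illustrate why the abstract calls PSO intrinsically parallel, the following is a minimal sequential sketch in Python. It is not the paper's SPSO or either of its CUDA algorithms. It assumes a synchronous global-best swarm, the sphere function as a stand-in benchmark, and commonly used constriction-style coefficients (`w`, `c1`, `c2`). The function names `sphere` and `pso` are hypothetical. Within one iteration, each particle's update reads only its own state and the swarm best from the previous iteration, so the inner loop over particles is the part that could in principle be distributed across GPU threads, followed by a reduction to find the new swarm best.

```python
import random


def sphere(x):
    """Stand-in benchmark (assumption): sum of squares, minimum 0 at the origin."""
    return sum(v * v for v in x)


def pso(f, dim=2, n_particles=16, iters=100, lo=-5.0, hi=5.0, seed=0):
    """Synchronous global-best PSO; returns (best position, best value, initial best value)."""
    rng = random.Random(seed)
    # Assumed coefficients (common constriction-style values, not taken from the paper).
    w, c1, c2 = 0.729, 1.49445, 1.49445

    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [f(p) for p in pos]

    g = min(range(n_particles), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]
    initial_best = gbest_val

    for _ in range(iters):
        # Phase 1: per-particle work. Iterations of this loop do not depend on
        # each other, because gbest is only refreshed in phase 2.
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            val = f(pos[i])
            if val < pbest_val[i]:
                pbest_val[i] = val
                pbest[i] = pos[i][:]

        # Phase 2: reduction over personal bests to update the swarm best.
        g = min(range(n_particles), key=pbest_val.__getitem__)
        if pbest_val[g] < gbest_val:
            gbest, gbest_val = pbest[g][:], pbest_val[g]

    return gbest, gbest_val, initial_best


if __name__ == "__main__":
    best, val, init = pso(sphere)
    print("initial best:", init)
    print("final best:", val, "at", best)
```

Splitting each iteration into an independent per-particle phase and a reduction phase is one design choice among several. Asynchronous variants, or variants with a different neighbourhood topology, would change where synchronization is needed.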
Pages: 4642-4657
Page count: 16