Parallel ant colony optimization on multi-core SIMD CPUs

被引:98
作者
Zhou, Yi [1 ,2 ]
He, Fazhi [1 ,3 ]
Hou, Neng [1 ]
Qiu, Yimin [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, State Key Lab Software Engn, Wuhan 430072, Hubei, Peoples R China
[2] Wuhan Univ Sci & Technol, Sch Informat Sci & Engn, Wuhan 430081, Hubei, Peoples R China
[3] Huazhong Univ Sci & Technol, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Hubei, Peoples R China
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2018年 / 79卷
基金
美国国家科学基金会;
关键词
Ant colony optimization; SIMD; Multi-core CPU; TSP; Performance evaluation; ALGORITHMS; TRACKING; OPENCL;
D O I
10.1016/j.future.2017.09.073
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Ant colony optimization (ACO) is a population-based metaheuristic for solving hard combinatorial optimization problems. Many studies are dedicated to accelerating ACO by parallel hardware, especially by graphics processing units (GPUs). However, due to the irregular (random) pattern of ACO algorithms in data access and control flow, the performance of GPU-based approaches is constrained by hardware limitations. CPU-based SIMD computing for ACOs is rarely investigated in previous literatures, and how well multicore-SIMD CPU-based parallel ACOs could perform remains unknown. In this paper, we present and evaluate a model of vector parallel ACO for multi-core SIMD CPU architecture. In the proposed model, each ant is mapped with a CPU core and the tour construction of each ant is accelerated by vector instructions. Furthermore, based on the model, we propose a new fitness proportionate selection approach named Vector-based Roulette Wheel (VRW) in the tour construction stage. In this approach, the fitness values are grouped into SIMD lanes and the prefix sum is computed in vector-parallel mode. The proposed algorithm is tested on standard TSP instances ranging from 198 to 4461 cities and shows a speedup factor of 57.8x compared to the single-threaded CPU counterpart. More significantly, we compare our approach with high performance GPU-based ACOs, and the results demonstrate the strong potential of CPU-based parallel ACOs. (C) 2017 The Authors. Published by Elsevier B.V.
引用
收藏
页码:473 / 487
页数:15
相关论文
共 62 条
  • [1] Fine-grain parallelism using multi-core, Cell/BE, and GPU Systems
    不详
    [J]. PARALLEL COMPUTING, 2012, 38 (08) : 365 - 390
  • [2] [Anonymous], 2004, ANT COLONY OPTIMIZAT
  • [3] [Anonymous], 2012, COMPUTER ARCHITECTUR
  • [4] [Anonymous], 2014, Intel Architecture Instruction Set Extensions Programming Reference (319433-022)
  • [5] [Anonymous], P GEN EV COMP
  • [6] [Anonymous], 1999, Swarm Intelligence
  • [7] [Anonymous], 2011, NVIDIA CUDA C PROGR
  • [8] Metaheuristics in combinatorial optimization: Overview and conceptual comparison
    Blum, C
    Roli, A
    [J]. ACM COMPUTING SURVEYS, 2003, 35 (03) : 268 - 308
  • [9] BULLNHEIMER B, 1998, APPL OPTIMIZATION, V24, P87
  • [10] Strategies for accelerating Ant Colony Optimization algorithms on Graphical Processing Units
    Catala, Alejandro
    Jaen, Javier
    Mocholi, Jose A.
    [J]. 2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, : 492 - +