High-throughput Ant Colony Optimization on graphics processing units

被引:25
作者
Cecilia, Jose M. [1 ]
Llanes, Antonio [1 ]
Abellan, Jose L. [1 ]
Gomez-Luna, Juan [2 ]
Chang, Li-Wen [3 ]
Hwu, Wen-Mei W. [4 ]
机构
[1] Catholic Univ Murcia UCAM, Murcia, Spain
[2] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
[3] Microsoft AI & Res Grp, Redmond, WA USA
[4] Univ Illinois, Champaign, IL USA
关键词
Agnostic vectorization; ACO; TSP; GPUs; Atomic operations; MODEL;
D O I
10.1016/j.jpdc.2017.12.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nowadays, computer researchers can face ever-complex scientific problems by using a hardware and software co-design. One successful approach is exploring novel massively-parallel Natural-inspired algorithms, such as the Ant Colony Optimization (ACO) algorithm, through the exploitation of high-throughput accelerators such as GPUs, which are designed to provide high levels of parallelism and low Energy per instruction (EP) cost through heavy vectorization. In this paper, we demonstrate how to take advantage of contemporary hardware-based CUDA vectorization to optimize the ACO algorithm when applied to the Traveling Salesman Problem (TSP). Several parallel designs are proposed and analyzed on two different CUDA architectures. Our results reveal that our vectorization approaches can obtain good performance on these architectures. Moreover, atomic operations are under study showing good benefits on latest generations of CUDA architectures. This work lays the groundwork for future developments of ACO algorithm on high-performance platforms. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:261 / 274
页数:14
相关论文
共 53 条
[1]  
[Anonymous], GEN EV COMP C
[2]  
[Anonymous], NVIDIA NEXT GENERATI
[3]  
[Anonymous], 2010, LOGISTICS TECHNOLOGY
[4]  
[Anonymous], THESIS
[5]  
[Anonymous], 2004, ANT COLONY OPTIMIZAT
[6]  
[Anonymous], 2006, IEEE Comput. Intell. Mag., DOI [10.1109/MCI.2006.329691, DOI 10.1109/MCI.2006.329691]
[7]  
[Anonymous], P 11 ANN C GEN EV CO
[8]  
[Anonymous], COMP APPL SYST MOD I
[9]  
[Anonymous], CUDA TOOLKIT DOCUMEN
[10]  
[Anonymous], P GEN EV COMP