Simulating cortical networks on heterogeneous multi-GPU systems

Citations: 7

Authors:

Nere, Andrew [1]

Franey, Sean [1]

Hashmi, Atif [1]

Lipasti, Mikko [1]

Affiliations:

[1] Univ Wisconsin, Dept Elect & Comp Engn, Madison, WI 53706 USA

Source:

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING | 2013 / Vol. 73 / Issue 07

Funding:

National Science Foundation (USA);

Keywords:

Cortical learning algorithms; CUDA; GPGPU; Profiling systems; RECEPTIVE-FIELDS; FUNCTIONAL ARCHITECTURE; MODEL;

DOI:

10.1016/j.jpdc.2012.02.006

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

Recent advances in neuroscientific understanding have highlighted the highly parallel computation power of the mammalian neocortex. In this paper we describe a GPGPU-accelerated implementation of an intelligent learning model inspired by the structural and functional properties of the neocortex. Furthermore, we consider two inefficiencies inherent to our initial implementation and propose software optimizations to mitigate such problems. Analysis of our application's behavior and performance provides important insights into the GPGPU architecture, including the number of cores, the memory system, atomic operations, and the global thread scheduler. Additionally, we create a runtime profiling tool for the cortical network that proportionally distributes work across the host CPU as well as multiple GPGPUs available to the system. Using the profiling tool with these optimizations on Nvidia's CUDA framework, we achieve up to 60x speedup over a single-threaded CPU implementation of the model. (c) 2012 Elsevier Inc. All rights reserved.
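The abstract mentions a runtime profiling tool that distributes work proportionally across the host CPU and multiple GPGPUs. The record gives no implementation details, so the following is only a minimal sketch of one way proportional distribution could work, assuming each device has a measured throughput (work units per second) from a short profiling run. The function name `split_work`, the device labels, and the throughput numbers are hypothetical and do not come from the paper.

```python
def split_work(total_units, throughputs):
    """Split total_units across devices in proportion to measured throughput.

    throughputs maps a (hypothetical) device label to work units per second,
    as might be measured by a short profiling run. Returns integer counts
    that sum exactly to total_units.
    """
    if total_units < 0:
        raise ValueError("total_units must be non-negative")
    total_rate = sum(throughputs.values())
    if total_rate <= 0:
        raise ValueError("at least one device needs positive throughput")

    # Ideal (fractional) share for each device.
    shares = {dev: total_units * rate / total_rate
              for dev, rate in throughputs.items()}

    # Round down, then hand the leftover units to the devices with the
    # largest fractional remainders (largest-remainder method).
    alloc = {dev: int(share) for dev, share in shares.items()}
    leftover = total_units - sum(alloc.values())
    by_remainder = sorted(shares, key=lambda d: shares[d] - alloc[d],
                          reverse=True)
    for dev in by_remainder[:leftover]:
        alloc[dev] += 1
    return alloc


if __name__ == "__main__":
    # Illustrative numbers only: a CPU and two unequal GPUs.
    print(split_work(100, {"cpu": 1.0, "gpu0": 6.0, "gpu1": 3.0}))
```

With these illustrative rates, the faster device receives the largest share, so all devices would finish at roughly the same time if the profiled throughput holds for the full workload.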


Pages: 953-971

Page count: 19

