Simulating cortical networks on heterogeneous multi-GPU systems

被引:7
|
作者
Nere, Andrew [1 ]
Franey, Sean [1 ]
Hashmi, Atif [1 ]
Lipasti, Mikko [1 ]
机构
[1] Univ Wisconsin, Dept Elect & Comp Engn, Madison, WI 53706 USA
基金
美国国家科学基金会;
关键词
Cortical learning algorithms; CUDA; GPGPU; Profiling systems; RECEPTIVE-FIELDS; FUNCTIONAL ARCHITECTURE; MODEL;
D O I
10.1016/j.jpdc.2012.02.006
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recent advances in neuroscientific understanding have highlighted the highly parallel computation power of the mammalian neocortex. In this paper we describe a GPGPU-accelerated implementation of an intelligent learning model inspired by the structural and functional properties of the neocortex. Furthermore, we consider two inefficiencies inherent to our initial implementation and propose software optimizations to mitigate such problems. Analysis of our application's behavior and performance provides important insights into the GPGPU architecture, including the number of cores, the memory system, atomic operations, and the global thread scheduler. Additionally, we create a runtime profiling tool for the cortical network that proportionally distributes work across the host CPU as well as multiple GPGPUs available to the system. Using the profiling tool with these optimizations on Nvidia's CUDA framework, we achieve up to 60 x speedup over a single-threaded CPU implementation of the model. (c) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:953 / 971
页数:19
相关论文
共 50 条
  • [41] Introducing and Implementing the Allpairs Skeleton for Programming Multi-GPU Systems
    Steuwer, Michel
    Friese, Malte
    Albers, Sebastian
    Gorlatch, Sergei
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2014, 42 (04) : 601 - 618
  • [42] An implementation of the Social Distances Model using multi-GPU systems
    Klusek, Adrian
    Topa, Pawel
    Was, Jaroslaw
    Lubas, Robert
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2018, 32 (04): : 482 - 495
  • [43] High performance MRI simulations of motion on multi-GPU systems
    Christos G Xanthis
    Ioannis E Venetis
    Anthony H Aletras
    Journal of Cardiovascular Magnetic Resonance, 16
  • [44] Towards an optimized distributed deep learning framework for a heterogeneous multi-GPU cluster
    Youngrang Kim
    Hyeonseong Choi
    Jaehwan Lee
    Jik-Soo Kim
    Hyunseung Jei
    Hongchan Roh
    Cluster Computing, 2020, 23 : 2287 - 2300
  • [45] Towards an optimized distributed deep learning framework for a heterogeneous multi-GPU cluster
    Kim, Youngrang
    Choi, Hyeonseong
    Lee, Jaehwan
    Kim, Jik-Soo
    Jei, Hyunseung
    Roh, Hongchan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (03): : 2287 - 2300
  • [46] A PCISPH implementation using distributed multi-GPU acceleration for simulating industrial engineering applications
    Verma, Kevin
    McCabe, Christopher
    Peng, Chong
    Wille, Robert
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2020, 34 (04): : 450 - 464
  • [47] GPU-Chariot: A Programming Framework for Stream Applications Running on Multi-GPU Systems
    Ino, Fumihiko
    Nakagawa, Shinta
    Hagihara, Kenichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (12): : 2604 - 2616
  • [48] SkePU: A Multi-Backend Skeleton Programming Library for Multi-GPU Systems
    Enmyren, Johan
    Kessler, Christoph W.
    HLPP 2010: PROCEEDINGS OF THE FOURTH INTERNATIONAL WORKSHOP ON HIGH-LEVEL PARALLEL PROGRAMMING AND APPLICATIONS, 2010, : 5 - 14
  • [49] Multi-GPU Development of a Neural Networks Based Reconstructor for Adaptive Optics
    Gonzalez-Gutierrez, Carlos
    Luisa Sanchez-Rodriguez, Maria
    Luis Calvo-Rolle, Jose
    de Cos Juez, Francisco Javier
    COMPLEXITY, 2018,
  • [50] Scalable multi-node multi-GPU Louvain community detection algorithm for heterogeneous architectures
    Bhowmick, Anwesha
    Vadhiyar, Sathish
    Varun, P. V.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (17):