Enabling Scientific Computing on Memristive Accelerators

被引:68
作者
Feinberg, Ben [1 ]
Vengalam, Uday Kumar Reddy [1 ]
Whitehair, Nathan [1 ]
Wang, Shibo [2 ]
Ipek, Engin [1 ,2 ]
机构
[1] Univ Rochester, Dept Elect & Comp Engn, 601 Elmwood Ave, Rochester, NY 14627 USA
[2] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA
来源
2018 ACM/IEEE 45TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA) | 2018年
关键词
Accelerator Architectures; Resistive RAM;
D O I
10.1109/ISCA.2018.00039
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Linear algebra is ubiquitous across virtually every field of science and engineering, from climate modeling to macroeconomics. This ubiquity makes linear algebra a prime candidate for hardware acceleration, which can improve both the run time and the energy efficiency of a wide range of scientific applications. Recent work on memristive hardware accelerators shows significant potential to speed up matrix-vector multiplication (MVM), a critical linear algebra kernel at the heart of neural network inference tasks. Regrettably, the proposed hardware is constrained to a narrow range of workloads: although the eight- to 16-bit computations afforded by memristive MVM accelerators are acceptable for machine learning, they are insufficient for scientific computing where high-precision floating point is the norm. This paper presents the first proposal to enable scientific computing on memristive crossbars. Three techniques are explored reducing overheads by exploiting exponent range locality, early termination of fixed-point computation, and static operation scheduling that together enable a fixed-point memristive accelerator to perform high-precision floating point without the exorbitant cost of na ve floating-point emulation on fixed-point hardware. A heterogeneous collection of crossbars with varying sizes is proposed to efficiently handle sparse matrices, and an algorithm for mapping the dense subblocks of a sparse matrix to an appropriate set of crossbars is investigated. The accelerator can be combined with existing GPU-based systems to handle datasets that cannot be efficiently handled by the memristive accelerator alone. The proposed optimizations permit the memristive MVM concept to be applied to a wide range of problem domains, respectively improving the execution time and energy dissipation of sparse linear solvers by 10.3x and 10.9x over a purely GPU-based system.
引用
收藏
页码:367 / 382
页数:16
相关论文
共 56 条
  • [41] High Performance Numerical Computing for High Energy Physics: A New Challenge for Big Data Science
    Pop, Florin
    [J]. ADVANCES IN HIGH ENERGY PHYSICS, 2014, 2014
  • [42] SAAD Y, 1986, SIAM J SCI STAT COMP, V7, P856, DOI 10.1137/0907058
  • [43] Saad Y., 2003, Iterative Methods for Sparse Linear Systems
  • [44] Analysis of Power Consumption and Linearity in Capacitive Digital-to-Analog Converters Used in Successive Approximation ADCs
    Saberi, Mehdi
    Lotfi, Reza
    Mafinezhad, Khalil
    Serdijn, Wouter A.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2011, 58 (08) : 1736 - 1748
  • [45] ISAAC: A Convolutional Neural Network Accelerator with In-Situ Analog Arithmetic in Crossbars
    Shafiee, Ali
    Nag, Anirban
    Muralimanohar, Naveen
    Balasubramonian, Rajeev
    Strachan, John Paul
    Hu, Miao
    Williams, R. Stanley
    Srikumar, Vivek
    [J]. 2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2016, : 14 - 26
  • [46] Song L., 2017, GRAPHR ACCELERATING
  • [47] BUS-INVERT CODING FOR LOW-POWER I/O
    STAN, MR
    BURLESON, WP
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 1995, 3 (01) : 49 - 58
  • [48] BI-CGSTAB - A FAST AND SMOOTHLY CONVERGING VARIANT OF BI-CG FOR THE SOLUTION OF NONSYMMETRIC LINEAR-SYSTEMS
    VANDERVORST, HA
    [J]. SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1992, 13 (02): : 631 - 644
  • [49] Properties of galaxies reproduced by a hydrodynamic simulation
    Vogelsberger, M.
    Genel, S.
    Springel, V.
    Torrey, P.
    Sijacki, D.
    Xu, D.
    Snyder, G.
    Bird, S.
    Nelson, D.
    Hernquist, L.
    [J]. NATURE, 2014, 509 (7499) : 177 - +
  • [50] Vuduc R.W., 2003, THESIS UC BERKELEY