Accelerating Sparse Matrix Operations in Neural Networks on Graphics Processing Units

被引:0
|
作者
Argueta, Arturo [1 ]
Chiang, David [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
来源
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019) | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graphics Processing Units (GPUs) are commonly used to train and evaluate neural networks efficiently. While previous work in deep learning has focused on accelerating operations on dense matrices/tensors on GPUs, efforts have concentrated on operations involving sparse data structures. Operations using sparse structures are common in natural language models at the input and output layers, because these models operate on sequences over discrete alphabets. We present two new GPU algorithms: one at the input layer, for multiplying a matrix by a few-hot vector (generalizing the more common operation of multiplication by a one-hot vector) and one at the output layer, for a fused softmax and top-N selection (commonly used in beam search). Our methods achieve speedups over state-of-the-art parallel GPU baselines of up to 7x and 50x, respectively. We also illustrate how our methods scale on different GPU architectures.
引用
收藏
页码:6215 / 6224
页数:10
相关论文
共 50 条
  • [1] Accelerating Sparse Linear Algebra Using Graphics Processing Units
    Spagnoli, Kyle E.
    Humphrey, John R.
    Price, Daniel K.
    Kelmelis, Eric J.
    MODELING AND SIMULATION FOR DEFENSE SYSTEMS AND APPLICATIONS VI, 2011, 8060
  • [2] Accelerating FCM neural network classifier using graphics processing units with CUDA
    Wang, Lin
    Yang, Bo
    Chen, Yuehui
    Chen, Zhenxiang
    Sun, Hongwei
    APPLIED INTELLIGENCE, 2014, 40 (01) : 143 - 153
  • [3] Accelerating FCM neural network classifier using graphics processing units with CUDA
    Lin Wang
    Bo Yang
    Yuehui Chen
    Zhenxiang Chen
    Hongwei Sun
    Applied Intelligence, 2014, 40 : 143 - 153
  • [4] Accelerating NTRU Encryption with Graphics Processing Units
    Bai, Tianyu
    Davis, Spencer
    Li, Juanjuan
    Gu, Ying
    Jiang, Hai
    INTERNATIONAL JOURNAL OF NETWORKED AND DISTRIBUTED COMPUTING, 2014, 2 (04) : 250 - 258
  • [5] Accelerating parameter inference with graphics processing units
    Wysocki, D.
    O'Shaughnessy, R.
    Lange, Jacob
    Fang, Yao-Lung L.
    PHYSICAL REVIEW D, 2019, 99 (08)
  • [6] Accelerating Viterbi algorithm on graphics processing units
    Muhammad Kashif Hanif
    Karl-Heinz Zimmermann
    Computing, 2017, 99 : 1105 - 1123
  • [7] Accelerating Viterbi algorithm on graphics processing units
    Hanif, Muhammad Kashif
    Zimmermann, Karl-Heinz
    COMPUTING, 2017, 99 (11) : 1105 - 1123
  • [8] Sparse Spiking Neural-Like Membrane Systems on Graphics Processing Units
    Hernandez-Tello, Javier
    Martinez-del-Amor, Miguel A.
    Orellana-Martin, David
    Cabarle, Francis George C.
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2024, 34 (07)
  • [9] Accelerating Physical Simulations Using Graphics Processing Units
    Hoffmann, Karl Heinz
    Hofmann, Michael
    Lang, Jens
    Rnger, Gudula
    Seeger, Steffen
    IT-INFORMATION TECHNOLOGY, 2011, 53 (02): : 49 - 59
  • [10] Accelerating Gate Sizing Using Graphics Processing Units
    Shi, Bing
    Zhang, Yufu
    Srivastava, Ankur
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2012, 31 (01) : 160 - 164