TransPimLib: Efficient Transcendental Functions for Processing-in-Memory Systems

被引:5
作者
Item, Maurus [1 ]
Gomez-Luna, Juan [1 ]
Guo, Yuxin [1 ]
Oliveira, Geraldo F. [1 ]
Sadrosadati, Mohammad [1 ]
Mutlu, Onur [1 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
来源
2023 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, ISPASS | 2023年
关键词
processing-in-memory; processing-near-memory; transcendental functions; activation functions; machine learning;
D O I
10.1109/ISPASS57527.2023.00031
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Processing-in-memory (PIM) promises to alleviate the data movement bottleneck in modern computing systems. However, current real-world PIM systems have the inherent disadvantage that their hardware is more constrained than in conventional processors (CPU, GPU), due to the difficulty and cost of building processing elements near or inside the memory. As a result, general-purpose PIM architectures support fairly limited instruction sets and struggle to execute complex operations such as transcendental functions and other hard-to-calculate operations (e.g., square root). These operations are particularly important for some modern workloads, e.g., activation functions in machine learning applications. In order to provide support for transcendental (and other hardto-calculate) functions in general-purpose PIM systems, we present TransPimLib, a library that provides CORDIC-based and LUT-based methods for trigonometric functions, hyperbolic functions, exponentiation, logarithm, square root, etc. We develop an implementation of TransPimLib for the UPMEM PIM architecture and perform a thorough evaluation of TransPimLib's methods in terms of performance and accuracy, using microbenchmarks and three full workloads (Blackscholes, Sigmoid, Softmax). We open-source all our code and datasets at https://github.com/CMU-SAFARI/transpimlib.
引用
收藏
页码:235 / 247
页数:13
相关论文
共 147 条
  • [41] SynCron: Efficient Synchronization Support for Near-Data-Processing Architectures
    Giannoula, Christina
    Vijaykumar, Nandita
    Papadopoulou, Nikela
    Karakostas, Vasileios
    Fernandez, Ivan
    Gomez-Luna, Juan
    Orosa, Lois
    Koziris, Nectarios
    Goumas, Georgios
    Mutlu, Onur
    [J]. 2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), 2021, : 263 - 276
  • [42] Glassner A. S., 1989, An Introduction to Ray Tracing
  • [43] WHAT EVERY COMPUTER SCIENTIST SHOULD KNOW ABOUT FLOATING-POINT ARITHMETIC
    GOLDBERG, D
    [J]. COMPUTING SURVEYS, 1991, 23 (01) : 5 - 48
  • [44] Gomez-Luna J., 2021, 2021 12 INT GREEN SU, P1
  • [45] Gomez-Luna J., 2022, IEEE Access
  • [46] Guo Q., 2014, P WORKSH NEAR DAT PR
  • [47] CAIRO: A Compiler-Assisted Technique for Enabling Instruction-Level Offloading of Processing-In-Memory
    Hadidi, Ramyad
    Nai, Lifeng
    Kim, Hyojong
    Kim, Hyesoon
    [J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2017, 14 (04)
  • [48] SIMDRAM: A Framework for Bit-Serial SIMD Processing using DRAM
    Hajinazar, Nastaran
    Oliveira, Geraldo F.
    Gregorio, Sven
    Ferreira, Joao Dinis
    Ghiasi, Nika Mansouri
    Patel, Minesh
    Alser, Mohammed
    Ghose, Saugata
    Gomez-Luna, Juan
    Mutlu, Onur
    [J]. ASPLOS XXVI: TWENTY-SIXTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, 2021, : 329 - 345
  • [49] Hajj Izzat El, 2021, arXiv
  • [50] Hamdioui S, 2017, DES AUT TEST EUROPE, P722, DOI 10.23919/DATE.2017.7927083