Efficient GPU Implementation of Multiple-Precision Addition based on Residue Arithmetic

被引:0
|
作者
Isupov, Konstantin [1 ]
Knyazkov, Vladimir [2 ]
机构
[1] Vyatka State Univ, Dept Elect Comp Machines, Kirov 610000, Russia
[2] Penza State Univ, Res Inst Fundamental & Appl Studies, Penza 440026, Russia
关键词
Multiple-precision algorithm; integer arithmetic; residue number system; GPU; CUDA;
D O I
10.14569/IJACSA.2020.0110901
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this work, the residue number system (RNS) is applied for efficient addition of multiple-precision integers using graphics processing units (GPUs) that support the Compute Unified Device Architecture (CUDA) platform. The RNS allows calculations with the digits of a multiple-precision number to be performed in an element-wise fashion, without the overhead of communication between them, which is especially useful for massively parallel architectures such as the GPU architecture. The paper discusses two multiple-precision integer algorithms. The first algorithm relies on if-else statements to test the signs of the operands. In turn, the second algorithm uses radix complement RNS arithmetic to handle negative numbers. While the first algorithm is more straightforward, the second one avoids branch divergence among threads that concurrently compute different elements of a multiple-precision array. As a result, the second algorithm shows significantly better performance compared to the first algorithm. Both algorithms running on an NVIDIA RTX 2080 Ti GPU are faster than the multi-core GNU MP implementation running on an Intel Xeon 4100 processor.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 37 条
  • [1] Multiple-Precision Residue-Based Arithmetic Library for Parallel CPU-GPU Architectures: Data Types and Features
    Isupov, Konstantin
    Kuvaev, Alexander
    Popov, Mikhail
    Zaviyalov, Anton
    PARALLEL COMPUTING TECHNOLOGIES (PACT 2017), 2017, 10421 : 196 - 204
  • [2] Performance data of multiple-precision scalar and vector BLAS operations on CPU and GPU
    Isupov, Konstantin
    DATA IN BRIEF, 2020, 30
  • [3] A Modular-Positional Computation Technique for Multiple-Precision Floating-Point Arithmetic
    Isupov, Konstantin
    Knyazkov, Vladimir
    PARALLEL COMPUTING TECHNOLOGIES (PACT 2015), 2015, 9251 : 47 - 61
  • [4] Multiple-Precision Summation on Hybrid CPU-GPU Platforms Using RNS-based Floating-Point Representation
    Isupov, Konstantin
    Kuvaev, Alexander
    FIFTH INTERNATIONAL CONFERENCE ON ENGINEERING AND TELECOMMUNICATION (ENT-MIPT 2018), 2018, : 153 - 157
  • [5] CAVLCU: an efficient GPU-based implementation of CAVLC
    Fuentes-Alventosa, Antonio
    Gomez-Luna, Juan
    Maria Gonzalez-Linares, Jose
    Guil, Nicolas
    Medina-Carnicer, R.
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (06) : 7556 - 7590
  • [6] CAVLCU: an efficient GPU-based implementation of CAVLC
    Antonio Fuentes-Alventosa
    Juan Gómez-Luna
    José Maria González-Linares
    Nicolás Guil
    R. Medina-Carnicer
    The Journal of Supercomputing, 2022, 78 : 7556 - 7590
  • [7] Design and implementation of multiple-precision BLAS Level 1 functions for graphics processing units
    Isupov, Konstantin
    Knyazkov, Vladimir
    Kuvaev, Alexander
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 140 : 25 - 36
  • [8] A Realization Method of Forward Converters from Multiple-Precision Binary Numbers to Residue Numbers with Arbitrary Mutable Modulus
    Shirakawa, Koki
    Uemura, Takashi
    Iguchi, Yukihiro
    2011 41ST IEEE INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC (ISMVL), 2011, : 268 - 273
  • [9] RNS-based Data Representation for Handling Multiple-Precision Integers on Parallel Architectures
    Isupov, Konstantin
    Knyazkov, Vladimir
    2016 INTERNATIONAL CONFERENCE ON ENGINEERING AND TELECOMMUNICATION (ENT 2016), 2016, : 76 - 79
  • [10] An Efficient Transaction-based GPU Implementation of Minimum Spanning Forest Algorithm
    Manoochehri, Shayan
    Goodarzi, Bahareh
    Goswami, Dhrubajyoti
    2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 643 - 650