Efficient GPU Implementation of Multiple-Precision Addition based on Residue Arithmetic

被引:0
|
作者
Isupov, Konstantin [1 ]
Knyazkov, Vladimir [2 ]
机构
[1] Vyatka State Univ, Dept Elect Comp Machines, Kirov 610000, Russia
[2] Penza State Univ, Res Inst Fundamental & Appl Studies, Penza 440026, Russia
关键词
Multiple-precision algorithm; integer arithmetic; residue number system; GPU; CUDA;
D O I
10.14569/IJACSA.2020.0110901
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this work, the residue number system (RNS) is applied for efficient addition of multiple-precision integers using graphics processing units (GPUs) that support the Compute Unified Device Architecture (CUDA) platform. The RNS allows calculations with the digits of a multiple-precision number to be performed in an element-wise fashion, without the overhead of communication between them, which is especially useful for massively parallel architectures such as the GPU architecture. The paper discusses two multiple-precision integer algorithms. The first algorithm relies on if-else statements to test the signs of the operands. In turn, the second algorithm uses radix complement RNS arithmetic to handle negative numbers. While the first algorithm is more straightforward, the second one avoids branch divergence among threads that concurrently compute different elements of a multiple-precision array. As a result, the second algorithm shows significantly better performance compared to the first algorithm. Both algorithms running on an NVIDIA RTX 2080 Ti GPU are faster than the multi-core GNU MP implementation running on an Intel Xeon 4100 processor.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 37 条
  • [21] An Efficient Computation of Dempster-Shafer Theory of Evidence Based on Native GPU Implementation
    Rico, Noelia
    Troiano, Luigi
    Diaz, Irene
    BELIEF FUNCTIONS: THEORY AND APPLICATIONS (BELIEF 2021), 2021, 12915 : 291 - 299
  • [22] FAST AND EFFICIENT REAL-TIME GPU BASED IMPLEMENTATION OF WAVE FIELD SYNTHESIS
    Ranjan, Rishabh
    Gan, Woon-Seng
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [23] An Efficient GPU-Based Multiple Pattern Matching Algorithm for Packet Filtering
    Hung, Che-Lun
    Lin, Chun-Yuan
    Wu, Po-Chang
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, 86 (2-3): : 347 - 358
  • [24] An efficient GPU implementation for large scale individual-based simulation of collective behavior
    Erra, Ugo
    Frola, Bernardino
    Scarano, Vittorio
    Couzin, Iain
    2009 INTERNATIONAL WORKSHOP ON HIGH PERFORMANCE COMPUTATIONAL SYSTEMS BIOLOGY, PROCEEDINGS, 2009, : 51 - +
  • [25] An Efficient GPU-Based Multiple Pattern Matching Algorithm for Packet Filtering
    Che-Lun Hung
    Chun-Yuan Lin
    Po-Chang Wu
    Journal of Signal Processing Systems, 2017, 86 : 347 - 358
  • [26] Efficient GPU Implementation of the McMurchie-Davidson Method for Shell-Based ERI Computations
    Fujii, Haruto
    Ito, Yasuaki
    Yokogawa, Nobuya
    Suzuki, Kanta
    Tsuji, Satoki
    Nakano, Koji
    Parque, Victor
    Kasagi, Akihiko
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [27] An Efficient GPU-Based Implementation of the R-MSF-Algorithm for Remote Sensing Imagery
    Castro-Palazuelos, David
    Robles-Valdez, Daniel
    Torres-Roman, Deni
    PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 1030 - 1038
  • [28] Efficient GPU-Implementation of H-P Sort Based on Improved Histogram Computation
    Takase, Kaito
    Hagihara, Takumi
    Fujimoto, Noriyuki
    Wada, Koichi
    THE PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION, HPC ASIA 2024, 2024, : 134 - 144
  • [29] Efficient GPU Implementation of AFP-Based Algorithms to Accelerate Comparative Bioinformatic Analysis of Protein Structures in 3D-Space
    I. A. Timokhin
    N. N. Popova
    D. A. Suplatov
    Lobachevskii Journal of Mathematics, 2022, 43 : 904 - 915
  • [30] Efficient GPU Implementation of AFP-Based Algorithms to Accelerate Comparative Bioinformatic Analysis of Protein Structures in 3D-Space
    Timokhin, I. A.
    Popova, N. N.
    Suplatov, D. A.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2022, 43 (04) : 904 - 915