Performance data of multiple-precision scalar and vector BLAS operations on CPU and GPU

被引:3
|
作者
Isupov, Konstantin [1 ]
机构
[1] Vyatka State Univ, Dept Elect Comp Machines, Kirov, Russia
来源
DATA IN BRIEF | 2020年 / 30卷
基金
俄罗斯科学基金会;
关键词
Multiple-precision arithmetic; Floating-point computations; Graphics processing units; CUDA; BLAS; IMPLEMENTATION; DESIGN;
D O I
10.1016/j.dib.2020.105506
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Many optimized linear algebra packages support the singleand double-precision floating-point data types. However, there are a number of important applications that require a higher level of precision, up to hundreds or even thousands of digits. This article presents performance data of four dense basic linear algebra subprograms - ASUM, DOT, SCAL, and AXPY - implemented using existing extended-/multipleprecision software for conventional central processing units and CUDA compatible graphics processing units. The following open source packages are considered: MPFR, MPDECIMAL, ARPREC, MPACK, XBLAS, GARPREC, CAMPARY, CUMP, and MPRES-BLAS. The execution time of CPU and GPU implementations is measured at a fixed problem size and various levels of numeric precision. The data in this article are related to the research article entitled "Design and implementation of multiple-precision BLAS Level 1 functions for graphics processing units"[1]. (C) 2020 The Author(s). Published by Elsevier Inc.
引用
收藏
页数:7
相关论文
共 9 条
  • [1] Design and implementation of multiple-precision BLAS Level 1 functions for graphics processing units
    Isupov, Konstantin
    Knyazkov, Vladimir
    Kuvaev, Alexander
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 140 : 25 - 36
  • [2] Multiple-Precision Summation on Hybrid CPU-GPU Platforms Using RNS-based Floating-Point Representation
    Isupov, Konstantin
    Kuvaev, Alexander
    FIFTH INTERNATIONAL CONFERENCE ON ENGINEERING AND TELECOMMUNICATION (ENT-MIPT 2018), 2018, : 153 - 157
  • [3] Efficient GPU Implementation of Multiple-Precision Addition based on Residue Arithmetic
    Isupov, Konstantin
    Knyazkov, Vladimir
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (09) : 1 - 8
  • [4] Multiple-precision sparse matrix-vector multiplication on GPUs
    Isupov, Konstantin
    JOURNAL OF COMPUTATIONAL SCIENCE, 2022, 61
  • [5] On the basic operations of interval multiple-precision arithmetic with center-radius form
    Matsuda, Nozomu
    Yamamoto, Nobito
    IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2011, 2 (01): : 54 - 67
  • [6] Performance Analysis of LiDAR Data Processing on Multi-Core CPU and GPU Architectures
    Alzyout, Mohammad S.
    Al Nounou, Abd Alrahman
    Tikkisetty, Yashwanth Naidu
    Alawneh, Shadi
    2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [7] Performance Improvement of CUDA Applications by Reducing CPU-GPU Data Transfer Overhead
    Sunitha, N., V
    Raju, K.
    Chiplunkar, Niranjan N.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2017, : 211 - 215
  • [8] Performance impact on resource sharing among multiple CPU-and GPU-based applications
    Yamagiwa, Shinichi
    Wada, Koichi
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2011, 26 (04) : 313 - 329
  • [9] INVESTIGATION OF PARALLEL DATA PROCESSING USING HYBRID HIGH PERFORMANCE CPU plus GPU SYSTEMS AND CUDA STREAMS
    Czarnul, Pawel
    COMPUTING AND INFORMATICS, 2020, 39 (03) : 510 - 536