Performance data of multiple-precision scalar and vector BLAS operations on CPU and GPU

被引：3

作者：

Isupov, Konstantin ^{[1
]}

机构：

[1] Vyatka State Univ, Dept Elect Comp Machines, Kirov, Russia

来源：

DATA IN BRIEF | 2020年 / 30卷

基金：

俄罗斯科学基金会;

关键词：

Multiple-precision arithmetic; Floating-point computations; Graphics processing units; CUDA; BLAS; IMPLEMENTATION; DESIGN;

D O I：

10.1016/j.dib.2020.105506

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Many optimized linear algebra packages support the singleand double-precision floating-point data types. However, there are a number of important applications that require a higher level of precision, up to hundreds or even thousands of digits. This article presents performance data of four dense basic linear algebra subprograms - ASUM, DOT, SCAL, and AXPY - implemented using existing extended-/multipleprecision software for conventional central processing units and CUDA compatible graphics processing units. The following open source packages are considered: MPFR, MPDECIMAL, ARPREC, MPACK, XBLAS, GARPREC, CAMPARY, CUMP, and MPRES-BLAS. The execution time of CPU and GPU implementations is measured at a fixed problem size and various levels of numeric precision. The data in this article are related to the research article entitled "Design and implementation of multiple-precision BLAS Level 1 functions for graphics processing units"[1]. (C) 2020 The Author(s). Published by Elsevier Inc.

引用

页数：7

共 9 条

[1] Design and implementation of multiple-precision BLAS Level 1 functions for graphics processing units
Isupov, Konstantin
Knyazkov, Vladimir
Kuvaev, Alexander
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 140 : 25 - 36
[2] Multiple-Precision Summation on Hybrid CPU-GPU Platforms Using RNS-based Floating-Point Representation
Isupov, Konstantin
Kuvaev, Alexander
FIFTH INTERNATIONAL CONFERENCE ON ENGINEERING AND TELECOMMUNICATION (ENT-MIPT 2018), 2018, : 153 - 157
[3] Efficient GPU Implementation of Multiple-Precision Addition based on Residue Arithmetic
Isupov, Konstantin
Knyazkov, Vladimir
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (09) : 1 - 8
[4] Multiple-precision sparse matrix-vector multiplication on GPUs
Isupov, Konstantin
JOURNAL OF COMPUTATIONAL SCIENCE, 2022, 61
[5] On the basic operations of interval multiple-precision arithmetic with center-radius form
Matsuda, Nozomu
Yamamoto, Nobito
IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2011, 2 (01): : 54 - 67
[6] Performance Analysis of LiDAR Data Processing on Multi-Core CPU and GPU Architectures
Alzyout, Mohammad S.
Al Nounou, Abd Alrahman
Tikkisetty, Yashwanth Naidu
Alawneh, Shadi
2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
[7] Performance Improvement of CUDA Applications by Reducing CPU-GPU Data Transfer Overhead
Sunitha, N., V
Raju, K.
Chiplunkar, Niranjan N.
PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2017, : 211 - 215
[8] Performance impact on resource sharing among multiple CPU-and GPU-based applications
Yamagiwa, Shinichi
Wada, Koichi
INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2011, 26 (04) : 313 - 329
[9] INVESTIGATION OF PARALLEL DATA PROCESSING USING HYBRID HIGH PERFORMANCE CPU plus GPU SYSTEMS AND CUDA STREAMS
Czarnul, Pawel
COMPUTING AND INFORMATICS, 2020, 39 (03) : 510 - 536

← 1 →