Parallel Singular Value Decomposition on Heterogeneous Multi-core and Multi-GPU Platforms

被引:0
|
作者
Feng, Xiaowen [1 ,2 ]
Jin, Hai [1 ]
Zheng, Ran [1 ]
Zhu, Lei [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Wuhan 430074, Peoples R China
[2] Elect Power Corp, Informat & Commun Co Hunan, Changsha 410007, Hunan, Peoples R China
来源
2014 NINTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM) | 2014年
关键词
Heterogeneous Platform; Singular Value Decomposition; Divide-and-Conquer; Coordination; DIVIDE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Singular value decomposition (SVD) is one of the most fundamental matrix calculations in numerical linear algebra. Traditional solution is the QR-iteration-based SVD algorithm on CPU, and it is time-consuming. Nowadays, Graphics Processing Units (GPUs) are suited for many general purpose tasks and have emerged as low price and high performance accelerators. In this paper, the parallel-friendly divide-and-conquer approach is employed to accelerate SVD algorithm on the heterogeneous multicore and multi-GPU systems. Two mechanisms are designed to make good use of the computational resource on the heterogeneous system, including two-layer divide-and-conquer and coordination between CPU and GPU. The experimental results show that our algorithm is faster than Intel MKL with four CPU cores, and reaches 45 times speedup with four NVIDIA GTX460 GPUs over LAPACK. Our implementation can also achieve about 1.5 times speedup by doubling the number of GPU devices.
引用
收藏
页码:45 / 50
页数:6
相关论文
共 50 条
  • [31] Resolving cold start and sparse data challenge in recommender systems using multi-level singular value decomposition
    Rodpysh, Keyvan Vahidy
    Mirabedini, Seyed Javad
    Banirostam, Touraj
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 94
  • [32] Real-time Singular Value Decomposition based Load Frequency Control of Multi-area Power System
    Khan, Irfan
    Kar, Soummya
    Xu, Yinliang
    2018 CLEMSON UNIVERSITY POWER SYSTEMS CONFERENCE (PSC), 2018,
  • [33] A new video watermarking using redundant discrete wavelet in singular value decomposition domain with multi-objective optimization
    Kumar, Pala Mahesh
    Srinivas, Kalyanapu
    Prabhakar, Bitla
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (13)
  • [34] Extending Guided Filters Through Effective Utilization of Multi-Channel Guide Images Based on Singular Value Decomposition
    Mishiba, Kazu
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2025, 6 : 385 - 397
  • [35] Multi-Objective Genetic Algorithm Optimization for Image Watermarking Based on Singular Value Decomposition and Lifting Wavelet Transform
    Loukhaoukha, Khaled
    Chouinard, Jean-Yves
    Taieb, Mohamed Haj
    IMAGE AND SIGNAL PROCESSING, PROCEEDINGS, 2010, 6134 : 394 - 403
  • [36] Multi-Frequency Planar Near-Field Scanning by Means of Singular-Value Decomposition (SVD) Optimization
    Capozzoli, A.
    Curcio, C.
    Liseno, A.
    IEEE ANTENNAS AND PROPAGATION MAGAZINE, 2011, 53 (06) : 212 - 221
  • [37] MR3-SMP: A symmetric tridiagonal eigensolver for multi-core architectures
    Petschow, M.
    Bientinesi, P.
    PARALLEL COMPUTING, 2011, 37 (12) : 795 - 805
  • [38] Joint multi-subspace feature learning with singular value decomposition for robust single-sample face recognition
    Fei, Rong
    Zhang, Jian
    Bo, Liling
    Zhang, Heng
    Li, Hongran
    Li, Ming
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 114
  • [39] A general and efficient divide-and-conquer algorithm framework for multi-core clusters
    Gonzalez, Carlos H.
    Fraguela, Basilio B.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (03): : 2605 - 2626
  • [40] Comparison of batch scheduling for identical multi-tasks jobs on heterogeneous platforms
    Diakite, Sekou
    Nicod, Jean-Marc
    Philippe, Laurent
    PROCEEDINGS OF THE 16TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2008, : 374 - 378