Parallel Singular Value Decomposition on Heterogeneous Multi-core and Multi-GPU Platforms

被引:0
|
作者
Feng, Xiaowen [1 ,2 ]
Jin, Hai [1 ]
Zheng, Ran [1 ]
Zhu, Lei [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Wuhan 430074, Peoples R China
[2] Elect Power Corp, Informat & Commun Co Hunan, Changsha 410007, Hunan, Peoples R China
来源
2014 NINTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM) | 2014年
关键词
Heterogeneous Platform; Singular Value Decomposition; Divide-and-Conquer; Coordination; DIVIDE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Singular value decomposition (SVD) is one of the most fundamental matrix calculations in numerical linear algebra. Traditional solution is the QR-iteration-based SVD algorithm on CPU, and it is time-consuming. Nowadays, Graphics Processing Units (GPUs) are suited for many general purpose tasks and have emerged as low price and high performance accelerators. In this paper, the parallel-friendly divide-and-conquer approach is employed to accelerate SVD algorithm on the heterogeneous multicore and multi-GPU systems. Two mechanisms are designed to make good use of the computational resource on the heterogeneous system, including two-layer divide-and-conquer and coordination between CPU and GPU. The experimental results show that our algorithm is faster than Intel MKL with four CPU cores, and reaches 45 times speedup with four NVIDIA GTX460 GPUs over LAPACK. Our implementation can also achieve about 1.5 times speedup by doubling the number of GPU devices.
引用
收藏
页码:45 / 50
页数:6
相关论文
共 50 条
  • [21] Singular Value Decomposition Optimizes Multi-Level Orthogonal Code Index Modulation DCSK System
    Zhang, Gang
    Chen, Xibiao
    Hu, Yanan
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (12) : 2874 - 2878
  • [22] Theory of multi-resolution singular value decomposition and its application to signal processing and fault diagnosis
    Zhao X.
    Ye B.
    Chen T.
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2010, 46 (20): : 64 - 75
  • [23] Analysis of Turbulence Driven Particle Transport in PANTA by Using Multi-Field Singular Value Decomposition
    Kodahara T.
    Sasaki M.
    Kawachi Y.
    Jajima Y.
    Kobayashi T.
    Yamada T.
    Arakawa H.
    Fujisawa A.
    Plasma and Fusion Research, 2023, 18
  • [24] Analysis of Turbulence Driven Particle Transport in PANTA by Using Multi-Field Singular Value Decomposition
    Kodahara, Takumi
    Sasaki, Makoto
    Kawachi, Yuichi
    Jajima, Yuki
    Kobayashi, Tatsuya
    Yamada, Takuma
    Arakawa, Hiroyuki
    Fujisawa, Akihide
    PLASMA AND FUSION RESEARCH, 2023, 18
  • [25] Centralized Multi-Scale Singular Value Decomposition for Feature Construction in LIDAR Image Classification Problems
    Bassu, D.
    Izmailov, R.
    McIntosh, A.
    Ness, L.
    Shallcross, D.
    2012 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2012,
  • [26] A Parallel Implementation of Singular Value Decomposition based on Map-Reduce and PARPACK
    Ding, Yaguang
    Zhu, Guofeng
    Cui, Chenyang
    Zhou, Jian
    Tao, Liang
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 739 - 741
  • [27] Singular Value Decomposition-Based Saturated Control Scheme for a Multi-DOF Wave Energy Converter
    Li, XiaoGang
    Wang, Jiameng
    Zhai, Peng
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2023, 59 (02) : 2019 - 2029
  • [28] SEVDA: Singular Value Decomposition Based Parallel Write Scheme for Memristive CNN Accelerators
    Al-Shaarawy, Ali
    Genov, Roman
    Amirsoleimani, Amirali
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [29] A general and efficient divide-and-conquer algorithm framework for multi-core clusters
    Carlos H. González
    Basilio B. Fraguela
    Cluster Computing, 2017, 20 : 2605 - 2626
  • [30] MR3-SMP: A symmetric tridiagonal eigensolver for multi-core architectures
    Petschow, M.
    Bientinesi, P.
    PARALLEL COMPUTING, 2011, 37 (12) : 795 - 805