Sparse Cholesky Factorization on FPGA Using Parameterized Model

Cited by: 2
Authors
Sun, Yichun [1]
Liu, Hengzhu [1]
Zhou, Tong [1]
Affiliations
[1] Natl Univ Def Technol, Sch Comp, Deya Rd 109, Changsha 410073, Hunan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
PARALLEL ALGORITHMS; OPTIMIZATION;
DOI
10.1155/2017/3021591
Chinese Library Classification number
T [Industrial Technology];
Subject classification code
08;
Abstract
Cholesky factorization is a fundamental problem in most engineering and scientific computing applications. When dealing with a large sparse matrix, numerical decomposition consumes the most time. We present a vector architecture to parallelize the numerical decomposition phase of Cholesky factorization. We construct an integrated analytical parameterized performance model to accurately predict the execution times of typical matrices under varying parameters. Our proposed approach is general across accelerators and is limited to neither field-programmable gate arrays (FPGAs) nor application-specific integrated circuits (ASICs). We implement a simplified module on an FPGA to verify the accuracy of the model. The experiments show that, in most cases, the difference between the predicted and measured execution times is less than 10%. Based on the performance model, we analyze the performance of varied parameter settings, optimize the parameters, and obtain a balance between resources and performance. Compared with state-of-the-art implementations on CPU and GPU, the performance with the optimal parameters is 2x that of the CPU. Our model offers several advantages, particularly in power consumption, and provides guidance for the design of future acceleration components.
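For readers unfamiliar with the numerical decomposition step named in the abstract, the following is a minimal dense, column-by-column sketch in Python. It is not the authors' vector architecture or their sparse data layout; the function name `cholesky_columns` and the list-of-rows input are assumptions made for illustration. It only shows the kind of column update (subtract scaled earlier columns, then scale by the square root of the pivot) that a vector unit could process element-wise, and it skips zero multipliers as a nod to sparsity.

```python
import math


def cholesky_columns(a):
    """Return lower-triangular L with a = L * L^T.

    `a` is a symmetric positive definite matrix given as a list of rows.
    Hypothetical helper for illustration; dense storage, no fill-in analysis.
    """
    n = len(a)
    L = [[0.0] * n for _ in range(n)]
    for j in range(n):
        # Start from the lower part of column j of the input matrix.
        col = [float(a[i][j]) for i in range(j, n)]
        # Subtract the contribution of every earlier column k < j.
        for k in range(j):
            ljk = L[j][k]
            if ljk == 0.0:
                continue  # a zero multiplier contributes nothing
            for idx, i in enumerate(range(j, n)):
                col[idx] -= L[i][k] * ljk
        pivot = col[0]
        if pivot <= 0.0:
            raise ValueError("matrix is not positive definite")
        d = math.sqrt(pivot)
        L[j][j] = d
        # Scale the rest of the column by the diagonal entry.
        for idx in range(1, len(col)):
            L[j + idx][j] = col[idx] / d
    return L


if __name__ == "__main__":
    A = [[4.0, 12.0, -16.0],
         [12.0, 37.0, -43.0],
         [-16.0, -43.0, 98.0]]
    for row in cholesky_columns(A):
        print(row)
```

The inner loop over `col` is independent across entries, which is why this step lends itself to vector-style parallel hardware; a real sparse implementation would first run a symbolic analysis to find the nonzero pattern of L.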
Pages: 11
Related papers
24 in total
  • [1] [Anonymous], P S APPL ACC HIGH PE
  • [2] [Anonymous], S APPL ACC HIGH PERF
  • [3] [Anonymous], IEICE ELECT EXPRESS
  • [4] GPU-Accelerated Sparse LU Factorization for Circuit Simulation with Performance Modeling
    Chen, Xiaoming
    Ren, Ling
    Wang, Yu
    Yang, Huazhong
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (03) : 786 - 795
  • [5] Algorithm 887: CHOLMOD, Supernodal Sparse Cholesky Factorization and Update/Downdate
    Chen, Yanqing
    Davis, Timothy A.
    Hager, William W.
    Rajamanickam, Sivasankaran
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2008, 35 (03)
  • [6] THE MULTIFRONTAL SOLUTION OF INDEFINITE SPARSE SYMMETRIC LINEAR-EQUATIONS
    DUFF, IS
    REID, JK
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1983, 9 (03) : 302 - 325
  • [7] Gallivan K., 1990, PARALLEL ALGORITHMS, V4
  • [8] PARALLEL CHOLESKY FACTORIZATION ON A SHARED-MEMORY MULTIPROCESSOR
    GEORGE, A
    HEATH, MT
    LIU, J
    [J]. LINEAR ALGEBRA AND ITS APPLICATIONS, 1986, 77 : 165 - 187
  • [9] George T., 2011, Proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2011), P372, DOI 10.1109/IPDPS.2011.44
  • [10] A Performance Modeling and Optimization Analysis Tool for Sparse Matrix-Vector Multiplication on GPUs
    Guo, Ping
    Wang, Liqiang
    Chen, Po
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (05) : 1112 - 1123