An Enhanced Image Reconstruction Tool for Computed Tomography on GPUs

被引:6
作者
Yu, Xiaodong [1 ]
Wang, Hao [1 ]
Feng, Wu-chun [1 ]
Gong, Hao [2 ]
Cao, Guohua [2 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24060 USA
[2] Virginia Tech, Dept Biomed Engr & Mech, Blacksburg, VA 24060 USA
来源
ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2017 | 2017年
关键词
GPU; Computed Tomography; Image Reconstruction; Algebraic Reconstruction Technique; Sparse Matrix-Vector Multiplication; SpMV; Transposition; SCHEME; ART;
D O I
10.1145/3075564.3078889
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The algebraic reconstruction technique (ART) is an iterative algorithm for CT (i.e., computed tomography) image reconstruction that delivers better image quality with less radiation dosage than the industry-standard filtered back projection (FBP). However, the high computational cost of ART requires researchers to turn to high-performance computing to accelerate the algorithm. Alas, existing approaches for ART suffer from inefficient design of compressed data structures and computational kernels on GPUs. Thus, this paper presents our enhanced CUDA-based CT image reconstruction tool based on the algebraic reconstruction technique (ART) or cuART. It delivers a compression and parallelization solution for ART-based image reconstruction on GPUs. We address the under-performing, but popular, GPU libraries, e.g., cuSPARSE, BRC, and CSR5, on the ART algorithm and propose a symmetrybased CSR format (SCSR) to further compress the CSR data structure and optimize data access for both SpMV and SpMV_T via a column-indices permutation. We also propose sorting-based and sorting-free blocking techniques to optimize the kernel computation by leveraging the sparsity patterns of the system matrix. The end result is that cuART can reduce the memory footprint significantly and enable practical CT datasets to fit into a single GPU. The experimental results on a NVIDIA Tesla K80 GPU illustrate that our approach can achieve up to 6.8x, 7.2x, and 5.4x speedups over counterparts that use cuSPARSE, BRC, and CSR5, respectively.
引用
收藏
页码:97 / 106
页数:10
相关论文
共 39 条
  • [11] ALGEBRAIC RECONSTRUCTION TECHNIQUES (ART) FOR 3-DIMENSIONAL ELECTRON MICROSCOPY AND X-RAY PHOTOGRAPHY
    GORDON, R
    BENDER, R
    HERMAN, GT
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 1970, 29 (03) : 471 - &
  • [12] Efficient Sparse Matrix-Vector Multiplication on GPUs using the CSR Storage Format
    Greathouse, Joseph L.
    Daga, Mayank
    [J]. SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, : 769 - 780
  • [13] Grüll F, 2012, PROC INT CONF RECON
  • [14] A PROJECTION ACCESS ORDER FOR SPEEDY CONVERGENCE OF ART (ALGEBRAIC RECONSTRUCTION TECHNIQUE) - A MULTILEVEL SCHEME FOR COMPUTED-TOMOGRAPHY
    GUAN, HQ
    GORDON, R
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 1994, 39 (11) : 2005 - 2022
  • [15] Guo M., 2017, MED PHYS
  • [16] Hou K., 2017, ACM INT C COMP FRONT
  • [17] Kak A.C., 1984, DIGITAL IMAGE PROCES, P111
  • [18] Parallel image reconstruction on MIMD computers for three-dimensional cone-beam tomography
    Laurent, C
    Peyrin, F
    Chassery, JM
    Amiel, M
    [J]. PARALLEL COMPUTING, 1998, 24 (9-10) : 1461 - 1479
  • [19] GPU-Based Acceleration for Interior Tomography
    Liu, Rui
    Luo, Yan
    Yu, Hengyong
    [J]. IEEE ACCESS, 2014, 2 : 757 - 770
  • [20] Speculative segmented sum for sparse matrix-vector multiplication on heterogeneous processors
    Liu, Weifeng
    Vinter, Brian
    [J]. PARALLEL COMPUTING, 2015, 49 : 179 - 193