A fast and high-quality cone beam reconstruction pipeline using the GPU

Cited by: 10
Authors
Schiwietz, Thomas [1, 3]
Bose, Supratik [2]
Maltz, Jonathan [2]
Westermann, Rudiger [3]
Affiliations
[1] Siemens Corp Res, 755 College Road East, Princeton, NJ 08540 USA
[2] Siemens Oncol Care Syst, Concord, CA 95420 USA
[3] Tech Univ Munich, Comp Graph & Visualizat Grp, D-85748 Garching, Germany
Source
MEDICAL IMAGING 2007: PHYSICS OF MEDICAL IMAGING, PTS 1-3 | 2007 / Vol. 6510
Keywords
cone beam reconstruction; filtered backprojection; curvature smoothing; FFT; high-pass filtering; ring artifact removal; cupping artifact removal; GPU
DOI
10.1117/12.707598
Chinese Library Classification
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Cone beam scanners have evolved rapidly in recent years. The increasing sampling resolution of the projection images and the desire to reconstruct high-resolution output volumes increase both memory consumption and processing time considerably. To keep the processing time down, new strategies for memory management are required, as well as new algorithmic implementations of the reconstruction pipeline. In this paper, we present a fast and high-quality cone beam reconstruction pipeline using the Graphics Processing Unit (GPU). This pipeline includes the backprojection process as well as pre-filtering and post-filtering stages. In particular, we focus on a subset of five stages, but more stages can be integrated easily. In the pre-filtering stage, we first reduce the noise in the acquired projection images with a non-linear curvature-based smoothing algorithm. Then, we apply a high-pass filter as required by the inverse Radon transform. Next, the backprojection pass reconstructs a raw 3D volume. In post-processing, we first apply ring artifact removal to the volume. Then, we remove cupping artifacts with our novel uniformity correction algorithm. We present the algorithm in detail. To execute the pipeline as quickly as possible, we take advantage of GPUs, which have proven to be very fast parallel processors for numerical problems. Unfortunately, both the projection images and the reconstruction volume are too large to fit into 512 MB of GPU memory. Therefore, we present an efficient memory management strategy that minimizes the bus transfer between main memory and GPU memory. Our results show a fourfold performance gain over a highly optimized CPU implementation using SSE2/3 instructions. At the same time, the image quality is comparable to the CPU results, with an average per-pixel difference of 10^-5.
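The high-pass filtering stage named in the abstract can be illustrated with a minimal CPU sketch. It assumes a plain ramp (Ram-Lak) frequency response applied along each detector row via the FFT, which is the textbook choice for filtered backprojection. The abstract does not specify the filter the authors actually use, and the function name `ramp_filter_rows` and the `pixel_spacing` parameter are hypothetical.

```python
import numpy as np


def ramp_filter_rows(projection: np.ndarray, pixel_spacing: float = 1.0) -> np.ndarray:
    """Apply a 1D ramp (high-pass) filter along the last axis of a projection.

    Assumption: a plain |f| ramp response; the paper's own filter may differ.
    """
    n_cols = projection.shape[-1]
    # Zero-pad to a power of two of at least 2 * n_cols so that the circular
    # convolution implied by the FFT does not wrap around into the row.
    n_fft = 1 << (2 * n_cols - 1).bit_length()
    # Non-negative frequency bins for a real-input FFT of length n_fft.
    freqs = np.fft.rfftfreq(n_fft, d=pixel_spacing)
    ramp = np.abs(freqs)
    # Filter in the frequency domain, then transform back and crop the padding.
    spectrum = np.fft.rfft(projection, n=n_fft, axis=-1)
    filtered = np.fft.irfft(spectrum * ramp, n=n_fft, axis=-1)
    return filtered[..., :n_cols]


if __name__ == "__main__":
    # Three identical detector rows, each containing a single impulse.
    proj = np.zeros((3, 64))
    proj[:, 32] = 1.0
    out = ramp_filter_rows(proj)
    print(out.shape, out[0, 31:34])
```

Because every row is filtered independently, the same operation maps naturally onto a data-parallel processor, which is consistent with the abstract's motivation for moving the pipeline to the GPU.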
Pages: 12