Computed Tomography (CT) serves as a key imaging technology that relies on computationally intensive filtering and backprojection algorithms for 3D image reconstruction. While conventional high-resolution image reconstruction (> 2K(3)) solutions provide quick results, they typically treat reconstruction as an offline workload to be performed remotely on large-scale HPC systems. The growing demand for post-construction AI-driven analytics and the need for real-time adjustments call for high-resolution reconstruction solutions that are feasible on local computing resources, i.e. a multi-GPU server at most. In this paper, we propose a novel approach that utilizes Tensor Cores to optimize image reconstruction without sacrificing precision. We also introduce a framework designed to enable real-time execution of end-to-end distributed image reconstruction in a multi-GPU environment. Evaluations conducted on a single Nvidia A100 and H100 GPU show performance improvements of 1.91x and 2.15x compared to highly optimized production libraries. Furthermore, our framework, when deployed on 8-card Nvidia A100 GPU system, demonstrates the ability to reconstruct real-world datasets into 2048(3) volumes (32 GB) in slightly more than one minute and 4096(3) volumes (256 GB) in 7 minutes.
机构:
Moscow MV Lomonosov State Univ, Fac Phys, Dept Math, Moscow 119991, RussiaMoscow MV Lomonosov State Univ, Fac Phys, Dept Math, Moscow 119991, Russia
Titarenko, S. S.
Yagola, A. G.
论文数: 0引用数: 0
h-index: 0
机构:
Moscow MV Lomonosov State Univ, Fac Phys, Dept Math, Moscow 119991, RussiaMoscow MV Lomonosov State Univ, Fac Phys, Dept Math, Moscow 119991, Russia