共 50 条
Real-time High-resolution X-Ray Computed Tomography
被引:1
|作者:
Wu, Du
[1
,2
]
Chen, Peng
[2
,3
]
Wang, Xiao
[4
]
Lyngaas, Issac
[4
]
Miyajima, Takaaki
[5
]
Endo, Toshio
[1
]
Matsuoka, Satoshi
[2
]
Wahib, Mohamed
[2
]
机构:
[1] Tokyo Inst Technol, Tokyo, Japan
[2] RIKEN CCS, Kobe, Hyogo, Japan
[3] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan
[4] Oak Ridge Natl Lab, Oak Ridge, TN USA
[5] Meiji Univ, Tokyo, Japan
来源:
PROCEEDINGS OF THE 38TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2024
|
2024年
关键词:
Computed Tomography;
Tensor Cores;
GPU;
BEAM CT RECONSTRUCTION;
IMAGE-RECONSTRUCTION;
ALGORITHMS;
GPU;
D O I:
10.1145/3650200.3656634
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Computed Tomography (CT) serves as a key imaging technology that relies on computationally intensive filtering and backprojection algorithms for 3D image reconstruction. While conventional high-resolution image reconstruction (> 2K(3)) solutions provide quick results, they typically treat reconstruction as an offline workload to be performed remotely on large-scale HPC systems. The growing demand for post-construction AI-driven analytics and the need for real-time adjustments call for high-resolution reconstruction solutions that are feasible on local computing resources, i.e. a multi-GPU server at most. In this paper, we propose a novel approach that utilizes Tensor Cores to optimize image reconstruction without sacrificing precision. We also introduce a framework designed to enable real-time execution of end-to-end distributed image reconstruction in a multi-GPU environment. Evaluations conducted on a single Nvidia A100 and H100 GPU show performance improvements of 1.91x and 2.15x compared to highly optimized production libraries. Furthermore, our framework, when deployed on 8-card Nvidia A100 GPU system, demonstrates the ability to reconstruct real-world datasets into 2048(3) volumes (32 GB) in slightly more than one minute and 4096(3) volumes (256 GB) in 7 minutes.
引用
收藏
页码:110 / 123
页数:14
相关论文