Importance of Aligning Training Strategy with Evaluation for Diffusion Models in 3D Multiclass Segmentation

被引：0

作者：

Fu, Yunguan ^{[1
,2
]}

Li, Yiwen ^{[3
]}

Saeed, Shaheer U. ^{[1
]}

Clarkson, Matthew J. ^{[1
]}

Hu, Yipeng ^{[1
,3
]}

机构：

[1] UCL, London, England

[2] InstaDeep, London, England

[3] Univ Oxford, Oxford, England

来源：

DEEP GENERATIVE MODELS, DGM4MICCAI 2023 | 2024年 / 14533卷

基金：

英国工程与自然科学研究理事会;

关键词：

Image Segmentation; Diffusion Model; Prostate MR; Abdominal CT;

D O I：

10.1007/978-3-031-53767-7_9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, denoising diffusion probabilistic models (DDPM) have been applied to image segmentation by generating segmentation masks conditioned on images, while the applications were mainly limited to 2D networks without exploiting potential benefits from the 3D formulation. In this work, we studied the DDPM-based segmentation model for 3D multiclass segmentation on two large multiclass data sets (prostate MR and abdominal CT). We observed that the difference between training and test methods led to inferior performance for existing DDPM methods. To mitigate the inconsistency, we proposed a recycling method which generated corrupted masks based on the model's prediction at a previous time step instead of using ground truth. The proposed method achieved statistically significantly improved performance compared to existing DDPMs, independent of a number of other techniques for reducing train-test discrepancy, including performing mask prediction, using Dice loss, and reducing the number of diffusion time steps during training. The performance of diffusion models was also competitive and visually similar to non-diffusion-based U-net, within the same compute budget. The JAX-based diffusion framework has been released at https:// github.com/mathpluscode/ImgX-DiffSeg.

引用

页码：86 / 95

页数：10

共 28 条

[1] Ali Hazrat, 2023, Artificial Intelligence and Cognitive Science: 30th Irish Conference, AICS 2022, Revised Selected Papers. Communications in Computer and Information Science (1662), P32, DOI 10.1007/978-3-031-26438-2_3
[2] Amit T, 2022, Arxiv, DOI arXiv:2112.00390
[3] Baranchuk D., 2021, arXiv
[4] Chen T, 2023, Arxiv, DOI arXiv:2210.06366
[5] Dhariwal P, 2021, ADV NEUR IN, V34
[6] Fu YG, 2019, Arxiv, DOI arXiv:1908.08035
[7] Ho, 2020, ADV NEURAL INFORM PR, V33, P6840, DOI [10.48550/arXiv.2006.11239, DOI 10.48550/ARXIV.2006.11239]
[8] Ho JAT, 2022, Arxiv, DOI [arXiv:2207.12598, 10.48550/arXiv.2207.12598]
[9] Ji YF, 2022, Arxiv, DOI arXiv:2206.08023
[10] Khader F, 2022, Arxiv, DOI arXiv:2211.03364

← 1 2 3 →