Noise-Robust Diffusion Based Semantic Segmentation

被引：0

作者：

Kaya, Ahmet Kagan ^{[1
]}

机构：

[1] Aselsan Inc, Yenimahalle, Turkiye

来源：

2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2023年

关键词：

semantic segmentation; diffusion models; time embedding;

D O I：

10.1109/SIU59756.2023.10223870

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many semantic segmentation methods that are either fast or having high accuracy based on Jaccard index(IoU) have been proposed in the literature. Therefore, diffusion models, which have shown very successful results recently, have also been used in semantic segmentation studies. Although diffusion based segmentation methods work faster than the current state-of-the-art methods, diffusion models did not provide sufficient performance in terms of accuracy. However, it is important observation that noisy and ordinary images can be used together since the diffusion models have a structure that is robust for learning. In this study, the NRSeg architecture in which diffusion models can use both noisy and ordinary images together was created. The new model performance was measured in terms of IoU and these results were compared with the performances of the state-of-the-art methods in the literature.

引用

页数：4

共 16 条

[1]

Amit T, 2022, Arxiv, DOI [arXiv:2112.00390, 10.48550/arXiv.2112.00390]

[2]

Baranchuk D., 2022, arXiv, DOI 10.48550/arXiv.2112.03126

[3]

Bucher M, 2019, ADV NEUR IN, V32

[4] Emerging Properties in Self-Supervised Vision Transformers [J].

Caron, Mathilde ;

Touvron, Hugo ;

Misra, Ishan ;

Jegou, Herve ;

Mairal, Julien ;

Bojanowski, Piotr ;

Joulin, Armand .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640

[5]

Chen SF, 2023, Arxiv, DOI arXiv:2211.09788

[6]

He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]

[7]

Ho J., 2020, Advances in neural information processing systems, V33, P6840

[8]

Liu MY, 2019, Arxiv, DOI arXiv:1909.08599

[9] Deep Learning Face Attributes in the Wild [J].

Liu, Ziwei ;

Luo, Ping ;

Wang, Xiaogang ;

Tang, Xiaoou .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3730-3738

[10]

Luo C., 2022, arXiv, DOI 10.48550/arXiv.2208.11970

← 1 2 →