MTDiff: Visual anomaly detection with multi-scale diffusion models

Cited by: 7
Authors
Wang, Xubin [1]
Li, Wenju [1]
He, Xiangjian [2]
Affiliations
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China
[2] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Peoples R China
Keywords
Anomaly detection; Image processing; Diffusion probabilistic model; Pattern recognition; Visual application; INSPECTION;
DOI
10.1016/j.knosys.2024.112364
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Advances in computer vision have driven rapid progress in unsupervised anomaly detection, but current methods often struggle with anomalies of varying scales, and intricate pipelines that require significant tuning further limit their usability. In this work, we propose MTDiff, a novel anomaly detection method comprising diffusion models built at different scales. In essence, the scale-specific branches and their combination enhance pattern coverage and thus improve performance. MTDiff involves two parts: reconstruction, which repairs anomalous regions to a pseudo-normal state, and detection, which carefully compares and localizes the anomalies. Instead of the typical forward diffusion process, we construct a partial Markov chain to improve reconstruction quality. For discrimination, we construct a simple but effective detector that operates at the feature level to better exploit rich contextual information. MTDiff comes with a concise training pipeline, and optimized diffusion iterations ensure efficiency. Extensive experiments show that it outperforms state-of-the-art approaches, with superior stability and robustness in both image- and pixel-level anomaly detection. The related code is available at https://github.com/vergilben/MTDiff.
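The abstract does not spell out how the partial Markov chain is built. As a rough illustration only, one common reading is that the input is noised up to an intermediate step rather than to the end of the chain, so that enough structure survives for reconstruction. The sketch below assumes a standard DDPM-style linear noise schedule and the closed-form forward sample; the names `linear_betas`, `alpha_bar`, `partial_forward`, and `t_partial`, and all numeric values, are hypothetical and are not taken from the MTDiff code.

```python
import math
import random


def linear_betas(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linear noise schedule (an assumed, commonly used DDPM default)."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def alpha_bar(betas, t):
    """Cumulative product of (1 - beta_s) over the first t steps."""
    prod = 1.0
    for beta in betas[:t]:
        prod *= 1.0 - beta
    return prod


def partial_forward(x0, betas, t_partial, rng):
    """Noise x0 only up to step t_partial, using the closed form
    x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps."""
    a_bar = alpha_bar(betas, t_partial)
    signal = math.sqrt(a_bar)
    noise = math.sqrt(1.0 - a_bar)
    return [signal * v + noise * rng.gauss(0.0, 1.0) for v in x0]


# Hypothetical values: a 1000-step chain truncated at step 250.
betas = linear_betas(1000)
x0 = [0.5, -0.2, 0.8, 0.0]          # stand-in for flattened pixel values
rng = random.Random(0)              # seeded for repeatability
x_partial = partial_forward(x0, betas, t_partial=250, rng=rng)
```

Under this assumed schedule, far more of the original signal is retained at the truncated step than at the end of the full chain, which is the intuition behind stopping early before running the reverse (denoising) process.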
Pages: 13