Unsupervised Visual Anomaly Detection Using Self-Supervised Pre-Trained Transformer

被引:0
作者
Kim, Jun-Hyung [1 ]
Kwon, Goo-Rak [1 ]
机构
[1] Chosun Univ, Dept Informat & Commun Engn, Gwangju 61452, South Korea
关键词
Image reconstruction; Image segmentation; Transformers; Computational modeling; Location awareness; Feature extraction; Anomaly detection; Data augmentation; Self-supervised learning; data-augmentation; self-supervised learning; transformer;
D O I
10.1109/ACCESS.2024.3454753
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the various industrial manufacturing processes, the automatic visual inspection system is an essential part as it reduces the chances of delivering defective products and the cost of training and hiring experts for manual inspection. In this work, we propose a new unsupervised anomaly detection method inspired by the masked language model for the automatic visual inspection system. The proposed method consists of an image tokenizer and two subnetworks, a reconstruction subnetwork, and a segmentation subnetwork. We adopt a pre-trained self-supervised vision Transformer model to use it as an image tokenizer. Our first subnetwork is trained to predict the anomaly-free patch tokens and the second subnetwork is trained to produce anomaly segmentation results from both the reconstructed and input patch tokens. During training, only the two subnetworks are optimized, and parameters of an image tokenizer are frozen. Experimental results show that the proposed method exhibits better performance than conventional methods in detecting defective products by achieving 99.05% I-AUROC on MVTecAD dataset and 94.8% I-AUROC on BTAD.
引用
收藏
页码:127604 / 127613
页数:10
相关论文
共 48 条
[1]  
Bergmann P, 2019, Arxiv, DOI [arXiv:1807.02011, DOI 10.48550/ARXIV.1807.02011]
[2]   MVTec AD - A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection [J].
Bergmann, Paul ;
Fauser, Michael ;
Sattlegger, David ;
Steger, Carsten .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9584-9592
[3]   Emerging Properties in Self-Supervised Vision Transformers [J].
Caron, Mathilde ;
Touvron, Hugo ;
Misra, Ishan ;
Jegou, Herve ;
Mairal, Julien ;
Bojanowski, Piotr ;
Joulin, Armand .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640
[4]  
Chen T., 2020, INT C MACHINE LEARNI, P1597
[5]   Describing Textures in the Wild [J].
Cimpoi, Mircea ;
Maji, Subhransu ;
Kokkinos, Iasonas ;
Mohamed, Sammy ;
Vedaldi, Andrea .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3606-3613
[6]  
Cohen N, 2021, Arxiv, DOI arXiv:2005.02357
[7]   Improved anomaly detection by training an autoencoder with skip connections on images corrupted with Stain-shaped noise [J].
Collin, Anne-Sophie ;
De Vleeschouwer, Christophe .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :7915-7922
[8]   A Survey on Unsupervised Anomaly Detection Algorithms for Industrial Images [J].
Cui, Yajie ;
Liu, Zhaoxiang ;
Lian, Shiguo .
IEEE ACCESS, 2023, 11 :55297-55315
[9]  
Darcet T, 2024, Arxiv, DOI [arXiv:2309.16588, DOI 10.48550/ARXIV.2309.16588]
[10]   Masked Transformer for Image Anomaly Localization [J].
De Nardin, Axel ;
Mishra, Pankaj ;
Foresti, Gian Luca ;
Piciarelli, Claudio .
INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2022, 32 (07)