Transformer-based Encoder-Decoder Model for Surface Defect Detection

被引:0
作者
Lu, Xiaofeng [1 ]
Fan, Wentao [1 ]
机构
[1] Huaqiao Univ, Coll Comp Sci & Technol, Xiamen, Peoples R China
来源
6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022 | 2022年
基金
中国国家自然科学基金;
关键词
transformer; surface anomaly detection; deep learning; industrial quality inspection; NEURAL-NETWORK;
D O I
10.1145/3529466.3529471
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, deep learning approaches have been gaining popularity in industrial quality control (e.g. surface defect detection), due to their ability for automatically extracting more representative features. In this paper, we propose a two-stage end-to-end approach through a Transformer-based encoder-decoder for surface defect detection. First, we develop a surface defect detection model to train the slicing of input raw images with the same final resolution of the input images and the output images, which better expands the perceptual field. After that, a 1x1 convolution layer is applied to its final layer, thus reducing the number of channels to obtain a single-channel output mask. Then, we combine this single-channel output mask with the output obtained from the last layer of the first stage as the input of the second stage decision layer. Considering different types of sample data, we design two different decision network strategies, namely: plain-up sampling and dynamic-up sampling. Our experimental studies on several publicly available datasets show that the proposed approach is general and effective in detecting defects, and we only need a relatively small number of samples to train the model, which has a good applicability in industrial practice where the sample size is normally limited.
引用
收藏
页码:125 / 130
页数:6
相关论文
共 26 条
[1]   Uninformed Students: Student-Teacher Anomaly Detection with Discriminative Latent Embeddings [J].
Bergmann, Paul ;
Fauser, Michael ;
Sattlegger, David ;
Steger, Carsten .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4182-4191
[2]   End-to-end training of a two-stage neural network for defect detection [J].
Bozic, Jakob ;
Tabernik, Domen ;
Skocaj, Danijel .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :5619-5626
[3]   Mixed supervision for surface-defect detection: From weakly to fully supervised learning [J].
Bozic, Jakob ;
Tabernik, Domen ;
Skocaj, Danijel .
COMPUTERS IN INDUSTRY, 2021, 129
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]   Visual-Based Defect Detection and Classification Approaches for Industrial Applications-A SURVEY [J].
Czimmermann, Tamas ;
Ciuti, Gastone ;
Milazzo, Mario ;
Chiurazzi, Marcello ;
Roccella, Stefano ;
Oddo, Calogero Maria ;
Dario, Paolo .
SENSORS, 2020, 20 (05)
[6]  
Damacharla P., 2021, 2021 INT C APPL ART, P1, DOI DOI 10.1109/ICAPAI49758.2021.9462060
[7]   Defect Detection and Classification by Training a Generic Convolutional Neural Network Encoder [J].
Dong, Xinghui ;
Taylor, Christopher J. ;
Cootes, Tim F. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 :6055-6069
[8]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[9]  
Faghih-Roohi S, 2016, IEEE IJCNN, P2584, DOI 10.1109/IJCNN.2016.7727522
[10]  
Hahn, 2007, SENSATIONAL KNOWLEDG