Semi-Supervised Learning for Defect Segmentation with Autoencoder Auxiliary Module

被引：11

作者：

Sae-ang, Bee-ing ^{[1
]}

Kumwilaisak, Wuttipong ^{[1
]}

Kaewtrakulpong, Pakorn ^{[2
]}

机构：

[1] King Mongkuts Univ Technol Thonburi, Elect & Engn, Bangkok 10140, Thailand

[2] Tesla Inc, Austin, TX 78725 USA

来源：

SENSORS | 2022年 / 22卷 / 08期

关键词：

defect segmentation; deep learning; semi-supervised learning; SUPPORT;

D O I：

10.3390/s22082915

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

In general, one may have access to a handful of labeled normal and defect datasets. Most unlabeled datasets contain normal samples because the defect samples occurred rarely. Thus, the majority of approaches for anomaly detection are formed as unsupervised problems. Most of the previous methods have typically chosen an autoencoder to extract the common characteristics of the unlabeled dataset, assumed as normal characteristics, and determine the unsuccessfully reconstructed area as the defect area in an image. However, we could waste the ground truth data if we leave them unused. In addition, a suitable choice of threshold value is needed for anomaly segmentation. In our study, we propose a semi-supervised setting to make use of both unlabeled and labeled samples and the network is trained to segment out defect regions automatically. We first train an autoencoder network to reconstruct defect-free images from an unlabeled dataset, mostly containing normal samples. Then, a difference map between the input and the reconstructed image is calculated and feeds along with the corresponding input image into the subsequent segmentation module. We share the ground truth for both kinds of input and train the network with binary cross-entropy loss. Additional difference images can also increase stability during training. Finally, we show extensive experimental results to prove that, with help from a handful of ground-truth segmentation maps, the result is improved overall by 3.83%.

引用

页数：15

共 25 条

[1]

Baldi P., 2011, P ICML WORKSH UNS TR, V27

[2]

Bergmann P., 2019, P INT JOINT C COMP V

[3] MVTec AD - A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection [J].

Bergmann, Paul ;

Fauser, Michael ;

Sattlegger, David ;

Steger, Carsten .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9584-9592

[4] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[5] Context Contrasted Feature and Gated Multi-scale Aggregation for Scene Segmentation [J].

Ding, Henghui ;

Jiang, Xudong ;

Shuai, Bing ;

Liu, Ai Qun ;

Wang, Gang .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2393-2402

[6]

Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672

[7]

Kim J, 2012, J MACH LEARN RES, V13, P2529

[8]

Kumwilaisak W., 2019, INTELLIGENT VISUAL C, V11st

[9] Image Denoising With Deep Convolutional Neural and Multi-Directional Long Short-Term Memory Networks Under Poisson Noise Environments [J].

Kumwilaisak, Wuttipong ;

Piriyatharawet, Teerawat ;

Lasang, Pongsak ;

Thatphithakkul, Nattanun .

IEEE ACCESS, 2020, 8 :86998-87010

[10]

Laine S., 2017, INT C LEARNING REPRE, DOI DOI 10.48550/ARXIV.1610.02242

← 1 2 3 →