Dual-Attention Transformer and Discriminative Flow for Industrial Visual Anomaly Detection

被引:11
作者
Yao, Haiming [1 ]
Luo, Wei [1 ]
Yu, Wenyong [2 ]
Zhang, Xiaotian [1 ]
Qiang, Zhenfeng [1 ]
Luo, Donghao [1 ]
Shi, Hui [3 ]
机构
[1] Tsinghua Univ, Dept Precis Instrument, State Key Lab Precis Measurement Technol & Instru, Beijing 100084, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Peoples R China
[3] Wuhan Univ Technol, Sch Mech & Elect Engn, Wuhan 430070, Peoples R China
基金
中国国家自然科学基金;
关键词
Anomaly detection; anomaly localization; complex semantic defect inspection; dual-attention transformer; discriminative normalizing flow;
D O I
10.1109/TASE.2023.3322156
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we introduce the novel state-of-the-art Dual-attention Transformer and Discriminative Flow (DADF) framework for visual anomaly detection. Based on only normal knowledge, visual anomaly detection has wide applications in industrial scenarios and has attracted significant attention. However, most existing methods fail to meet the requirements of logic defect detection under complex semantic conditions. In contrast, the proposed DADF presents a new paradigm: it firstly leverages a pre-trained network to acquire multi-scale prior embeddings, followed by the development of a vision Transformer with dual attention mechanisms, namely self-attention and memorial-attention, to achieve global-local two-level reconstruction for prior embeddings with the sequential and normality association. Additionally, we propose using normalizing flow to establish discriminative likelihood for the joint distribution of prior and reconstructions at each scale. The experimental results validate the effectiveness of the proposed DADF approach, as evidenced by the impressive performance metrics obtained across various benchmarks, especially for logic defects with complex semantics. Specifically, DADF achieves image-level and pixel-level AUROC scores of 98.3 and 98.4, respectively, on the Mvtec AD benchmark, and an image-level AUROC score of 83.7 and a pixel sPRO score of 67.4 on the Mvtec LOCO AD benchmark. Additionally, we applied DADF to a real-world Printed Circuit Board (PCB) industrial defect inspection task, further demonstrating its efficacy in practical scenarios. The source code of DADF is available at https://github.com/hmyao22/DADF. Note to Practitioners-Most of the current industrial visual inspection techniques can only detect structural defects under uncomplicated semantic settings. Detecting anomalies in products featuring intricate components and logical defects with high-level semantics remains a considerable challenge. The presented DADF is a robust model that can effectively identify defects in products with complex components, such as Printed Circuit Boards (PCBs). Furthermore, it can also accurately detect both structural and logical defects, which is of significant importance for practical industrial applications.
引用
收藏
页码:6126 / 6140
页数:15
相关论文
共 36 条
  • [1] Skip-GANomaly: Skip Connected and Adversarially Trained Encoder-Decoder Anomaly Detection
    Akcay, Samet
    Atapour-Abarghouei, Amir
    Breckon, Toby P.
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [2] Beyond Dents and Scratches: Logical Constraints in Unsupervised Anomaly Detection and Localization
    Bergmann, Paul
    Batzner, Kilian
    Fauser, Michael
    Sattlegger, David
    Steger, Carsten
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (04) : 947 - 969
  • [3] Uninformed Students: Student-Teacher Anomaly Detection with Discriminative Latent Embeddings
    Bergmann, Paul
    Fauser, Michael
    Sattlegger, David
    Steger, Carsten
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4182 - 4191
  • [4] MVTec AD - A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
    Bergmann, Paul
    Fauser, Michael
    Sattlegger, David
    Steger, Carsten
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9584 - 9592
  • [5] Bergmann Paul, 2018, arXiv
  • [6] Collaborative Discrepancy Optimization for Reliable Image Anomaly Localization
    Cao, Yunkang
    Xu, Xiaohao
    Liu, Zhaoge
    Shen, Weiming
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (11) : 10674 - 10683
  • [7] Informative knowledge distillation for image anomaly segmentation
    Cao, Yunkang
    Wan, Qian
    Shen, Weiming
    Gao, Liang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 248
  • [8] Cohen N., 2020, arXiv
  • [9] Cohen Niv, 2023, arXiv
  • [10] Anomaly Detection via Reverse Distillation from One-Class Embedding
    Deng, Hanqiu
    Li, Xingyu
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9727 - 9736