Enriched multi-scale cascade pyramid features and guided context attention network for industrial surface defect detection

被引:17
|
作者
Shao, Linhao [1 ]
Zhang, Erhu [1 ]
Duan, Jinghong [2 ]
Ma, Qiurui [3 ]
机构
[1] Xian Univ Technol, Dept Informat Sci, Xian 710048, Peoples R China
[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[3] Xian Univ Technol, Sch Mech & Precis Instrument Engn, Xian 710048, Peoples R China
基金
中国国家自然科学基金;
关键词
Surface defect detection; Deep learning; Pyramid feature fusion; Guided context attention; Attention mechanism; CLASSIFICATION;
D O I
10.1016/j.engappai.2023.106369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Surface defect detection is a very important technique to guarantee product quality in industrial fields. However, the detection of multi-scale defects and defects with poor visibility is still a challenging problem. To address this issue, we propose a novel network by collaborating multi-scale cascade pyramid features and a guided context attention mechanism for the pixel-wise defection of surface defects, called MPA-Net. The MPA-Net is a full y-convolutional network (FCN) with an encoder-decoder architecture, which can integrate multi-scale features and merge them into the different stages of the decoder for generating the defect segmentation map. Specifically, the proposed guided context attention module (GCA) is used to transmit the global context information from the large scale to the small scale, which can promote the initial recovery capability of the decoder, and thus help to locate defects with different sizes and defects with poor visibility. Moreover, the proposed pyramid feature fusion and enrichment module (FFEM) is employed to aggregate low-level coarse features and high-level semantic features in each scale, so as to increase the ability of defect feature representation. The aggregation features at different scales are then fused to the different layers of the decoder, which is beneficial to recover the details of defects gradually. The evaluation results on four public datasets demonstrate that the proposed method has excellent performances on mean intersection of union (DAGM2007: 64.94%, KolektorSSD: 77.90%, RSDDs-I: 86.63%, RSDDs-II: 80.62%, FID: 96.98%) and mean pixel accuracy (DAGM2007: 67.97%, KolektorSSD: 85.01%, RSDDs-I: 94.13%, RSDDs-II: 88.53%, FID: 98.71%).
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Multi-scale Semantic Segmentation Enriched Features for Pedestrian Detection
    Xie, Xiaolu
    Wang, Zengfu
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2196 - 2201
  • [32] Steel defect detection based on multi-scale lightweight attention
    Zhou Y.
    Meng J.-N.
    Wang D.-L.
    Tan Y.-Q.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (03): : 901 - 909
  • [33] CONTEXT-AWARE HIERARCHICAL FEATURE ATTENTION NETWORK FOR MULTI-SCALE OBJECT DETECTION
    Xu, Xuelong
    Luo, Xiangfeng
    Ma, Liyan
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2011 - 2015
  • [34] PPA-Net: Pyramid Pooling Attention Network for Multi-Scale Ship Detection in SAR Images
    Tang, Gang
    Zhao, Hongren
    Claramunt, Christophe
    Zhu, Weidong
    Wang, Shiming
    Wang, Yide
    Ding, Yuehua
    REMOTE SENSING, 2023, 15 (11)
  • [35] MAPPNet: A Multi-Scale Attention Pyramid Pooling Network for Dental Calculus Segmentation
    Nie, Tianyu
    Yao, Shihong
    Wang, Di
    Wang, Conger
    Zhao, Yishi
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [36] Multi-scale feature fusion pyramid attention network for single image dehazing
    Liu, Jianlei
    Liu, Peng
    Zhang, Yuanke
    IET IMAGE PROCESSING, 2023, 17 (09) : 2726 - 2735
  • [37] Incorporating Triple Attention and Multi-scale Pyramid Network for Underwater Image Enhancement
    Sun, Kaichuan
    Tian, Yubo
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (03) : 387 - 397
  • [38] Attention-guided multi-scale context aggregation network for multi-modal brain glioma segmentation
    Wu, Shaozhi
    Cao, Yunjian
    Li, Xinke
    Liu, Qiyu
    Ye, Yuyun
    Liu, Xingang
    Zeng, Liaoyuan
    Tian, Miao
    MEDICAL PHYSICS, 2023, 50 (12) : 7629 - 7640
  • [39] A multi-scale semantically enriched feature pyramid network with enhanced focal loss for small-object detection
    Kiobya, Twahir
    Zhou, Junfeng
    Maiseli, Baraka
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [40] Dense Attention Pyramid Networks for Multi-Scale Ship Detection in SAR Images
    Cui, Zongyong
    Li, Qi
    Cao, Zongjie
    Liu, Nengyuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11): : 8983 - 8997