Enriched multi-scale cascade pyramid features and guided context attention network for industrial surface defect detection

被引：22

作者：

Shao, Linhao ^{[1
]}

Zhang, Erhu ^{[1
]}

Duan, Jinghong ^{[2
]}

Ma, Qiurui ^{[3
]}

机构：

[1] Xian Univ Technol, Dept Informat Sci, Xian 710048, Peoples R China

[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China

[3] Xian Univ Technol, Sch Mech & Precis Instrument Engn, Xian 710048, Peoples R China

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 123卷

基金：

中国国家自然科学基金;

关键词：

Surface defect detection; Deep learning; Pyramid feature fusion; Guided context attention; Attention mechanism; CLASSIFICATION;

D O I：

10.1016/j.engappai.2023.106369

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Surface defect detection is a very important technique to guarantee product quality in industrial fields. However, the detection of multi-scale defects and defects with poor visibility is still a challenging problem. To address this issue, we propose a novel network by collaborating multi-scale cascade pyramid features and a guided context attention mechanism for the pixel-wise defection of surface defects, called MPA-Net. The MPA-Net is a full y-convolutional network (FCN) with an encoder-decoder architecture, which can integrate multi-scale features and merge them into the different stages of the decoder for generating the defect segmentation map. Specifically, the proposed guided context attention module (GCA) is used to transmit the global context information from the large scale to the small scale, which can promote the initial recovery capability of the decoder, and thus help to locate defects with different sizes and defects with poor visibility. Moreover, the proposed pyramid feature fusion and enrichment module (FFEM) is employed to aggregate low-level coarse features and high-level semantic features in each scale, so as to increase the ability of defect feature representation. The aggregation features at different scales are then fused to the different layers of the decoder, which is beneficial to recover the details of defects gradually. The evaluation results on four public datasets demonstrate that the proposed method has excellent performances on mean intersection of union (DAGM2007: 64.94%, KolektorSSD: 77.90%, RSDDs-I: 86.63%, RSDDs-II: 80.62%, FID: 96.98%) and mean pixel accuracy (DAGM2007: 67.97%, KolektorSSD: 85.01%, RSDDs-I: 94.13%, RSDDs-II: 88.53%, FID: 98.71%).

引用

页数：15

共 52 条

[1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[2] Autonomous concrete crack detection using deep fully convolutional neural network [J].

Cao Vu Dung ;

Le Duc Anh .

AUTOMATION IN CONSTRUCTION, 2019, 99 :52-58

[3] Autonomous Structural Visual Inspection Using Region-Based Deep Learning for Detecting Multiple Damage Types [J].

Cha, Young-Jin ;

Choi, Wooram ;

Suh, Gahyun ;

Mahmoudkhani, Sadegh ;

Buyukozturk, Oral .

COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2018, 33 (09) :731-747

[4] Automatic Defect Detection of Fasteners on the Catenary Support Device Using Deep Convolutional Neural Network [J].

Chen, Junwen ;

Liu, Zhigang ;

Wang, Hongrui ;

Nunez, Alfredo ;

Han, Zhiwei .

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2018, 67 (02) :257-269

[5]

Chen LC, 2016, Arxiv, DOI arXiv:1412.7062

[6]

Chen LC, 2017, Arxiv, DOI [arXiv:1606.00915, 10.48550/ARXIV.1606.00915, 10.1109/TPAMI.2017.2699184, DOI 10.48550/ARXIV.1606.00915]

[7]

Chen LC, 2017, Arxiv, DOI arXiv:1706.05587

[8] Pavement crack detection and recognition using the architecture of segNet [J].

Chen, Tingyang ;

Cai, Zhenhua ;

Zhao, Xi ;

Chen, Chen ;

Liang, Xufeng ;

Zou, Tierui ;

Wang, Pan .

JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2020, 18 (18)

[9]

DAGM, 2018, DAT

[10] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

← 1 2 3 4 5 6 →