Attention-based convolution neural network for magnetic tile surface defect classification and detection

被引:6
作者
Li, Ju [1 ]
Wang, Kai [1 ]
He, Mengfan [1 ]
Ke, Luyao [1 ]
Wang, Heng [1 ]
机构
[1] Sichuan Univ, Sch Mech Engn, Innovat Method & Creat Design Key Lab Sichuan Prov, Chengdu 610065, Peoples R China
关键词
Attention-based CNNs; Multi-layer convolution; Magnetic tile; Image classification; Surface-defect detection;
D O I
10.1016/j.asoc.2024.111631
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effectively identifying surface defects in magnetic tiles has proven to be highly challenging due to limited sample availability and irrelevant background interference, which also plays a crucial role in significantly influencing the lifespan and reliability of permanent magnet motors. To address these challenges, our study draws inspiration from a comprehensive analysis of the retinal attention mechanism and proposes three guiding criteria: multi -level resolution, what to look for, and where to look at. These criteria are utilized as foundational principles to enhance the representation learning capability of designed neural network structures through the incorporation of the retinal attention mechanism. Subsequently, based on these guiding criteria, we introduce a novel convolutional retinal attention block (CRAB) to learn discriminative and robust feature representations for magnetic tile surface defect classification and detection. The proposed CRAB comprises three modules: multi -resolution module (MRM), global attention aggregation module (GAAM), and local attention aggregation module (LAAM), designed to extract discriminative and robust features by refining meaningful information and suppressing redundant ones. Comprehensive experimental results across image classification and object detection tasks demonstrate that the proposed CRAB outperforms existing methods such as SE, ECA, and CBAM, and can effectively amplify the representation power across various backbone networks, including VGG-16, GoogLeNet, ResNet-18, and ResNet-50. An evaluation on surface defect classification and detection tasks for industrial magnetic tiles further shows that CRAB achieves accuracies of 99.50% and 96.98%, respectively. These results emphasize the promising application prospects of the proposed method in detecting industrial surface defects amid expansive and inconsequential backgrounds. The code of the proposed method is available at: https://github.com/KWflyer/CRAB.
引用
收藏
页数:11
相关论文
共 54 条
  • [11] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [12] Functional connectivity in the retina at the resolution of photoreceptors
    Field, Greg D.
    Gauthier, Jeffrey L.
    Sher, Alexander
    Greschner, Martin
    Machado, Timothy A.
    Jepson, Lauren H.
    Shlens, Jonathon
    Gunning, Deborah E.
    Mathieson, Keith
    Dabrowski, Wladyslaw
    Paninski, Liam
    Litke, Alan M.
    Chichilnisky, E. J.
    [J]. NATURE, 2010, 467 (7316) : 673 - U54
  • [13] Fu C.-Y., 2017, arXiv
  • [14] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [15] Garway-Heath DF, 2000, INVEST OPHTH VIS SCI, V41, P1774
  • [16] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [17] Deep Pyramidal Residual Networks
    Han, Dongyoon
    Kim, Jiwhan
    Kim, Junmo
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6307 - 6315
  • [18] Out-of-distribution detection-assisted trustworthy machinery fault diagnosis approach with uncertainty-aware deep ensembles
    Han, Te
    Li, Yan-Fu
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2022, 226
  • [19] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [20] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778