CAFS-Net: Context-Aware Feature Selection Network for Accurate and Efficient Tiny Surface Defect Segmentation

被引:0
作者
Tian, Zhonghua [1 ]
Yang, Xianqiang [1 ]
机构
[1] Harbin Inst Technol, Res Inst Intelligent Control & Syst, Harbin 150001, Peoples R China
关键词
Feature extraction; Transformers; Semantic segmentation; Inspection; Training; Accuracy; Noise; Decoding; Computer vision; Computational modeling; Concentrated random cropping strategy; context-aware encoder network (CANet); feature selection module (FSM); lightweight feature fusing decoder; tiny defect segmentation;
D O I
10.1109/TIM.2025.3547078
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although deep learning-based methods have achieved remarkable performance in the industrial defect segmentation area, tiny defect segmentation in ultrahigh-resolution image scenarios still remains unexplored. Most of the existing methods utilize attention mechanisms and a sliding window strategy for tiny defect segmentation. However, this approach is not only computationally demanding but also prone to texture noise interference, likely stemming from a lack of global contextual understanding. To alleviate this challenge, we propose a context-aware feature selection network (CAFS-Net) which consists of a context-aware encoder network (CANet), a novel feature selection module (FSM), and a lightweight feature-fusing decoder. The CANet is constructed by low-level convolutional blocks and high-level transformer blocks to capture both local and global context information, thereby enhancing the discrimination ability between tiny defects and texture noise. The FSM includes a multilayer perceptron (MLP) classifier and a selector for selecting defective features of image blocks from the feature pyramid of image blocks based on the classification outcomes. Then, the selected defective features are fed to the lightweight feature-fusing decoder to perform multiscale feature fusion and obtain the segmentation masks of the defective image patches. Additionally, we propose a concentrated random cropping data augmentation strategy to address the class imbalance problem during training. We conducted extensive experiments on two defect segmentation datasets, including the compact camera module (CCM) defect segmentation dataset and the printed circuit board (PCB) defect segmentation dataset, to demonstrate the superiority and generalization performance of our proposed model. The results show that our CAFS-Net outperforms other state-of-the-art (SOTA) methods in both accuracy and efficiency.
引用
收藏
页数:11
相关论文
共 37 条
  • [1] Anantrasirichai N, 2019, IEEE IMAGE PROC, P2481, DOI [10.1109/ICIP.2019.8803305, 10.1109/icip.2019.8803305]
  • [2] Cao X., 2020, P IEEE INT S PROD CO, P1
  • [3] Masked-attention Mask Transformer for Universal Image Segmentation
    Cheng, Bowen
    Misra, Ishan
    Schwing, Alexander G.
    Kirillov, Alexander
    Girdhar, Rohit
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1280 - 1289
  • [4] Dosovitskiy Alexey, 2021, INT C LEARN REPR, DOI DOI 10.48550/ARXIV.2010.11929
  • [5] Guo MH, 2022, ADV NEUR IN
  • [6] FLatten Transformer: Vision Transformer using Focused Linear Attention
    Han, Dongchen
    Pan, Xuran
    Han, Yizeng
    Song, Shiji
    Huang, Gao
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5938 - 5948
  • [7] Adaptive Pyramid Context Network for Semantic Segmentation
    He, Junjun
    Deng, Zhongying
    Zhou, Lei
    Wang, Yali
    Qiao, Yu
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7511 - 7520
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Hong YD, 2021, Arxiv, DOI [arXiv:2101.06085, 10.48550/arXiv.2101.06085]
  • [10] Searching for MobileNetV3
    Howard, Andrew
    Sandler, Mark
    Chu, Grace
    Chen, Liang-Chieh
    Chen, Bo
    Tan, Mingxing
    Wang, Weijun
    Zhu, Yukun
    Pang, Ruoming
    Vasudevan, Vijay
    Le, Quoc V.
    Adam, Hartwig
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1314 - 1324