CAFS-Net: Context-Aware Feature Selection Network for Accurate and Efficient Tiny Surface Defect Segmentation

被引：1

作者：

Tian, Zhonghua ^{[1
]}

Yang, Xianqiang ^{[1
]}

机构：

[1] Harbin Inst Technol, Res Inst Intelligent Control & Syst, Harbin 150001, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2025年 / 74卷

关键词：

Feature extraction; Transformers; Semantic segmentation; Inspection; Training; Accuracy; Noise; Decoding; Computer vision; Computational modeling; Concentrated random cropping strategy; context-aware encoder network (CANet); feature selection module (FSM); lightweight feature fusing decoder; tiny defect segmentation;

D O I：

10.1109/TIM.2025.3547078

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Although deep learning-based methods have achieved remarkable performance in the industrial defect segmentation area, tiny defect segmentation in ultrahigh-resolution image scenarios still remains unexplored. Most of the existing methods utilize attention mechanisms and a sliding window strategy for tiny defect segmentation. However, this approach is not only computationally demanding but also prone to texture noise interference, likely stemming from a lack of global contextual understanding. To alleviate this challenge, we propose a context-aware feature selection network (CAFS-Net) which consists of a context-aware encoder network (CANet), a novel feature selection module (FSM), and a lightweight feature-fusing decoder. The CANet is constructed by low-level convolutional blocks and high-level transformer blocks to capture both local and global context information, thereby enhancing the discrimination ability between tiny defects and texture noise. The FSM includes a multilayer perceptron (MLP) classifier and a selector for selecting defective features of image blocks from the feature pyramid of image blocks based on the classification outcomes. Then, the selected defective features are fed to the lightweight feature-fusing decoder to perform multiscale feature fusion and obtain the segmentation masks of the defective image patches. Additionally, we propose a concentrated random cropping data augmentation strategy to address the class imbalance problem during training. We conducted extensive experiments on two defect segmentation datasets, including the compact camera module (CCM) defect segmentation dataset and the printed circuit board (PCB) defect segmentation dataset, to demonstrate the superiority and generalization performance of our proposed model. The results show that our CAFS-Net outperforms other state-of-the-art (SOTA) methods in both accuracy and efficiency.

引用

页数：11

共 37 条

[1]

Anantrasirichai N, 2019, IEEE IMAGE PROC, P2481, DOI [10.1109/ICIP.2019.8803305, 10.1109/icip.2019.8803305]

[2]

Cao X., 2020, P IEEE INT S PROD CO, P1

[3] Masked-attention Mask Transformer for Universal Image Segmentation [J].

Cheng, Bowen ;

Misra, Ishan ;

Schwing, Alexander G. ;

Kirillov, Alexander ;

Girdhar, Rohit .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :1280-1289

[4]

Dosovitskiy A., 2021, P INT C LEARN REPR, DOI [10.48550/arXiv.2010.11929, DOI 10.48550/ARXIV.2010.11929]

[5]

Guo MH, 2022, ADV NEUR IN

[6] FLatten Transformer: Vision Transformer using Focused Linear Attention [J].

Han, Dongchen ;

Pan, Xuran ;

Han, Yizeng ;

Song, Shiji ;

Huang, Gao .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, :5938-5948

[7] Adaptive Pyramid Context Network for Semantic Segmentation [J].

He, Junjun ;

Deng, Zhongying ;

Zhou, Lei ;

Wang, Yali ;

Qiao, Yu .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7511-7520

[8] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[9]

Hong YD, 2021, Arxiv, DOI arXiv:2101.06085

[10] Searching for MobileNetV3 [J].

Howard, Andrew ;

Sandler, Mark ;

Chu, Grace ;

Chen, Liang-Chieh ;

Chen, Bo ;

Tan, Mingxing ;

Wang, Weijun ;

Zhu, Yukun ;

Pang, Ruoming ;

Vasudevan, Vijay ;

Le, Quoc V. ;

Adam, Hartwig .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1314-1324

← 1 2 3 4 →