GLF-NET: Global and Local Dynamic Feature Fusion Network for Real-Time Steel Strip Surface Defect Detection

被引:0
作者
Ma, Yunfei [1 ]
Zhang, Zhaohui [1 ]
Ma, Shaocheng [1 ]
Shi, Kailun [1 ]
Fan, Chenglong [1 ]
机构
[1] Hebei Normal Univ, Coll Comp & Cyber Secur, Hebei Prov Engn Res Ctr Supply Chain Big Data Anal, Hebei Prov Key Lab Network & Informat Secur, Shijiazhuang 050024, Peoples R China
关键词
Feature extraction; Defect detection; Steel; Strips; YOLO; Surface treatment; Real-time systems; Detectors; Semantics; Neck; Surface defect detection; YOLOv5s; feature fusion; receptive field; attention mechanism;
D O I
10.1109/ACCESS.2025.3539350
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Surface defect detection plays a crucial role in ensuring the quality standards of hot-rolled steel strips. To meet the demands for high precision and real-time performance in industrial defect detection, this paper introduces an improved one-stage detector based on YOLOv5s, named GLF-NET, that focuses on a good balance between speed and precision. Firstly, an Attention Augmented Module (AAM) is proposed and used in the backbone, with the aim of minimizing the loss of semantic and location information of defects during the process of feature extraction. Secondly, to enrich the model's capacity of representing multi-scale features of defects, an innovative Global and Local Dynamic Feature Fusion (GLF) module is designed and plugged into the top-down FPN part of the neck, bridging the semantic gap between different feature layers and enabling the model to adaptively select features for fusion. Additionally, a novel Receptive Field Augmented Module (RFA) is proposed and integrated into the bottom-up PAN structure of the neck, enhancing the detector's ability of perceiving defects with irregular shapes and large aspect ratios. Extensive experimental results on the NEU-DET steel strip surface defect dataset demonstrate that GLF-NET obtains an impressive mAP value of 79.2%, exceeding YOLOv5s by 4.2%. Furthermore, with an impressive detection speed of 95 Frames Per Second (FPS), GLF-NET not only meets the real-time demands of industrial defect detection but also demonstrates exceptional capabilities in defect detection. Code is available at https://github.com/MYF1124/GLF-NET.
引用
收藏
页码:26063 / 26078
页数:16
相关论文
共 66 条
[1]  
Abeywickrama T, 2016, ARXIV
[2]  
Bochkovskiy A., 2020, YOLOv4: Optimal Speed and Accuracy of Object Detection, DOI 10.48550/ARXIV.2004.10934
[3]   RetinaNet With Difference Channel Attention and Adaptively Spatial Feature Fusion for Steel Surface Defect Detection [J].
Cheng, Xun ;
Yu, Jianbo .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70 (70)
[4]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[5]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[6]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[7]   PGA-Net: Pyramid Feature Fusion and Global Context Attention Network for Automated Surface Defect Detection [J].
Dong, Hongwen ;
Song, Kechen ;
He, Yu ;
Xu, Jing ;
Yan, Yunhui ;
Meng, Qinggang .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (12) :7448-7458
[8]  
Gevorgyan Z., 2022, arXiv
[9]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[10]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587