An Improved YOLOv5 Algorithm for Steel Surface Defect Detection

被引:1
作者
Li Shaoxiong [1 ]
Shi Zaifeng [1 ,3 ]
Kong Fanning [1 ]
Wang Ruoqi [1 ]
Luo Tao [2 ]
机构
[1] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China
[3] Tianjin Key Lab Imaging & Sensing Microelect Tech, Tianjin 300072, Peoples R China
关键词
surface defect detection; receptive field; feature alignment; decoupled head; attention mechanism;
D O I
10.3788/LOP230711
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Scale of steel surface defects is different, but existing detection algorithms have poor multi-scale feature processing ability and low accuracy. Therefore, an improved YOLOv5 algorithm for steel surface defect detection is proposed. First, receptive field modules are added after the feature output layer of the backbone to enhance the discrimination and robustness of the features which can better perceive the feature information of different scales. Then, aligned feature aggregation modules are used to replace the traditional feature fusion structure to solve the feature misalignment problem in the fusion process of high and low resolution feature maps. Finally, decoupled heads with efficient channel attention mechanisms are used to output the detection results. The attention mechanism can adaptively calibrate the channel response, and the decoupled heads enable classification and regression tasks to be performed independently. The experimental results on NEU-DET dataset show that the mean average precision of the proposed method is 80.51%, which is 4.48% higher than that of the benchmark model, and the detection speed is 31.96 frame/s. Compared with other mainstream object detection algorithms, the proposed algorithm has higher accuracy while maintaining certain detection speed, enabling efficient steel surface defect detection.
引用
收藏
页数:9
相关论文
共 27 条
[21]  
[汤勃 Tang Bo], 2017, [中国图象图形学报, Journal of Image and Graphics], V22, P1640
[22]   Focal Loss for Dense Object Detection [J].
Lin, Tsung-Yi ;
Goyal, Priya ;
Girshick, Ross ;
He, Kaiming ;
Dollar, Piotr .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2999-3007
[23]   YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors [J].
Wang, Chien-Yao ;
Bochkovskiy, Alexey ;
Liao, Hong-Yuan Mark .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :7464-7475
[24]   CSPNet: A New Backbone that can Enhance Learning Capability of CNN [J].
Wang, Chien-Yao ;
Liao, Hong-Yuan Mark ;
Wu, Yueh-Hua ;
Chen, Ping-Yang ;
Hsieh, Jun-Wei ;
Yeh, I-Hau .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :1571-1580
[25]  
Wang Q., 2020, INT SYM QUAL ELECT, P11531, DOI [DOI 10.1109/CVPR42600.2020.01155, DOI 10.1109/isqed48828.2020.9137057, 10.1109/CVPR42600.2020.01155]
[26]  
Yu F., 2016, INT C LEARN REPR MAY
[27]  
Yu X Y, 2022, IEEE Transactions on Instrumentation and Measurement, P71