Enhanced You Only Look Once X for surface defect detection of strip steel

被引:5
作者
Wu, Ruiqi [1 ]
Zhou, Feng [1 ]
Li, Nan [1 ]
Liu, Haibo [1 ]
Guo, Naihong [2 ]
Wang, Rugang [1 ]
机构
[1] Yancheng Inst Technol, Sch Informat Technol, Yancheng, Peoples R China
[2] Yancheng Xiongying Precis Machinery Co Ltd, Yancheng, Peoples R China
关键词
strip steel; surface defect detection; YOLOX; lightweight; attention module; INSPECTION;
D O I
10.3389/fnbot.2022.1042780
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Using deep learning-based methods to detect surface defects in strip steel can reduce the impact of human factors and lower costs while maintaining accuracy and efficiency. However, the main disadvantages of this method is the inability to tradeoff accuracy and efficiency. In addition, the low proportion of valid information and the lack of distinctive features result in a high rate of missed detection of small objects. In this paper, we propose a lightweight YOLOX surface defect detection network and introduce the Multi-scale Feature Fusion Attention Module (MFFAM). Lightweight CSP structures are used to optimize the backbone of the original network. MFFAM uses different scales of receptive fields for feature maps of different resolutions, after which features are fused and passed into the spatial and channel attention modules in parallel. Experimental results show that lightweight CSP structures can improve the detection frame rate without compromising accuracy. MFFAM can significantly improve the detection accuracy of small objects. Compared with the initial YOLOX, the mAP and FPS were 81.21% and 82.87Hz, respectively, which was an improvement of 4.29% and 12.72Hz. Compared with existing methods, the proposed model has superior performance and practicality, verifying the effectiveness of the optimization method.
引用
收藏
页数:12
相关论文
共 48 条
[1]   Exploiting dynamic spatio-temporal correlations for citywide traffic flow prediction using attention based neural networks [J].
Ali, Ahmad ;
Zhu, Yanmin ;
Zakarya, Muhammad .
INFORMATION SCIENCES, 2021, 577 :852-870
[2]   A data aggregation based approach to exploit dynamic spatio-temporal correlations for citywide crowd flows prediction in fog computing [J].
Ali, Ahmad ;
Zhu, Yanmin ;
Zakarya, Muhammad .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) :31401-31433
[3]   Leveraging Spatio-Temporal Patterns for Predicting Citywide Traffic Crowd Flows Using Deep Hybrid Neural Networks [J].
Ali, Ahmad ;
Zhu, Yanmin ;
Chen, Qiuxia ;
Yu, Jiadi ;
Cai, Haibin .
2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, :125-132
[4]  
[Anonymous], 2012, World Academy of Science Engineering and Technology
[5]  
[Anonymous], 2014, Int. J. Innov. Res. Sci. Eng. Technol
[6]   Triplet-Graph Reasoning Network for Few-Shot Metal Generic Surface Defect Segmentation [J].
Bao, Yanqi ;
Song, Kechen ;
Liu, Jie ;
Wang, Yanyan ;
Yan, Yunhui ;
Yu, Han ;
Li, Xingjie .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
[7]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[8]   STDnet-ST: Spatio-temporal ConvNet for small object detection [J].
Bosquet, Brais ;
Mucientes, Manuel ;
Brea, Victor M. .
PATTERN RECOGNITION, 2021, 116 (116)
[9]   RetinaNet With Difference Channel Attention and Adaptively Spatial Feature Fusion for Steel Surface Defect Detection [J].
Cheng, Xun ;
Yu, Jianbo .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70 (70)
[10]  
Ge Z, 2021, Arxiv, DOI arXiv:2107.08430