CFF-YOLO: cross-space feature fusion based YOLO model for screw detection in vehicle chassis

被引:2
作者
Xu, Haixia [1 ]
Ding, Fanxun [1 ]
Zhou, Wei [1 ]
Han, Feng [1 ]
Liu, Yanbang [1 ]
Zhu, Jiang [1 ]
机构
[1] Xiangtan Univ, Sch Automat & Elect Informat, Xiangtan 411100, Peoples R China
关键词
Deep learning; Small object detection; Yolov5; Feature Fusion;
D O I
10.1007/s11760-024-03474-w
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Proper installation of chassis screws is critical for vehicle quality and safety. With the widespread application of the YOLO model in the industry, we propose a Cross-space Feature Fusion based on the YOLO model for screw detection in vehicle chassis, named CFF-YOLO. We design a Cross-space Feature Fusion (CFF) module to adaptively aggregate features at different scales and correlate the low-level with high-level feature maps. According to the property of screw objects with the same scale, we modify the Yolov5 to accelerate inference speed by keeping one detection head while removing the unimportant network pathways and the other two detection heads. Besides, we design a wide-range, multi-camera line-scan imaging method to capture the same scale of screws in the whole chassis and create a custom vehicle chassis dataset (VCD). Experimental results on dataset VCD show that our proposed CFF-YOLO only takes 6.2 ms to detect one image and merely 781.2 ms to inspect an entire vehicle chassis, and outperforms Yolov5s and Yolov8n in mean Average Precision (mAP) by 6.3% and 2.1%, reaching 81.0% mAP respectively. Our proposed CFF-YOLO achieves a good trade-off between accuracy and speed.
引用
收藏
页码:8537 / 8546
页数:10
相关论文
共 46 条
[1]  
[Anonymous], 2022, ISCYY YOLOAIR MAKES
[2]  
Berg A.C, 2017, ARXIV PREPRINT ARXIV
[3]  
Bochkovskiy A, 2020, PREPRINT, DOI 10.48550/ARXIV.2004.10934
[4]   GhostViT: Expediting Vision Transformers via Cheap Operations [J].
Cao H. ;
Qu Z. ;
Chen G. ;
Li X. ;
Thiele L. ;
Knoll A. .
IEEE Transactions on Artificial Intelligence, 2024, 5 (06) :2517-2525
[5]   Autonomous Structural Visual Inspection Using Region-Based Deep Learning for Detecting Multiple Damage Types [J].
Cha, Young-Jin ;
Choi, Wooram ;
Suh, Gahyun ;
Mahmoudkhani, Sadegh ;
Buyukozturk, Oral .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2018, 33 (09) :731-747
[6]   TDD-net: a tiny defect detection network for printed circuit boards [J].
Ding, Runwei ;
Dai, Linhui ;
Li, Guangpeng ;
Liu, Hong .
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2019, 4 (02) :110-116
[7]  
Feng C, 2017, COMPUTING IN CIVIL ENGINEERING 2017: INFORMATION MODELLING AND DATA ANALYTICS, P298
[8]  
Ge Z., ARXIV
[9]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[10]  
Glenn J., 2021, YOLOV3 SPPV90