RT-DETRmg: a lightweight real-time detection model for small traffic signs

被引:3
作者
Wang, Yiqiao [1 ]
Chen, Jinling [1 ]
Yang, Bo [2 ]
Chen, Yu [1 ]
Su, Yanlin [1 ]
Liu, Rong [1 ]
机构
[1] Southwest Petr Univ, Chengdu 610500, Sichuan, Peoples R China
[2] State Grid Sichuan Informat & Telecommun Co, Chengdu 610095, Sichuan, Peoples R China
关键词
RT-DETR; Traffic sign detection; Small object detection; Feature fusion;
D O I
10.1007/s11227-024-06800-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In intelligent transportation systems, real-time detection performance and accuracy are essential metrics. This paper proposes a lightweight real-time detection model, RT-DETRmg, to address the challenges of false and missed detections of small traffic signs and to improve the algorithm's real-time performance. RT-DETRmg enhances the multi-scale feature extraction capability of the RT-DETR backbone network by incorporating a Multiple Scale Sequence Fusion module, which effectively integrates global and local semantic information from different scales of images. Additionally, a cascaded group attention module is utilized within an efficient hybrid encoder to reduce computational complexity, thereby enhancing real-time performance. To further optimize small object detection, a small receptive field feature layer is introduced, while a large receptive field feature layer is removed. Experimental results on the TT100K and GTSDB datasets demonstrate the superiority of RT-DETRmg over existing models. On the TT100K dataset, RT-DETRmg achieves a 2.0% improvement in mean average precision and a 6.6% increase in frames per second compared to the baseline RT-DETR model, while reducing model parameters and computational complexity. On the GTSDB dataset, RT-DETRmg further demonstrates its strong generalization ability, achieving a 2.2% improvement in the F1 score and a 1.7% increase in mean average precision compared to the baseline network. These findings highlight the effectiveness of RT-DETRmg in enhancing both detection accuracy and real-time performance of small traffic signs in diverse scenarios.
引用
收藏
页数:18
相关论文
共 31 条
[1]   Review of deep learning: concepts, CNN architectures, challenges, applications, future directions [J].
Alzubaidi, Laith ;
Zhang, Jinglan ;
Humaidi, Amjad J. ;
Al-Dujaili, Ayad ;
Duan, Ye ;
Al-Shamma, Omran ;
Santamaria, J. ;
Fadhel, Mohammed A. ;
Al-Amidie, Muthana ;
Farhan, Laith .
JOURNAL OF BIG DATA, 2021, 8 (01)
[2]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
[3]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[4]   Road traffic sign detection and classification [J].
delaEscalera, A ;
Moreno, LE ;
Salichs, MA ;
Armingol, JM .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 1997, 44 (06) :848-859
[5]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[6]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[7]  
Han K, 2021, ADV NEUR IN
[8]   A Survey on Vision Transformer [J].
Han, Kai ;
Wang, Yunhe ;
Chen, Hanting ;
Chen, Xinghao ;
Guo, Jianyuan ;
Liu, Zhenhua ;
Tang, Yehui ;
Xiao, An ;
Xu, Chunjing ;
Xu, Yixing ;
Yang, Zhaohui ;
Zhang, Yiman ;
Tao, Dacheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :87-110
[9]  
He Kaiming., P IEEE INT C COMPUTE, P2961
[10]  
Joseph RK, 2016, CRIT POL ECON S ASIA, P1