RT-DETRmg: a lightweight real-time detection model for small traffic signs

Cited by: 3

Authors:

Wang, Yiqiao [1]

Chen, Jinling [1]

Yang, Bo [2]

Chen, Yu [1]

Su, Yanlin [1]

Liu, Rong [1]

Affiliations:

[1] Southwest Petr Univ, Chengdu 610500, Sichuan, Peoples R China

[2] State Grid Sichuan Informat & Telecommun Co, Chengdu 610095, Sichuan, Peoples R China

Source:

JOURNAL OF SUPERCOMPUTING | 2025 / Vol. 81 / Issue 01

Keywords:

RT-DETR; Traffic sign detection; Small object detection; Feature fusion

DOI:

10.1007/s11227-024-06800-8

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Subject classification code:

0812

Abstract:

In intelligent transportation systems, real-time detection performance and accuracy are essential metrics. This paper proposes a lightweight real-time detection model, RT-DETRmg, to address the challenges of false and missed detections of small traffic signs and to improve the algorithm's real-time performance. RT-DETRmg enhances the multi-scale feature extraction capability of the RT-DETR backbone network by incorporating a Multiple Scale Sequence Fusion module, which effectively integrates global and local semantic information from different scales of images. Additionally, a cascaded group attention module is utilized within an efficient hybrid encoder to reduce computational complexity, thereby enhancing real-time performance. To further optimize small object detection, a small receptive field feature layer is introduced, while a large receptive field feature layer is removed. Experimental results on the TT100K and GTSDB datasets demonstrate the superiority of RT-DETRmg over existing models. On the TT100K dataset, RT-DETRmg achieves a 2.0% improvement in mean average precision and a 6.6% increase in frames per second compared to the baseline RT-DETR model, while reducing model parameters and computational complexity. On the GTSDB dataset, RT-DETRmg further demonstrates its strong generalization ability, achieving a 2.2% improvement in the F1 score and a 1.7% increase in mean average precision compared to the baseline network. These findings highlight the effectiveness of RT-DETRmg in enhancing both detection accuracy and real-time performance of small traffic signs in diverse scenarios.
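The abstract reports gains in F1 score and frames per second. As a reminder of how such figures are usually derived, here is a minimal sketch. The detection counts and frame rates are hypothetical and are not taken from the paper, and the helper names `f1_score` and `relative_increase` are illustrative. The abstract does not say whether its percentages are absolute points or relative changes; the sketch assumes a relative change for the frame rate.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Return F1, the harmonic mean of precision and recall, from detection counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def relative_increase(baseline: float, improved: float) -> float:
    """Return the relative change from baseline to improved, in percent."""
    return (improved - baseline) / baseline * 100.0


# Hypothetical values for illustration only (not results from the paper).
print(round(f1_score(tp=80, fp=20, fn=20), 3))
print(round(relative_increase(baseline=100.0, improved=106.6), 1))
```

With equal precision and recall, F1 equals that shared value, which makes the first call easy to check by hand.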

Pages: 18

References: 31 in total

