Transformer Fusion and Residual Learning Group Classifier Loss for Long-Tailed Traffic Sign Detection

被引:3
作者
Zeng, Guanjie [1 ]
Huang, Weiguo [1 ]
Wang, Yinjie [1 ]
Wang, Xiang [1 ]
E, Wenjuan [1 ]
机构
[1] Soochow Univ, Sch Rail Transportat, Suzhou 215131, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Head; Transformers; Fuses; Rail transportation; YOLO; Task analysis; Automatic traffic sign detection and recognition (ATDR); long-tailed detection; small object detection; RECOGNITION;
D O I
10.1109/JSEN.2024.3360408
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automatic traffic sign detection and recognition (ATDR) plays a crucial role as a submodule of the intelligent transportation system (ITS) and has gained significant attention in recent years. However, ATDR faces challenges due to two main factors: the small size of traffic signs in images and the long-tailed distribution of sign categories in the real world (i.e.,only a few categories have sufficient samples, and most categories have insufficient samples). These challenges result in a decline in the performance of existing detection frameworks. In this article, we propose a novel framework for traffic sign detection, specifically designed to tackle the challenges posed by these issues. Our approach includes an efficient adaptive and transformed spatial feature fusion module (ATS2F) for detecting small traffic signs. This module adaptively learns feature map associations to extract and fuse meaningful features through cross-space and cross-scale interactions at different levels. In addition, to tackle the long-tailed traffic sign problem, we have designed a residual learning group classifier loss (RLGCL), by dividing the traffic signs into two levels, reserving a specific capacity for the long-tailed category from the perspective of the parameter space, the residual fusion mechanism is introduced to enhance and optimize the tail category image. Our experimental results, obtained from rigorous evaluations on the Tsinghua-Tencent 100K (TT100K) and GTSDB datasets, demonstrate the effectiveness of our approach. The improvements achieved in small traffic sign detection and long-tailed detection underscore the significance of our framework in advancing ATDR research.
引用
收藏
页码:10551 / 10560
页数:10
相关论文
共 43 条
  • [1] DFR-TSD: A Deep Learning Based Framework for Robust Traffic Sign Detection Under Challenging Weather Conditions
    Ahmed, Sabbir
    Kamal, Uday
    Hasan, Md. Kamrul
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) : 5150 - 5162
  • [2] A systematic study of the class imbalance problem in convolutional neural networks
    Buda, Mateusz
    Maki, Atsuto
    Mazurowski, Maciej A.
    [J]. NEURAL NETWORKS, 2018, 106 : 249 - 259
  • [3] Traffic Sign Recognition Based on SIFT Features
    Cai, Nian
    Liang, Wenzhao
    Xu, Shaoqiu
    Li, Fangzhen
    [J]. NANOTECHNOLOGY AND COMPUTER ENGINEERING, 2010, 121-122 : 596 - +
  • [4] Cascade R-CNN: Delving into High Quality Object Detection
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
  • [5] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
    Chen, Long
    Zhang, Hanwang
    Xiao, Jun
    Nie, Liqiang
    Shao, Jian
    Liu, Wei
    Chua, Tat-Seng
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6298 - 6306
  • [6] Cheng J., 2016, EMNLP, P551, DOI [DOI 10.18653/V1/D16-1053, 10.18653/v1/D16-1053]
  • [7] TRD-YOLO: A Real-Time, High-Performance Small Traffic Sign Detection Algorithm
    Chu, Jinqi
    Zhang, Chuang
    Yan, Mengmeng
    Zhang, Haichao
    Ge, Tao
    [J]. SENSORS, 2023, 23 (08)
  • [8] ResLT: Residual Learning for Long-Tailed Recognition
    Cui, Jiequan
    Liu, Shu
    Tian, Zhuotao
    Zhong, Zhisheng
    Jia, Jiaya
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3695 - 3706
  • [9] Traffic sign recognition and analysis for intelligent vehicles
    de la Escalera, A
    Armingol, JM
    Mata, M
    [J]. IMAGE AND VISION COMPUTING, 2003, 21 (03) : 247 - 258
  • [10] Dong Zhang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12373), P323, DOI 10.1007/978-3-030-58604-1_20