Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion

Cited by: 8
Authors
Hu, Mingdi [1, 2]
Bai, Long [1, 2]
Fan, Jiulun [1, 2]
Zhao, Sirui [3]
Chen, Enhong [3]
Affiliations
[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Xian 710121, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Artificial Intelligence, Xian 710121, Peoples R China
[3] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
vehicle color recognition; benchmark dataset; multi-scale feature fusion; long-tail distribution; improved smooth L1 loss; CLASSIFICATION
DOI
10.1007/s11704-022-1389-x
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Vehicle color recognition (VCR) plays a vital role in intelligent traffic management and criminal investigation assistance. However, existing vehicle color datasets cover only 13 classes, which cannot meet current practical demand. Moreover, although much effort has been devoted to VCR, existing methods suffer from class imbalance in the datasets. To address these challenges, we propose a novel VCR method based on a Smooth Modulation Neural Network with Multi-Scale Feature Fusion (SMNN-MSFF). Specifically, to build a benchmark for model training and evaluation, we first present a new VCR dataset with 24 vehicle color classes, Vehicle Color-24, consisting of 10091 vehicle images from 100 hours of urban road surveillance video. Then, to tackle the long-tail distribution and improve recognition performance, we propose the SMNN-MSFF model, which combines multi-scale feature fusion and smooth modulation. The former extracts feature information from local to global scales, and the latter increases the loss on images of tail-class instances when training with class imbalance. Finally, comprehensive experimental evaluation on Vehicle Color-24 and three previous representative datasets demonstrates that the proposed SMNN-MSFF outperforms state-of-the-art VCR methods. Extensive ablation studies also show that each module of our method is effective; in particular, smooth modulation efficiently helps feature learning for the minority (tail) classes. Vehicle Color-24 and the SMNN-MSFF code are publicly available and can be obtained by contacting the authors.
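The abstract does not give the form of the smooth modulation loss, so the following is only a generic illustration of the underlying idea: making tail-class samples contribute more to the training loss. It is a minimal sketch that uses inverse-frequency class weights with cross-entropy, which is a common substitute technique and not the authors' method. The color names, label counts, and function names are hypothetical.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels):
    """Weight each class by N / (K * count), so rarer classes get larger weights.

    N is the number of samples and K the number of distinct classes.
    """
    counts = Counter(labels)
    n, k = len(labels), len(counts)
    return {cls: n / (k * count) for cls, count in counts.items()}


def weighted_cross_entropy(prob_true_class, label, weights):
    """Cross-entropy for one sample, scaled by the weight of its true class."""
    return -weights[label] * math.log(prob_true_class)


# Hypothetical long-tailed label set: one head class and two rarer classes.
labels = ["white"] * 8 + ["black"] * 3 + ["purple"] * 1
weights = inverse_frequency_weights(labels)

# Same predicted probability for the true class, different class frequency.
head_loss = weighted_cross_entropy(0.5, "white", weights)
tail_loss = weighted_cross_entropy(0.5, "purple", weights)

print(weights)
print(head_loss, tail_loss)
```

With equal prediction confidence, the sample from the rare class receives a larger loss than the sample from the frequent class, which is the effect the abstract attributes to smooth modulation; how the paper actually shapes that increase is not stated here.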
Pages: 12