Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion

被引:8
|
作者
Hu, Mingdi [1 ,2 ]
Bai, Long [1 ,2 ]
Fan, Jiulun [1 ,2 ]
Zhao, Sirui [3 ]
Chen, Enhong [3 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Xian 710121, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Artificial Intelligence, Xian 710121, Peoples R China
[3] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
vehicle color recognition; benchmark dataset; multi-scale feature fusion; long-tail distribution; improved smooth l1 loss; CLASSIFICATION;
D O I
10.1007/s11704-022-1389-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vehicle Color Recognition (VCR) plays a vital role in intelligent traffic management and criminal investigation assistance. However, the existing vehicle color datasets only cover 13 classes, which can not meet the current actual demand. Besides, although lots of efforts are devoted to VCR, they suffer from the problem of class imbalance in datasets. To address these challenges, in this paper, we propose a novel VCR method based on Smooth Modulation Neural Network with Multi-Scale Feature Fusion (SMNN-MSFF). Specifically, to construct the benchmark of model training and evaluation, we first present a new VCR dataset with 24 vehicle classes, Vehicle Color-24, consisting of 10091 vehicle images from a 100-hour urban road surveillance video. Then, to tackle the problem of long-tail distribution and improve the recognition performance, we propose the SMNN-MSFF model with multi-scale feature fusion and smooth modulation. The former aims to extract feature information from local to global, and the latter could increase the loss of the images of tail class instances for training with class-imbalance. Finally, comprehensive experimental evaluation on Vehicle Color-24 and previously three representative datasets demonstrate that our proposed SMNN-MSFF outperformed state-of-the-art VCR methods. And extensive ablation studies also demonstrate that each module of our method is effective, especially, the smooth modulation efficiently help feature learning of the minority or tail classes. Vehicle Color-24 and the code of SMNN-MSFF are publicly available and can contact the author to obtain.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] UAV reaction detection based on multi-scale feature fusion
    He, Jianfeng
    Liu, Ming
    Yu, Chuanjiang
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 640 - 643
  • [32] A lightweight cross-scale feature fusion enhanced multi-level recurrent convolutional neural network for automatic modulation recognition
    Zhang, Ning
    Wang, Deqiang
    Wang, Ling
    DIGITAL SIGNAL PROCESSING, 2025, 158
  • [33] Construction of a Semantic Segmentation Network for the Overhead Catenary System Point Cloud Based on Multi-Scale Feature Fusion
    Xu, Tao
    Gao, Xianjun
    Yang, Yuanwei
    Xu, Lei
    Xu, Jie
    Wang, Yanjun
    REMOTE SENSING, 2022, 14 (12)
  • [34] MS-CLSTM: Myoelectric Manipulator Gesture Recognition Based on Multi-Scale Feature Fusion CNN-LSTM Network
    Wang, Ziyi
    Huang, Wenjing
    Qi, Zikang
    Yin, Shuolei
    BIOMIMETICS, 2024, 9 (12)
  • [35] Human pose estimation based on feature enhancement and multi-scale feature fusion
    Dandan Cao
    Weibin Liu
    Weiwei Xing
    Xiang Wei
    Signal, Image and Video Processing, 2023, 17 : 643 - 650
  • [36] AMFF-Net: An attention-based multi-scale feature fusion network for allergic pollen detection
    Li, Jianqiang
    Wang, Quanzeng
    Xiong, Chengyao
    Zhao, Linna
    Cheng, Wenxiu
    Xu, Xi
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [37] Hyperspectral image classification based on octave convolution and multi-scale feature fusion
    Li, Zhiyong
    Wen, Bo
    Luo, Yunzhong
    Li, Qiaochu
    Song, Lulu
    PRECISION ENGINEERING-JOURNAL OF THE INTERNATIONAL SOCIETIES FOR PRECISION ENGINEERING AND NANOTECHNOLOGY, 2022, 75 : 80 - 94
  • [38] FEATURE EXTRACTION OF GYMNASTICS IMAGES BASED ON MULTI-SCALE FEATURE FUSION ALGORITHM
    Tian, Kun
    Xia, Qionghua
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (05): : 3394 - 3407
  • [39] CMP-UNet: A Retinal Vessel Segmentation Network Based on Multi-Scale Feature Fusion
    Gu, Yanan
    Cao, Ruyi
    Wang, Dong
    Lu, Bibo
    ELECTRONICS, 2023, 12 (23)
  • [40] Multi-Scale Feature Fusion with Attention Mechanism Based on CGAN Network for Infrared Image Colorization
    Ai, Yibo
    Liu, Xiaoxi
    Zhai, Haoyang
    Li, Jie
    Liu, Shuangli
    An, Huilong
    Zhang, Weidong
    APPLIED SCIENCES-BASEL, 2023, 13 (08):