L-SSD: lightweight SSD target detection based on depth-separable convolution

被引:9
|
作者
Wang, Huilin [1 ]
Qian, Huaming [1 ]
Feng, Shuai [1 ]
Wang, Wenna [1 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Asymmetric spatial attention; Feature fusion; Improved BiFPN; Local-global feature extraction; Lightweight target detection; MobileNetv2;
D O I
10.1007/s11554-024-01413-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The current target detection algorithm based on deep learning has many redundant convolution calculations, which are difficult to apply to low-energy mobile devices, such as intelligent inspection robots and automatic driving. To solve this problem, we propose a lightweight target detection algorithm, L-SSD, based on depth-separable convolution. First, we chose the lightweight network MobileNetv2 as the backbone feature extraction network, and we proposed an upsampling feature fusion module (UFFM) to fuse the output feature maps of MobileNetv2. Deep semantic information is introduced into the shallow feature map to improve the feature extraction capability while reducing the complexity of the model. Second, we propose a local-global feature extraction module (LGFEM), which uses LGFEM to generate five additional feature layers to expand the feature map's receptive field and improve the model's detection accuracy. Then, we use an improved weighted bidirectional feature pyramid (BiFPN) for feature fusion to construct a new feature pyramid that fully utilizes the feature information between different layers. Finally, we propose asymmetric spatial attention (ASA) that enhances the expression ability of the features before BiFPN feature fusion, providing good positional information for the feature pyramid. Experimental results on the PASCAL VOC and MS COCO datasets show that the model parameters and model complexity of L-SSD are reduced by 85.9% and 96.1%, respectively, compared to SSD. A detection speed of 106 frames per second was achieved in NVIDIA GeForce RTX 3060 with detection accuracies of 73.8% and 22.4%, respectively. The optimal balance of model parameters, model complexity, detection accuracy, and speed are achieved.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] L-SSD: lightweight SSD target detection based on depth-separable convolution
    Huilin Wang
    Huaming Qian
    Shuai Feng
    Wenna Wang
    Journal of Real-Time Image Processing, 2024, 21
  • [2] Lightweight Bridge Crack Detection Method Based on SegNet and Bottleneck Depth-Separable Convolution With Residuals
    Zheng, Xuan
    Zhang, Shuailong
    Li, Xue
    Li, Gang
    Li, Xiyuan
    IEEE ACCESS, 2021, 9 : 161649 - 161668
  • [3] Research on Lightweight Method of Insulator Target Detection Based on Improved SSD
    Zeng, Bing
    Zhou, Yu
    He, Dilin
    Zhou, Zhihao
    Hao, Shitao
    Yi, Kexin
    Li, Zhilong
    Zhang, Wenhua
    Xie, Yunmin
    SENSORS, 2024, 24 (18)
  • [4] A Mini-UAV Lightweight Target Detection Model Based on SSD
    Zhang, JiaHui
    Xie, RongLei
    Meng, ZhiJun
    Li, Gen
    Xin, ShuLin
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2999 - 3013
  • [5] Ssd-kdgan: a lightweight SSD target detection method based on knowledge distillation and generative adversarial networks
    Wang, Huilin
    Qian, Huaming
    Feng, Shuai
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (16): : 23544 - 23564
  • [6] CAL-SSD: lightweight SSD object detection based on coordinated attention
    Zhong, Xin
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [7] The Multi-Scale Depth-Separable Convolution Network for Fire and Smoke Detection
    Yan, Huihui
    Cui, Zhihua
    Zhao, Haotian
    Zhang, Jingbo
    Qin, Juanjuan
    Guo, Qian
    COMBUSTION SCIENCE AND TECHNOLOGY, 2024,
  • [8] Traffic Pedestrian Detection Algorithm based on Lightweight SSD
    Huang, JiaBao
    Cai, Qiong
    Chen, Yu
    Huang, QianQian
    Li, Fang
    THIRD INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION; NETWORK AND COMPUTER TECHNOLOGY (ECNCT 2021), 2022, 12167
  • [9] Compression algorithm for live face recognition model based on depth-separable convolution
    Wei, Jinhu
    Zhou, Yan
    Xie, Wei
    Yu, JinWei
    Li, WeiSheng
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1624 - 1628
  • [10] FD-SSD: An improved SSD object detection algorithm based on feature fusion and dilated convolution
    Yin, Qunjie
    Yang, Wenzhu
    Ran, Mengying
    Wang, Sile
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 98